🍋
Menu
Best Practice Beginner 2 min read 362 words

PDF/A for Long-Term Archiving: Standards and Best Practices

PDF/A is an ISO-standardized subset of PDF designed for digital preservation. Government agencies, legal firms, and healthcare organizations use PDF/A to ensure documents remain readable decades from now without special software.

Key Takeaways

  • Standard PDFs can reference external fonts, use proprietary encryption, embed JavaScript, and link to external resources.
  • For most archiving needs, PDF/A-2b is the best choice.
  • Every font used in the document must be fully embedded — not just referenced or subsetted.
  • After creating a PDF/A file, always validate it against the target standard.
  • ## Key Requirements ### Embedded Fonts Every font used in the document must be fully embedded — not just referenced or subsetted.

What Makes PDF/A Different

Standard PDFs can reference external fonts, use proprietary encryption, embed JavaScript, and link to external resources. Any of these dependencies can break over time — a font gets discontinued, an encryption standard becomes unsupported, or a linked resource disappears. PDF/A eliminates these risks by requiring complete self-containment.

PDF/A Conformance Levels

Level Standard Key Requirements
PDF/A-1a ISO 19005-1 Full text extraction, logical structure tags, Unicode mapping
PDF/A-1b ISO 19005-1 Visual reproduction only (no structure requirements)
PDF/A-2b ISO 19005-2 Adds JPEG 2000, transparency, layers, PDF attachments
PDF/A-2u ISO 19005-2 Like 2b plus Unicode text mapping
PDF/A-3b ISO 19005-3 Allows embedding any file type (XML, CSV, CAD)

Choosing the Right Level

For most archiving needs, PDF/A-2b is the best choice. It supports modern PDF features (transparency, JPEG 2000) while maintaining broad compatibility. Use PDF/A-1b when maximum compatibility with older validators is required. Choose PDF/A-3b when you need to embed source data files alongside the rendered document.

Key Requirements

Embedded Fonts

Every font used in the document must be fully embedded — not just referenced or subsetted. This ensures the text renders correctly even if the font is no longer available. Font embedding typically increases file size by 200-500 KB per font family.

No External Dependencies

PDF/A files cannot contain links to external content, embedded multimedia requiring external codecs, JavaScript, or executable content. All resources must be contained within the file itself.

Color Management

All colors must be specified in a device-independent color space (ICC profiles must be embedded). RGB and CMYK colors without profiles are not permitted — this ensures colors render consistently regardless of the viewing device.

Metadata

PDF/A requires XMP metadata including title, creator, creation date, and modification date. This metadata must be synchronized with the document's Info dictionary for PDF/A-1.

Validation

After creating a PDF/A file, always validate it against the target standard. Common issues that cause validation failures include: missing embedded fonts, RGB colors without ICC profiles, and JavaScript remnants from form fields.