Skip to main content

PDF Tags Overview

PDF tags are structural elements that define the logical reading order and hierarchy of a document's content. They are essential for creating accessible PDFs that can be properly interpreted by screen readers and other assistive technologies.

Note on Artifacts

The Artifact tag documented here refers to the Artifact structure element (SE) introduced in PDF 2.0. This differs from traditional artifacts in PDF 1.7, which are marked content sequences (MCSs) that are not part of the structure tree. Traditional artifacts are handled differently and are not represented by tags in the structure tree. When working with PDF 2.0, Artifact SEs should be classified using Type and Subtype attributes.

Why Use PDF Tags?

  • Improve document accessibility for users with disabilities
  • Enable proper content reflow on different devices and screen sizes
  • Maintain logical reading order of content
  • Support better content extraction and repurposing
  • Meet accessibility compliance requirements

Tag Structure

PDF tags are organized in a hierarchical structure, similar to HTML. The structure typically includes:

Getting Started

Select a tag from the navigation menu on the left to learn more about its specific usage, attributes, and best practices. Each tag documentation includes examples and implementation notes.