Artifact
| Description | Denotes content that is non-structural or decorative, such as background graphics or other content not intended to be read in the logical order. |
|---|---|
| Namespace | 2.0 |
| Category | grouping, block, inline |
Attributes
- Specifies how the element is placed relative to surrounding content (e.g., block-level or inline flow).
- Defines the direction of text flow (e.g., left-to-right, right-to-left, or vertical).
- Sets the background color for the element’s content area.
- Specifies the color of the border around the element.
- Indicates the style of the border (e.g., solid, dashed, dotted).
- Defines the thickness of the border line in user space units (such as points).
- Determines the space between the element’s border (or boundary) and its inner content.
- Applies the primary color (fill or stroke) for the text or graphic content.
- Main indicator of type. This semantic association allows tools to present and support interaction with the object in a manner that is consistent with user expectations about other objects of that type.
Differences
Well-Tagged PDF:
The 'Artifact' element in Well-Tagged PDF is used for content that is decorative or non-essential to the logical structure. It marks items that should be ignored in content extraction and reflow.
Artifacts must be clearly marked to distinguish them from meaningful content. They should be excluded from the logical reading order to prevent interference with content reusability.
PDF/UA:
In PDF/UA, the 'Artifact' element supports accessibility by identifying content that does not contribute to semantic meaning, such as decorative images or layout markers.
Artifacts must be properly tagged and excluded from the primary reading order, ensuring that assistive technologies can skip them and focus on meaningful content.
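As an illustration of what "marking" means at the content-stream level, the following minimal Python sketch wraps a fragment of page operators in an Artifact marked-content sequence (`/Artifact BMC ... EMC`, or `/Artifact << ... >> BDC ... EMC` when a property list is supplied). The helper name `mark_artifact` and its parameters are hypothetical and exist only for this example; it produces plain text and does not write a valid PDF file.

```python
from typing import Optional


def mark_artifact(content_ops: str,
                  artifact_type: Optional[str] = None,
                  subtype: Optional[str] = None) -> str:
    """Wrap content-stream operators in an Artifact marked-content sequence.

    Hypothetical helper for illustration only. Without a type it emits the
    property-less form (BMC); with a type it emits a property list (BDC).
    """
    if artifact_type is None:
        if subtype is not None:
            raise ValueError("a subtype requires an artifact type")
        return f"/Artifact BMC\n{content_ops}\nEMC"

    props = f"/Type /{artifact_type}"
    if subtype is not None:
        props += f" /Subtype /{subtype}"
    return f"/Artifact << {props} >> BDC\n{content_ops}\nEMC"


# Decorative content with no further classification.
decoration = mark_artifact("q 0.9 g 0 0 612 792 re f Q")

# A running page header classified as a pagination artifact.
header = mark_artifact(
    "BT /F1 9 Tf 72 760 Td (Annual Report) Tj ET",
    artifact_type="Pagination",
    subtype="Header",
)

print(decoration)
print(header)
```

The operators inside each example (`re`, `f`, `Tj` and so on) are ordinary PDF painting and text operators chosen as placeholders; the font name `/F1` and the coordinates are assumptions, not values taken from any real document.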
Use cases
Soft hyphen is artifacted
Dots representing a leader line in a table of contents are artifacted
A presentation of a non-interactive form field is artifacted
Artifacted label in list item
Tag Relationships
Permitted Parent Tags
Permitted Child Tags
Related Matterhorn Protocol checkpoints
- Artifact is tagged as real content.
- Real content is marked as artifact.
- Content marked as Artifact is present inside tagged content.
- Tagged content is present inside content marked as Artifact.
- Tags are not in logical reading order.
- Structure elements are nested in a semantically inappropriate manner (e.g., a table inside a heading).
- The structure type (after applying any role-mapping as necessary) of a structure element is not semantically appropriate.
- Headers and footers are not marked as pagination artifacts.
- Header or footer artifacts are not classified as Header or Footer subtypes.
- The appearance stream of a PrinterMark annotation is not marked as Artifact.
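The two nesting checkpoints above (Artifact content inside tagged content, and tagged content inside Artifact content) can be sketched as a stack walk over marked-content operators. The Python example below is a simplified model, not a validator: it assumes the content stream has already been reduced to a list of `(operator, tag, mcid)` events, and it treats any sequence carrying an MCID as "tagged content". The `Event` type and the function name `find_nesting_problems` are hypothetical.

```python
from typing import List, Optional, Tuple

# (operator, tag, mcid): operator is "BMC", "BDC" or "EMC".
# tag and mcid are None for "EMC"; mcid is None when no MCID is present.
Event = Tuple[str, Optional[str], Optional[int]]


def find_nesting_problems(events: List[Event]) -> List[str]:
    """Report Artifact/tagged-content nesting conflicts in a simplified event list."""
    problems: List[str] = []
    stack: List[str] = []  # kinds of the currently open sequences

    for index, (op, tag, mcid) in enumerate(events):
        if op in ("BMC", "BDC"):
            if tag == "Artifact":
                kind = "artifact"
            elif mcid is not None:
                kind = "tagged"
            else:
                kind = "other"

            if kind == "artifact" and "tagged" in stack:
                problems.append(f"event {index}: Artifact inside tagged content")
            elif kind == "tagged" and "artifact" in stack:
                problems.append(f"event {index}: tagged content inside Artifact")
            stack.append(kind)
        elif op == "EMC":
            if not stack:
                raise ValueError(f"event {index}: EMC without matching BMC/BDC")
            stack.pop()
        else:
            raise ValueError(f"event {index}: unknown operator {op!r}")

    if stack:
        raise ValueError("unclosed marked-content sequence")
    return problems


# A paragraph (MCID 0) that wrongly contains an Artifact sequence.
bad_events: List[Event] = [
    ("BDC", "P", 0),
    ("BMC", "Artifact", None),
    ("EMC", None, None),
    ("EMC", None, None),
]
print(find_nesting_problems(bad_events))
```

A real check would also need to parse the page content stream and consult the structure tree; this sketch only shows the nesting logic under the stated assumptions.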