Knowledge Base API
Read-only API endpoints for accessing PDF tagging reference data in both human-friendly Markdown and machine-friendly JSON formats
Base URL
https://taggedpdfschool.pdfa.orgResponse Formats
Authentication
Currently unauthenticated (read-only). All endpoints are publicly accessible.
Available Endpoints
Human-friendly format for documentation
GET https://taggedpdfschool.pdfa.org/api/db/md/tagsOpenExample Usage:
curl https://taggedpdfschool.pdfa.org/api/db/md/tagsHuman-friendly format for documentation
GET https://taggedpdfschool.pdfa.org/api/db/md/structureRelationshipsOpenExample Usage:
curl https://taggedpdfschool.pdfa.org/api/db/md/structureRelationshipsHuman-friendly format for documentation
GET https://taggedpdfschool.pdfa.org/api/db/md/attributesOpenExample Usage:
curl https://taggedpdfschool.pdfa.org/api/db/md/attributesHuman-friendly format for documentation
GET https://taggedpdfschool.pdfa.org/api/db/md/propertiesOpenExample Usage:
curl https://taggedpdfschool.pdfa.org/api/db/md/propertiesHuman-friendly format for documentation
GET https://taggedpdfschool.pdfa.org/api/db/md/matterhornOpenExample Usage:
curl https://taggedpdfschool.pdfa.org/api/db/md/matterhornResponses are generated at request time from the in-app data layer bundled at build time
Structure relationships are a computed aggregation describing parent/child tag constraints
If you add or modify the underlying JSON assets, rebuild/redeploy the app to refresh endpoint output
All endpoints support standard HTTP methods and return appropriate status codes
What is MCP?
The Model Context Protocol (MCP) is an open standard that enables AI assistants and development tools to access external data sources and tools. Our MCP server provides programmatic access to all PDF tagging reference data, allowing AI assistants like Cursor IDE and GitHub Copilot to query tags, attributes, properties, and Matterhorn Protocol checkpoints.
Configuration
To use the MCP server, add the following configuration to your IDE's MCP settings:
Cursor IDE
Add this configuration to your Cursor MCP settings file:
{
"mcpServers": {
"tagged-pdf-school": {
"url": "https://taggedpdfschool.pdfa.org/api/mcp",
"transport": "http"
}
}
}Location: ~/.cursor/mcp.json (macOS/Linux) or %APPDATA%\Cursor\User\globalStorage\mcp.json (Windows), or via Settings → Features → Model Context Protocol
GitHub Copilot
Add this configuration to your GitHub Copilot MCP settings:
{
"mcpServers": {
"tagged-pdf-school": {
"url": "https://taggedpdfschool.pdfa.org/api/mcp",
"transport": "http"
}
}
}Location: Check GitHub Copilot documentation for MCP configuration file location
Available Tools
The MCP server provides seven powerful tools for querying our database:
- get_attribute - Get attribute details by name
- get_tag - Get tag information and specifications
- get_property - Get property definitions and values
- get_checkpoint - Get Matterhorn Protocol checkpoint details
- search_attributes - Search attributes by keyword
- search_tags - Search tags by keyword
- get_all_databases - Get summary of all available databases
How to Use
After configuring the MCP server, restart your IDE. You can then ask your AI assistant questions like: 'Get information about the Document tag', 'Search for attributes related to borders', or 'What does checkpoint 01-001 check for?'. The AI assistant will use the MCP tools to query our database and provide accurate, up-to-date information.
Markdown Format
- Documentation pages
- Human-readable reference
- AI-readable reference
- Educational content
- Quick lookups
JSON Format
- Automated tooling
- Scripts and integrations
- Data processing
- API consumption