Change Tracking & Comments

On this page

Change Tracking
Configuration
Output
PDF Comments
Configuration
Output

Extract underlined/strikethrough text with HTML markup and PDF comments with their locations.

Change Tracking

Add HTML tags around text formatting to detect document changes.

Configuration

{
  "document_url": "https://example.com/document.pdf",
  "options": {
    "extraction_mode": "hybrid"
  },
  "advanced_options": {
    "enable_change_tracking": true
  }
}

Requirements: Only works with hybrid or metadata extraction mode (not ocr).

Output

<change><u>underlined text</u></change> for underlined text
<change><s>deleted text</s></change> for strikethrough text
<change><s>old</s> <u>new</u></change> for change sequences

PDF Comments

Extract text annotations from PDF documents with their content and locations.

Configuration

{
  "document_url": "https://example.com/annotated.pdf",
  "advanced_options": {
    "read_comments": true
  }
}

Output

Comments include content and normalized bounding box coordinates:

{
  "content": "Review comment text",
  "bbox": [0.1, 0.2, 0.3, 0.4]
}

The bbox array contains [left, top, width, height] normalized to [0,1] relative to page dimensions.

Supported Languages Split

Get Started

Core Functions

Configurations

FAQ

Security and Privacy

On-Premise

Change Tracking & Comments

Change Tracking

Configuration

Output

PDF Comments

Configuration

Output

Get Started

Core Functions

Configurations

FAQ

Security and Privacy

On-Premise

​Change Tracking

​Configuration

​Output

​PDF Comments

​Configuration

​Output

Change Tracking

Configuration

Output

PDF Comments

Configuration

Output