Skip to main content
POST
/
parse_async
Async Parse
import requests

url = "https://platform.reducto.ai/parse_async"

payload = {
    "async": { "priority": False },
    "input": "<string>",
    "enhance": {
        "agentic": [],
        "summarize_figures": True
    },
    "retrieval": {
        "chunking": { "chunk_mode": "disabled" },
        "filter_blocks": [],
        "embedding_optimized": False
    },
    "formatting": {
        "add_page_markers": False,
        "table_output_format": "dynamic",
        "merge_tables": False,
        "include": []
    },
    "spreadsheet": {
        "split_large_tables": {
            "enabled": True,
            "size": 50
        },
        "include": [],
        "clustering": "accurate",
        "exclude": []
    },
    "settings": {
        "ocr_system": "standard",
        "force_url_result": False,
        "return_ocr_data": False,
        "return_images": [],
        "embed_pdf_metadata": False,
        "persist_results": False,
        "timeout": 900
    }
}
headers = {
    "Authorization": "Bearer <token>",
    "Content-Type": "application/json"
}

response = requests.post(url, json=payload, headers=headers)

print(response.json())
{
  "job_id": "<string>"
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json
input
required

The URL of the document to be processed. You can provide one of the following: 1. A publicly available URL 2. A presigned S3 URL 3. A reducto:// prefixed URL obtained from the /upload endpoint after directly uploading a document 4. A jobid:// prefixed URL obtained from a previous /parse invocation

async
object

The configuration options for asynchronous processing (default synchronous).

enhance
object
retrieval
object
formatting
object
spreadsheet
object
settings
object

Response

Successful Response

job_id
string
required
I