Skip to main content
POST
/
extract_async
Extract Async
import requests

url = "https://platform.reducto.ai/extract_async"

payload = {
    "async": { "priority": False },
    "input": "<string>",
    "parsing": {
        "enhance": {
            "agentic": [],
            "summarize_figures": True
        },
        "retrieval": {
            "chunking": { "chunk_mode": "disabled" },
            "embedding_optimized": False,
            "filter_blocks": []
        },
        "formatting": {
            "add_page_markers": False,
            "include": [],
            "merge_tables": False,
            "table_output_format": "dynamic"
        },
        "spreadsheet": {
            "clustering": "accurate",
            "exclude": [],
            "include": [],
            "split_large_tables": {
                "enabled": True,
                "size": 50
            }
        },
        "settings": {
            "embed_pdf_metadata": False,
            "force_url_result": False,
            "ocr_system": "standard",
            "persist_results": False,
            "return_images": [],
            "return_ocr_data": False,
            "timeout": 900
        }
    },
    "instructions": { "system_prompt": "Be precise and thorough." },
    "settings": {
        "include_images": False,
        "optimize_for_latency": False,
        "array_extract": False,
        "citations": {
            "enabled": True,
            "numerical_confidence": True
        }
    }
}
headers = {
    "Authorization": "Bearer <token>",
    "Content-Type": "application/json"
}

response = requests.post(url, json=payload, headers=headers)

print(response.json())
{
  "job_id": "<string>"
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json
input
required

The URL of the document to be processed. You can provide one of the following: 1. A publicly available URL 2. A presigned S3 URL 3. A reducto:// prefixed URL obtained from the /upload endpoint after directly uploading a document 4. A jobid:// prefixed URL obtained from a previous /parse invocation

async
object

The configuration options for asynchronous processing (default synchronous).

parsing
object

The configuration options for parsing the document. If you are passing in a jobid:// URL for the file, then this configuration will be ignored.

instructions
object

The instructions to use for the extraction.

settings
object

The settings to use for the extraction.

Response

Successful Response

job_id
string
required
I