Every Reducto endpoint exposes configuration options that control how documents are processed. This section covers all available configurations, from parse-level OCR and layout settings through extraction schemas and workflow orchestration.

Configuration by Endpoint

Parse converts documents into structured content. Options are grouped by purpose:
| Group | Purpose | Pages |
| --- | --- | --- |
| enhance | AI-powered accuracy | Agentic Modes, Chart Extraction |
| retrieval | RAG optimization | Chunking Methods |
| formatting | Detecting styling & output format | Table Formats, Additional Document Data |
| spreadsheet | Excel/CSV handling | Spreadsheet Processing |
| settings | Processing controls | Processing Settings, Page Ranges |
result = client.parse.run(
    input=upload,
    enhance={...},
    retrieval={...},
    formatting={...},
    spreadsheet={...},
    settings={...}
)
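Because each group is passed as its own keyword argument, a misspelled group name is easy to miss. The sketch below is one way to catch that before a request is sent. `build_parse_options` is a hypothetical helper, not part of the Reducto SDK; the group names come from the table above, and the final call is left commented out because it needs a configured `client` and `upload`.

```python
# Group names taken from the table above; the helper itself is hypothetical.
PARSE_GROUPS = ("enhance", "retrieval", "formatting", "spreadsheet", "settings")


def build_parse_options(**groups):
    """Return keyword arguments for client.parse.run, rejecting unknown groups."""
    unknown = sorted(set(groups) - set(PARSE_GROUPS))
    if unknown:
        raise ValueError(f"Unknown parse option group(s): {', '.join(unknown)}")
    # Drop groups left as None or empty so only configured groups are passed on.
    return {name: value for name, value in groups.items() if value}


options = build_parse_options(
    retrieval={"chunking": {"chunk_mode": "variable"}},
    formatting={"table_output_format": "dynamic"},
    settings=None,
)
# result = client.parse.run(input=upload, **options)
```

Here `settings=None` is dropped, so only `retrieval` and `formatting` would be passed on.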

Common Patterns

Variable chunking with embedding optimization for vector search:
result = client.parse.run(
    input=upload,
    retrieval={
        "chunking": {"chunk_mode": "variable", "chunk_size": 1000},
        "embedding_optimized": True
    },
    formatting={"table_output_format": "dynamic"}
)
Enable agentic mode for both text and tables:
result = client.parse.run(
    input=upload,
    enhance={
        "agentic": [{"scope": "text"}, {"scope": "table"}]
    }
)
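Since the two parse patterns above touch different groups, they can plausibly be combined into one call. The following is a minimal sketch of that idea using a plain recursive dictionary merge. `merge_configs` and the two preset names are hypothetical, and whether a given combination of options is accepted by the API is an assumption to check against the individual configuration pages.

```python
import copy

# Presets copied from the two parse patterns above.
VECTOR_SEARCH = {
    "retrieval": {
        "chunking": {"chunk_mode": "variable", "chunk_size": 1000},
        "embedding_optimized": True,
    },
    "formatting": {"table_output_format": "dynamic"},
}
AGENTIC = {"enhance": {"agentic": [{"scope": "text"}, {"scope": "table"}]}}


def merge_configs(base, override):
    """Recursively merge two config dicts; values in override win on conflict."""
    merged = copy.deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_configs(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


combined = merge_configs(VECTOR_SEARCH, AGENTIC)
# result = client.parse.run(input=upload, **combined)
```

The merge copies its inputs, so the presets can be reused and overridden per document without affecting each other.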
Array extraction with source locations for long documents:
result = client.extract.run(
    input=upload,
    instructions={"schema": schema},
    settings={
        "array_extract": True,
        "citations": {"enabled": True}
    }
)
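The extraction snippet above refers to a `schema` variable without defining it. As an illustration only, the sketch below assumes the schema is expressed as a JSON Schema object whose top-level property is an array, which is the shape array extraction would be aimed at. The invoice field names are hypothetical and not taken from the Reducto documentation.

```python
# Hypothetical invoice schema written as JSON Schema; all field names are illustrative.
schema = {
    "type": "object",
    "properties": {
        "line_items": {
            "type": "array",
            "description": "One entry per invoice line",
            "items": {
                "type": "object",
                "properties": {
                    "description": {"type": "string"},
                    "amount": {"type": "number"},
                },
                "required": ["description", "amount"],
            },
        }
    },
    "required": ["line_items"],
}
```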

Migrating from v2

If you’re using the legacy v2 configuration format, convert your configuration to the v3 format. See the Migration Guide for complete mapping tables and examples.