split.run() method divides documents into sections based on descriptions you provide. You define what sections to look for, and Split identifies which pages belong to each section.
Basic Usage
Method Signature
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
input | str | Yes | File ID (reducto://...), URL, or jobid:// reference |
split_description | list[dict] | Yes | List of sections to identify, each with name, description, and optional partition_key |
parsing | dict | None | No | Parse configuration (page range, OCR settings) |
settings | dict | None | No | Split settings (e.g., table_cutoff) |
split_rules | str | None | No | Natural language prompt describing rules for splitting |
Split Description
Thesplit_description parameter is required. Each entry defines a section to find:
With Partition Key
Usepartition_key when a section type repeats multiple times and you want to group by a specific identifier:
partition_key is a string describing what identifier to look for (e.g., “account number”, “patient ID”, “invoice number”). Split will find all instances of that section and group them by the identifier value it finds in the document.
Split Rules
Thesplit_rules parameter is a natural language prompt that controls how pages are classified. The default rule allows pages to belong to multiple sections only at boundaries:
Parsing Configuration
Configure how the document is parsed before splitting:Response Structure
Split Object
Each split contains:name(str): The section name you definedpages(list[int]): Page numbers belonging to this section (1-indexed)conf(str): Confidence level ("high"or"low")partitions(list | None): When usingpartition_key, contains sub-sections with their ownname,pages, andconf
Error Handling
Complete Example
Chaining with Extract
A common pattern is to split a document then extract different schemas from each section:Best Practices
Write Clear Descriptions
Detailed section descriptions improve classification accuracy.
Use Partition Keys
Use
partition_key with a string identifier when sections repeat multiple times.Next Steps
- Learn about split configuration options
- Explore the async client for concurrent processing
- See chaining endpoints for Split + Extract workflows