Skip to main content
POST
/
split
Split
import requests

url = "https://platform.reducto.ai/split"

payload = {
    "input": "<string>",
    "parsing": {
        "enhance": {
            "agentic": [],
            "summarize_figures": True
        },
        "retrieval": {
            "chunking": { "chunk_mode": "disabled" },
            "embedding_optimized": False,
            "filter_blocks": []
        },
        "formatting": {
            "add_page_markers": False,
            "include": [],
            "merge_tables": False,
            "table_output_format": "dynamic"
        },
        "spreadsheet": {
            "clustering": "accurate",
            "exclude": [],
            "include": [],
            "split_large_tables": {
                "enabled": True,
                "size": 50
            }
        },
        "settings": {
            "embed_pdf_metadata": False,
            "force_url_result": False,
            "ocr_system": "standard",
            "persist_results": False,
            "return_images": [],
            "return_ocr_data": False,
            "timeout": 900
        }
    },
    "split_description": [
        {
            "name": "<string>",
            "description": "<string>",
            "partition_key": "<string>"
        }
    ],
    "split_rules": "Split the document into the applicable sections. Sections may only overlap at their first and last page if at all.",
    "settings": { "table_cutoff": "truncate" }
}
headers = {
    "Authorization": "Bearer <token>",
    "Content-Type": "application/json"
}

response = requests.post(url, json=payload, headers=headers)

print(response.json())
{
  "usage": {
    "num_pages": 123,
    "credits": 123
  },
  "result": {
    "section_mapping": {},
    "splits": [
      {
        "name": "<string>",
        "pages": [
          123
        ],
        "conf": "low",
        "partitions": [
          {
            "name": "<string>",
            "pages": [
              123
            ],
            "conf": "low"
          }
        ]
      }
    ]
  }
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json
input
required

The URL of the document to be processed. You can provide one of the following: 1. A publicly available URL 2. A presigned S3 URL 3. A reducto:// prefixed URL obtained from the /upload endpoint after directly uploading a document 4. A jobid:// prefixed URL obtained from a previous /parse invocation

split_description
SplitCategory · object[]
required

The configuration options for processing the document.

parsing
object

The configuration options for parsing the document. If you are passing in a jobid:// URL for the file, then this configuration will be ignored.

split_rules
string
default:Split the document into the applicable sections. Sections may only overlap at their first and last page if at all.

The prompt that describes rules for splitting the document.

settings
object

The settings for split processing.

Response

Successful Response

usage
object
required
result
object
required

The split result.

I