Reducto is the agentic document platform for AI teams who need production-grade document processing at enterprise scale. It provides a complete toolkit for the full document lifecycle, from classification and extraction to editing and workflow orchestration, powered by custom in-house and leading frontier models. What you can build:Documentation Index
Fetch the complete documentation index at: https://docs.reducto.ai/llms.txt
Use this file to discover all available pages before exploring further.
- Automated intake pipelines that classify, split, and route documents without manual triage
- Structured data extraction from invoices, contracts, medical records, and any document type at scale
- Document generation and editing that fills forms, modifies templates, and produces new documents programmatically
- RAG-ready content pipelines with layout-aware chunking optimized for LLM consumption
- Multi-step workflows that chain classification, parsing, extraction, and editing into single API calls
Platform capabilities
Reducto covers the full lifecycle of document work. Each capability is available as a standalone API endpoint or composed into pipelines.Classify
Route documents by type before processing. Define categories in natural language.
Parse
Convert documents into structured JSON with text, tables, and figures, with layout-aware chunking.
Extract
Pull specific fields into structured JSON using a schema. Define what you need, get exactly that.
Split
Divide documents into logical sections using natural language descriptions.
Edit
Fill PDF forms and modify DOCX files programmatically with natural language instructions.
Pipelines
Chain multiple steps into reusable, single-call workflows deployed from Studio.
How to use Reducto
API & SDKs
For developers building automated pipelines. Available in Python, Node.js, Go, and REST.
Studio
For visual pipeline building. Configure, test, and deploy document workflows in your browser.
Performance
Reducto orchestrates a pipeline of specialized models, including custom in-house models and frontier VLMs, with agentic multipasses that correct errors iteratively. This architecture delivers accuracy on the long tail of real-world documents: handwritten forms, rotated pages, nested tables, multi-column layouts, and degraded scans. Every result links to the Studio citation viewer where you can inspect outputs against source documents at the bounding-box level.Built for production
SOC 2 Type II
Audited security controls.
HIPAA
Signed BAA available.
Zero Data Retention
Documents deleted within 24h.
- Deployment flexibility. SaaS, hybrid VPC, full VPC, or air-gapped on-premises
- Scale. 3B+ pages processed. Autoscaling with dedicated worker pools available.
- Security. Encryption at rest (AES-256) and in transit (TLS 1.2+). EU and AU data residency. Learn more →
- Support. Dedicated field engineering, custom model fine-tuning, and 24/7 oncall for Enterprise customers
Get started
API Quickstart
Parse your first document in 5 minutes with Python, Node.js, Go, or REST.
Studio Quickstart
Build and deploy an extraction pipeline visually, no code required.