Overview - Reducto

Reducto is the agentic document platform for AI teams who need production-grade document processing at enterprise scale. It provides a complete toolkit for the full document lifecycle, from classification and extraction to editing and workflow orchestration, powered by custom in-house and leading frontier models. What you can build:

Automated intake pipelines that classify, split, and route documents without manual triage
Structured data extraction from invoices, contracts, medical records, and any document type at scale
Document generation and editing that fills forms, modifies templates, and produces new documents programmatically
RAG-ready content pipelines with layout-aware chunking optimized for LLM consumption
Multi-step workflows that chain classification, parsing, extraction, and editing into single API calls

Platform capabilities

Reducto covers the full lifecycle of document work. Each capability is available as a standalone API endpoint or composed into pipelines.

Classify

Route documents by type before processing. Define categories in natural language.

Parse

Convert documents into structured JSON with text, tables, and figures, with layout-aware chunking.

Extract

Pull specific fields into structured JSON using a schema. Define what you need, get exactly that.

Split

Divide documents into logical sections using natural language descriptions.

Edit

Fill PDF forms and modify DOCX files programmatically with natural language instructions.

Pipelines

Chain multiple steps into reusable, single-call workflows deployed from Studio.

Supports 30+ file types including PDFs, images, spreadsheets, presentations, and scanned documents.

How to use Reducto

API & SDKs

For developers building automated pipelines. Available in Python, Node.js, Go, and REST.

Studio

For visual pipeline building. Configure, test, and deploy document workflows in your browser.

Agent-ready tooling: Reducto integrates directly into AI agent workflows via the MCP server, CLI, and native SDKs.

Agent Guide

One-page reference for coding agents using Reducto.

MCP Server

Give agents Reducto tools directly in Claude Code, Codex, Cursor, and other MCP clients.

CLI

Parse local files and folders from a terminal with minimal setup.

Performance

Reducto orchestrates a pipeline of specialized models, including custom in-house models and frontier VLMs, with agentic multipasses that correct errors iteratively. This architecture delivers accuracy on the long tail of real-world documents: handwritten forms, rotated pages, nested tables, multi-column layouts, and degraded scans. Every result links to the Studio citation viewer where you can inspect outputs against source documents at the bounding-box level.

Built for production

SOC 2 Type II

Audited security controls.

HIPAA

Signed BAA available.

Zero Data Retention

Documents deleted within 24h.

Deployment flexibility. SaaS, hybrid VPC, full VPC, or air-gapped on-premises
Scale. 3B+ pages processed, with autoscaling for production workloads.
Security. Encryption at rest (AES-256) and in transit (TLS 1.2+). EU and AU data residency. Learn more →
Support. Dedicated field engineering, custom model fine-tuning, and 24/7 oncall for Enterprise customers

See Enterprise readiness →

Get started

API Quickstart

Parse your first document in 5 minutes with Python, Node.js, Go, or REST.

Studio Quickstart

Build and deploy an extraction pipeline visually, no code required.

​Platform capabilities