Documents and Processing
How do I return the whole document as plain Markdown?
How do I return the whole document as plain Markdown?
Set chunking to Tables will be formatted according to your
disabled (the default). The entire document will be returned as a single chunk with all content in the content field as Markdown.table_output_format setting (default: dynamic, which uses Markdown for simple tables).What's the difference between content and embed fields?
What's the difference between content and embed fields?
Both fields contain the chunk’s text, but optimized for different purposes:
content: Raw extraction with original formatting. Tables appear as HTML or Markdown. Use for display.embed: Optimized for vector embeddings. Whenembedding_optimized: true, tables become natural language summaries like “This table shows quarterly revenue…” which embed better.
embed for your vector database, content for displaying results to users.See Understanding Chunks for details.How do I use Reducto for RAG?
How do I use Reducto for RAG?
- Parse with variable chunking to get semantically meaningful segments
- Enable embedding optimization so tables become natural language
- Filter noise like headers and footers
- Store chunks in your vector database
Which configurations increase latency?
Which configurations increase latency?
Adds significant latency:
enhance.agenticwith any scope (runs VLM passes)enhance.agentic[].advanced_chart_agent: true(detailed chart analysis)- Large documents with
embedding_optimized: true
settings.return_images(generates cropped images)settings.embed_pdf_metadata(modifies PDF)
- Chunking mode changes
- Table output format changes
- Block filtering
Can I choose which LLM or model Reducto uses?
Can I choose which LLM or model Reducto uses?
No. Reducto manages model selection internally to optimize for accuracy, cost, and latency. The models used may change as we improve the system.For on-premise deployments, you can configure which LLM providers are available. See LLM Configuration.
URLs and Retention
How long are result URLs valid?
How long are result URLs valid?
- Image URLs (
image_urlfromreturn_images): Valid for 1 hour - PDF URLs (
pdf_url): Valid for 1 hour - Result URLs (when
type: "url"): Valid for 1 hour
How long are job results retained?
How long are job results retained?
By default, job results are deleted after 12 hours per Reducto’s zero data retention (ZDR) policy.To keep results longer:With
persist_results: true, results are stored indefinitely and can be retrieved anytime using the job ID. This requires opting in to Reducto Studio.Why am I getting a URL instead of inline results?
Why am I getting a URL instead of inline results?
When the response exceeds approximately 6MB, Reducto returns To always get URL responses (for consistent handling):
result.type: "url" instead of result.type: "full". Fetch the content from result.url:Job ID not found error
Job ID not found error
Jobs are deleted after 12 hours per the zero data retention policy. If you’re looking for a job from more than 12 hours ago, it has been automatically deleted.To prevent this:
- Process results immediately when you receive them
- Store results in your own database
- Use
persist_results: trueto keep results indefinitely
API and Integration
What's the maximum file size?
What's the maximum file size?
- Direct upload (
/upload): 100MB - Presigned URL upload: 5GB
- URL passthrough: No limit (Reducto fetches the file)
How do I check system status?
How do I check system status?
Visit status.reducto.ai for real-time status of all Reducto services, uptime history, and incident reports.Subscribe to updates to get notified of any service disruptions.
How do I contact support?
How do I contact support?
- Email: [email protected]
- Slack: Available for enterprise customers
- Studio: Use the feedback button in Reducto Studio