# On-premise changelog

> Release notes for on-premise deployments of Reducto

* perf: Bound concurrent agentic image memory, preventing worker OOMs on heavy documents.
* fix: Resolve spurious `DocumentCorruptError` failures on DOCX files that reference external templates.
* feat: Added `/openapi-onprem-full.json` for on-prem deployments. This schema includes the full HTTP pod API surface, including routes hidden from the hosted public API reference.
* feat: Added on-prem deployment controls and worker/runtime reliability improvements for supported customer environments.
* fix: Improved job finalization, result streaming, document rendering, and customer-facing error handling for better stability under load.
* docs: Added RSS-ready metadata for on-prem changelog entries. The generated feed is available at [onprem-changelog-rss.reducto.ai/rss.xml](https://onprem-changelog-rss.reducto.ai/rss.xml).
* feat: Add `num_pages` field to the `/classify` API response.
* feat: Add `max` mode for agentic tables for higher-fidelity table extraction.
* fix: Fail jobs on OCR transient errors instead of silently returning an empty response.
* fix: More accurate HTTP error status codes so client-side validation failures return 4xx instead of 5xx.
* fix: Multipart S3 upload for result payloads over 100MB, with a new `OversizedResultError` for limit cases.
* chore: Removed legacy sandbox setup that is no longer used by supported on-prem workflows.
* fix: Improved `/jobs` source redaction and added release validation for `/version` interpolation.
* fix: Strengthened authentication header handling for webhook-related endpoints.
* docs: Added clearer guidance for the on-prem shared security model, securing Reducto deployments, and observability access controls. See [On-prem security model](/onprem/security_model), [Securing Reducto](/onprem/securing_reducto), and [Observability & Monitoring](/onprem/observability).
* feat: Extend the SIGUSR2 stack-trace dumper to the DB-queue worker pods (`reducto-worker`, `reducto-priority-worker`, `reducto-gpu-worker`).
Previously the handler was only installed on HTTP and streaq worker pods; this completes coverage so `kill -s USR2 ` produces a stderr stack dump on every Reducto pod type. See [Observability](/onprem/observability#pod-stack-trace-dumps-sigusr2).
* perf: Higher embed pool concurrency for improved throughput
* fix: Pre-resolve OCR before vision-model citations with stricter containment threshold for improved accuracy
* fix: Preserve 5xx semantics for PDF text extraction subprocess crashes
* perf: Faster spreadsheet processing + fix images on text-empty sheets
* perf: Higher PDF render pool concurrency with eager background respawn
* fix: Replace non-ASCII chars that break obfuscation build
* feat: Azure Vision OCR client migrated to the async aio SDK with hard wall-clock cancellation via `asyncio.wait_for`. The new `AZURE_VISION_TOTAL_CALL_TIMEOUT` (default 45s) closes the underlying aiohttp socket when it fires, so workers no longer wedge on slow Azure responses. Per-call connect/read timeouts and SDK retries remain configurable via `AZURE_VISION_CONNECTION_TIMEOUT`, `AZURE_VISION_READ_TIMEOUT`, and `AZURE_VISION_MAX_RETRIES`. See [LLM options → Azure Vision](/onprem/llm_options#azure-vision-ocr) for the full env var reference.
* feat: SIGUSR2 stack-trace dumper installed in HTTP gunicorn workers and streaq worker processes. `kubectl exec` into a pod and run `kill -s USR2 ` to print every thread's stack to stderr (visible via `kubectl logs`) for diagnosing hung tasks, wedged event loops, or contended thread pools. See [Observability](/onprem/observability#pod-stack-trace-dumps-sigusr2).
* feat: Kubernetes liveness watchdog + exec probe for `reducto-worker`, `reducto-priority-worker`, and `reducto-gpu-worker` pods. A pod is only restarted when the in-process watchdog reports an in-flight task running longer than `WORKER_STUCK_TASK_THRESHOLD_SEC` (default 1800s) or the watchdog itself stops heartbeating (event loop wedged). Idle workers are never restarted.
Configurable via `worker.livenessProbe.*` Helm values. See [Operations](/onprem/operations#worker-liveness-probe).
* feat: Hard-killable subprocess pool with per-renderer timeouts for PDF renders
* feat: Hard-killable subprocess pool for PDF flatten + embed timeouts
* fix: Skip zero-dimension crops in fine-grained citations
* fix: Killable timeout for stuck PDF text extraction
* fix: HTTP worker graceful shutdown timeout set to 120s
* fix: Strip control characters from extracted strings
* fix: Preserve overlay text selectability when embedding into existing PDFs
* fix: Protect figures from signature detection
* fix: Return 4xx for images with extreme aspect ratios
* refactor: More reliable PDF widget ingestion in `/edit`
* fix: Azure Vision OCR SDK retry behavior on-prem
* perf: Spreadsheet memory optimizations
* perf: Faster OCR cropping for table prediction
* perf: Faster PDF render path
* perf: Faster Google Cloud Vision response and rotation hint processing
* feat: Make all fields in auto-generated extraction schemas optional
* fix: Memory leak in PDF processing
* fix: Avoid PDF library hang on dense-tiling-pattern PDFs via fallback reader
* fix: Corrupt zip-based documents now return 400 instead of 500
* fix: Preserve citations on scalar-only schema adherence corrections
* fix: Cap spreadsheet agent instructions to avoid input token limits
* fix: Correct ordering of table enrichment relative to markup application
* fix: Clean up spurious 5xx on cancels and HTML to PDF hangs
* feat: New Azure Vision OCR strategy for on-prem deployments
* feat: Datadog dashboard for on-prem k8s deployments
* feat: Expose `keep_line_breaks` in V3 `settings.alpha`
* feat: Optimized inference backend for table router with safety wrapper
* fix: Faster usage logging via append-only buffer to reduce DB contention
* fix: Tolerate quoted-printable colspan in HTML parsing
* fix: Stabilize table router builds with pinned model export version
* chore: Bound embed feature with per-call
wall-clock budget
* feat: Dynamic rendering for small-font metadata extraction
* feat: Remove Adobe-added watermark layers from documents that have them
* feat: Default `fast_embed_pdf_metadata` to True
* fix: Preserve PDF optional content properties through page extraction for watermark layer stripping
* fix: Extract always returns V3 shape when citations are missing
* fix: Extract citations no longer dropped when `page_range` skips page 1
* fix: Clean up downloaded PDF temp files on batch failure
* refactor: LLM client dependency bump for CVE remediation
* refactor: Azure Vision array OCR failover with retryable errors only
* chore: Patch critical and high CVEs in Docker images
* chore: Return S3 links instead of raw JSON to avoid response size limits
* fix: Error attribution added to `GET /job` endpoint
* chore: Pass prompts as files instead of args to avoid bytes overflow
* fix: Skip parse-ID overlay in `/edit` when `form_schema` is provided
* fix: Filter out uninitialized providers in model config resolution
* fix: Page range validation parity in parse pipeline
* refactor: Classify enhancements
* feat: Azure Vision multi-endpoint failover with load balancing
* feat: Enhanced image-to-PDF conversion
* feat: Per-chunk embed PDF in pipeline
* feat: Run LLM enrichments in parallelized DAG for lower latency
* fix: Correct argument handling in offline entrypoint for org/job\_id
* fix: Hybrid OCR prefers metadata over garbled OCR when text is reordered
* refactor: Remove unused models from on-prem image builds for slim images
* fix: `max_completion_tokens` / `check_schema` escape hatch for reasoning models
* fix: Office document conversion with CJK fonts
* fix: Deliver webhook after `persist_results` to avoid S3 race condition
* chore: Upgrade LLM client dependencies
* feat: New `force_simple_page` config option
* feat: Enable `fast_flatten` for legacy early flatten in pipeline orchestration
* fix: `max_completion_tokens` schema validation for reasoning vs.
non-reasoning models
* fix: `max_tokens` and other arg handling for Azure LLM inference
* feat: Increased sheet processing timeout from 600s to 900s
* feat: Fast PDF flatten, selective rasterization replaces full-document rasterization
* fix: Validation for custom experimental options on on-prem
* fix: Reduce DB lock contention on batch completion
* fix: Equation enrichment correctly preserves line/word offsets
* feat: New fast embed via env settings
* feat: Configurable timeouts for OCR and embed text metadata steps
* feat: Per-page billing feature breakdown in parse response
* feat: Auto formatting per page
* perf: 10x faster PDF text overlay rendering
* perf: Parallelize figure classification with higher LLM concurrency
* perf: Optimized batch result aggregation
* fix: CSV parsing column count uses max row width; no longer drops columns when first row is narrower
* fix: OCR PDF no longer drops pages on multi-chunk documents
* fix: Allow None in override schema for bool fields
* fix: Correct `num_pages` reporting on jobs
* fix: Image conversion error handling
* fix: Strip markup noise before language detection to avoid false unknowns
* fix: Avoid page overlaps in `/split` endpoint output
* fix: Guard against errors in layout postprocess
* fix: Reclassify corrupt PDF annotation failures as proper 4xx status codes
* chore: Raise default overflow chunk limit to 500
* perf: Parallelize KV fallback to prevent task deadline breaches
* feat: Native DOCX XML parsing pipeline (alpha) with `.pages` support
* feat: HEIC image support and section-based chunking for Numbers files
* feat: Formatting and images support for Numbers files
* feat: Extract model and internal prompt overrides for v2/v3 configurations
* feat: YAML extract and citations models added to Helm chart for GPU deployments
* feat: KV repetition detection with Gemini fallback replacing repetition\_penalty
* feat: OTEL pipeline routing and K8s metrics collection
* fix: Auth for chained jobs
* fix: Required
fields on extraction schema
* fix: Equation detection TypeError from tuple/list concatenation
* fix: Move sync blocking calls off the event loop in HTTP handlers
* fix: Use encrypted DB URL and disable k8s metrics in on-prem environments
* fix: Fail fast on hung PDF renders
* fix: Parallelize S3 batch result loading for faster retrieval
* perf: Optimized layout postprocessing
* perf: Optimized hybrid OCR processing
* refactor: Graceful fallbacks for XML conversion issues
* chore: Cron retries increased from 1 to 2 for improved reliability
* chore: Send total credits for on-prem customers
* feat: OCR-based table citations for deep extract
* feat: Add API key prefix filter parameter for /jobs endpoint
* fix: On-prem presigned URL upload path mismatch
* fix: new layout postprocessing
* fix: Empty OCR fallback handling
* fix: Python version in sandbox runtime
* chore: Upgraded enhanced figure summary models
* fix: on-prem deployments start without Redis configured
* feat: On-prem usage logging for customer tracking
* feat: Schemaless deep extract
* feat: Granular citations in deep extract
* feat: Deep extract available for on-prem deployments
* feat: Spreadsheet sheet-name page\_range support
* feat: Suppress citation content feature flag for extract
* fix: Garbled DOCX for change tracking
* fix: Recursive render to handle nested tables for edit
* fix: Rotation in embed metadata
* fix: GCP API key requirement for on-prem
* fix: Remove libpq options from DB connect\_args for RDS Proxy compatibility
* fix: Lock timeout batches and non-locking last-batch check
* fix: Background threads no longer block main processing
* fix: Transient classify inference issues
* perf: Merge tables speedup from O(N^2) to O(N)
* perf: Retry improvements to avoid double work
* refactor: Decompose batch pipeline into composable phases
* chore: Upgrade Gemini models
* fix: Middleware context propagation and ordering for proper trace handling
* fix: Cron job retry logic and syntax improvements
* fix: HTTP startup patched for hashlib.md5 in FIPS environments (GCS support)
* perf: Decaying timeout for retries with improved retry\_on\_timeout behavior
* perf: Updated timeout and max batch configurations
* feat: Chunk overlap configuration for including text context from previous/next chunks
* feat: summarize\_all\_figures option in v3 alpha config
* feat: Deep extraction optimizations for improved structured data quality
* fix: Lazy loading for HTTP/worker modules to avoid unnecessary dependency imports
* fix: Guard against empty document\_url list in pipeline and split endpoints
* fix: Cron job improvements
* fix: More reliable page orientation detection
* fix: Exception handling on deep extraction completion
* refactor: Deep extract sandbox image
* chore: Upgraded Anthropic models
* chore: New table detection model with improved accuracy
* fix: Add bounds check for page index in PDF text embedding to prevent IndexError crashes
* fix: Skip cover pages for PDF portfolios during attachment concatenation
* fix: Fallback to original pages when portfolio has no PDF attachments
* fix: Fast-fail URL download on non-success HTTP status codes
* fix: Random checkbox YOLO crash when Conv has no batch normalization
* fix: HuggingFace model downloads for builds
* fix: Lazily import probing modules to avoid Modal dependency in on-prem
* feat: Custom agentic layout postprocessing
* feat: Routing for parse batches
* feat: Classify concurrency improvements
* refactor: Set reducto environment to `onprem` by default
* chore: Upgrade pytorch and torchvision dependencies
* fix: Detect visual redlines (colored strikethrough/underline) in DOCX change tracking
* fix: Use min instead of max for checkbox detection
* fix: CSV parsing truncation and scientific notation for large integers
* fix: Recover in-progress batches alongside pending ones
* feat: Fallback to Gemini Flash for improved reliability
* feat: Intelligent ordering fallbacks
* feat: Schema adherence model updates
* feat:
Add ONNX model integrity verification with forced fresh model download
* refactor: Database lock timeout for sync DB engine
* feat: New OCR recognition model for Apple deployments
* fix: Recover in-progress batches alongside pending ones
* fix: Add ONNX model integrity verification and force fresh model download
* feat: Classify endpoint with parallelized Gemini Flash Lite probes for document classification
* feat: Add bucket\_name as alpha option in v3 parse config
* feat: Enable bucket & KMS ARN override for hybrid VPC deployments
* feat: Auto region routing for Gemini models
* feat: Native office conversion alpha flag in v3 config
* feat: Inference helm charts and kv-base routing
* feat: Enable flatten for edit endpoint
* feat: Improved models for standard figure summary
* feat: Add dimension limit handling for AWS environments
* fix: Memory leaks and file descriptor leaks in PIL Image handling across OCR and processing pipelines
* fix: N-squared completion pattern in batch processing for significantly improved performance at scale
* fix: Race condition in parse completion job processing
* fix: Argument order bug in pdftext multiprocessing extraction
* fix: V3 config fixes for on-prem deployments
* fix: Initialize empty sheets to prevent errors on blank spreadsheets
* fix: Force resize to fit AWS dimension limits for large documents
* fix: Image conversion failures now return proper 415 error instead of 500
* fix: PyPDFForm version update to resolve form filling bug
* fix: Classify endpoint fixes for improved reliability
* fix: Offset\_in\_chunk calculation for empty blocks
* fix: Exclude veryHidden sheets when exclude\_hidden\_sheets is enabled
* fix: Checkbox detection bug
* fix: Prioritize S3/BUCKET over GCS when both GCP\_PROJECT\_ID and BUCKET are set
* fix: cron.py Kubernetes usage
* fix: Distributed traces with LOGFIRE\_DISTRIBUTED\_TRACING
* fix: Temperature 0.1 for promptable layout for more deterministic results
* fix: local-full Dockerfile fix by
adding gcc and python3-dev to apt install
* perf: Hydrate SharedBatchWorker.process\_org\_batch before ThreadPoolExecutor for improved concurrency
* refactor: Remove enhanced enrich tables, default to same model for simpler table processing
* chore: Upgrade table models
* feat: V3 config overrides for v2-only and on-prem-only settings
* feat: List item support and chunk offsets in blocks for improved extraction
* fix: handle\_required\_fields not adding missing fields to array items in extraction
* fix: Page marker blocks now include correct page and original\_page values
* fix: Multi-batch recovery when job processing is interrupted
* fix: Division by zero error for images with corrupted EXIF data
* fix: Restore V2 OCR defaults (highres OCR system) in V3 on-prem config for consistent behavior
* chore: Bookworm image build configuration for CD pipeline
* feat: Bookworm Dockerfile variant for improved on-prem DOCX→PDF conversion reliability
* fix: DOCX→PDF conversion using LibreOffice from Trixie backports for improved reliability
* feat: Super-agent integration into /extract pipeline for improved structured data extraction
* fix: Traceparent propagation for API requests
* perf: Per-image table predictions for better performance
* feat: Hybrid VPC routing based on header with default AU/EU/US regions
* feat: Add docx fallbacks for malformed XML and OOXML-format .doc files
* feat: Change default presigned URL expiration from 1 hour to 12 hours
* fix: Table edit pattern improvements and preferred edit model changes
* feat: Schema adherence for required keys in extraction
* feat: Improved table edit granularity
* fix: Properly propagate password errors for password-protected PDFs
* fix: Anthropic Bedrock on-prem edit calls
* feat: Add raw XML repair fallback for malformed docx files
* fix: Local parse hanging for multi-batch documents
* feat: Schema Optimization Agent for improved extraction accuracy
* fix: Hyperlinks being dropped when OCR extraction mode is enabled
* feat: Add line level offsets when config is enabled
* feat: Intelligent Ordering Model API integration
* feat: Add document\_password support to pipeline API for password-protected documents
* feat: Implement character-level DOCX change tracking
* fix: Hidden rows and columns handling for spreadsheets
* feat: Cloudflare R2 Storage Class support
* fix: Settings Overrides for streamlined API config/env var customization
* chore: inference parallelization
* feat: OCR word and line rotation data propagation
* fix: layout prediction improvements
* feat: extract schema adherence
* fix: empty table model output
* refactor: Document fetching logic
* feat: Updated ordering model
* feat(settings): more streamlined customization for models and prompts via API configuration / env variables
* feat: Customizable models for AWS Bedrock using environment variables
* refactor: Default models updated for AWS Bedrock to `us.anthropic.claude-sonnet-4-5-20250929-v1:0`
* feat: support more edge case custom file mimetypes
* fix: table chunking
* fix: make enrich tables more robust
* refactor: optimize some DB transactions to not be left open too long
* feat: Add signatures as a formatting option in v3 config
* feat: extract schema adherence
* refactor: optimize enrich tables latency
* feat: new hybrid OCR implementation
* feat: add force file mimetype to extension config option
* feat: env var based customization for local KV prompt/model
* feat: Add priority-based worker routing to skip shared/dedicated workers when priority is not set
* chore: adding latency sensitive for fast mode in Spreadsheet Agent
* feat: add OpenAI Responses LLM Provider
* feat: Allow direct DataDog Tracing with Beta Headers and Logfire Service name handling
* fix: table block chunking
* fix: Chainguard image dependencies
* fix: md5 for FIPS environments
* fix: numbers file parsing
* fix: allow invalid surrogates when encoding
* feat: Add logfire gauge metrics for K8s queue lengths
* feat: Add Azure Blob Storage
authentication support for private endpoints
* fix: race condition with in progress batch -> job completion enqueue
* fix: logfire logging if logfire token is set
* fix: embed pdf metadata
* fix: persist results before webhook
* fix: update cancel\_all and wipe endpoints for on-prem and secure them correctly
* refactor: cron cleanup function + running frequency
* fix: Chainguard image dependency issues
* fix: PgDog Helm Chart application version configuration for on-premise deployments
* feat: V3 API config with improved spreadsheet response format and citations support
* feat: Enhanced table block chunking for better extraction of large tables
* feat: Agent-in-the-loop (AITL) extraction with generalizable configuration for multiple fields
* feat: Spreadsheet figure summary support for better data visualization
* feat: LLM provider preference configuration for v3 API (specify OpenAI, Anthropic, Google, etc.)
* feat: Helm chart PgDog dependency for PostgreSQL monitoring
* feat: Affinity and topologySpreadConstraints support in Helm charts for advanced pod scheduling
* fix: OCR system handling in v3 config
* fix: Race condition for single batch jobs
* fix: Webhook delivery on Kubernetes environments
* fix: Parse job update batching for improved database performance
* fix: DOCX timeout increased for large document processing
* fix: Table merging with XML parsing improvements
* fix: Underline/strikethrough character threshold adjustments
* chore: Datadog integration for enhanced monitoring
* feat: Reduce PDF output size by avoiding text layer rasterization
* feat: Custom chunking response format support
* fix: AITL configuration handling for proper field validation
* feat: Agent-in-the-loop (AITL) documentation exposed and configuration updated to handle multiple fields
* feat: Hyperlink extraction support in PDF parsing - preserves document links in output
* feat: PostgreSQL Helm dependency migrated to OCI registry for better reliability
* feat: Spreadsheet figure
summary generation for visual data extraction
* fix: OCR system switching for v3 config
* fix: Change tracking for accurate document diff detection
* feat: Helm charts now support affinity and topologySpreadConstraints for advanced Kubernetes pod placement control
* feat: Table merging heuristics improved
* fix: Webhook delivery on Kubernetes fixed for reliable notification
* fix: Underline and strikethrough detection threshold adjusted for better accuracy
* fix: Safe fill implementation used everywhere in PDF form filling
* chore: Datadog monitoring integration
* feat: V3 API config support - new configuration format for improved extraction control
* feat: Naive table merging for bulk processing with better cross-page detection
* feat: Figure summary enhancements and configuration via API
* feat: LLM provider preference support in v3 config
* feat: Tool use support for Anthropic provider
* feat: /openapi.json and /openapi-legacy.json endpoints for API schema access
* feat: Split implementation improvements
* fix: Database transaction handling in Kubernetes - don't keep transactions open
* fix: Experimental table citations now default to true in v3
* fix: Extract confidence concurrency handling
* chore: Figure summarization adjusted for more thorough output
* fix: Helm chart labels for retry stale jobs cronjob
* fix: Build configuration cleanup
* feat: Support for custom extract models via LLM service, enabling on-premise model configurations
* fix: PDF form dropdown filling improvements with proper context and option handling
* fix: Excel column to string conversion using openpyxl
* feat: New /jobs endpoint with cursor-based pagination for efficient job listing and filtering
* feat: New PDF edit flow using parse pipeline for improved form filling accuracy and performance
* feat: Schema-less extraction generation - automatically infer extraction schemas when not provided
* feat: Enhanced table merging across pages in HTML documents with improved row/column
detection
* feat: Improved spreadsheet agent with citations support and performance optimizations
* feat: Parallelized batch results loading from storage for faster retrieval
* fix: Multi-page TIFF and JPEG handling for proper page extraction
* fix: Password-protected landscape PDF processing
* fix: Text overlay visibility issues during edit flow
* fix: Spreadsheet agent formatting values in preview mode
* chore: Docker base image upgraded to Debian Trixie for better security and compatibility
* chore: Enhanced mode set as default for better quality
* feat: Priority handling for time-sensitive extraction requests with improved page mapping reasoning
* feat: Improved Vertex AI Gemini region configuration
* fix: Array extract error handling - prevents crashes from malformed LLM output
* fix: Better concurrency management for key-value extraction
* fix: Split configuration handling improvements
* chore: OpenAI API retry logic for handling slow responses
* chore: Exponential backoff for split operations
* feat: Improved layout inference with reduced latency
* feat: PDF edit overlay improvements using OCR-B font for better text rendering
* fix: Timeout configuration improvements for long-running operations
* fix: Worker stability improvements and bug fixes
* feat: Spreadsheet extraction agent enhancements for better cell and table detection
* fix: Citation formatting improvements across extraction outputs
* feat: Cross-page table merging improvements with naive row merging implementation
* fix: Performance optimizations for large document processing
* feat: Enhanced extraction pipeline with improved data handling
* fix: Error handling improvements throughout the system
* feat: GCP Workload Identity support for Google Cloud deployments
* feat: AWS region override configuration for flexible cloud deployments
* fix: Worker stability enhancements
* feat: Prometheus alerting integration for monitoring
* fix: Reliability enhancements for long-running jobs
* feat: opt-in or
opt-out to send billing usage to license server
* feat: block OpenAI invocation with BLOCK\_OPENAI env var
* feat: signature detection
* fix: helm chart template rendering
* fix: update figure summarization to correctly override default prompt when user wants to override
* refactor: update equations detection to use on premise-provided LLMs
* feat: configurable S3 endpoint url
* feat: character-level support for azure in hybrid mode
* feat: support for docx comments
* feat: split support with gemini on vertex ai
* refactor: updated LLM service with vision/text
* fix: ensure formatted text (i.e. underline, strikethroughs) is not subsumed by key value detection
* feat: secret management in helm chart
* feat: add .msg file support
* feat: HEIC file format support for image processing
* feat: character-level OCR detection for strikethrough and underline formatting
* feat: parallelize and optimize PDF metadata embedding for improved performance
* fix: add locks to prevent race conditions
* fix: timeout handling for DOCX to PDF conversion with proper 400 status codes
* feat: configurable S3 SSL options for boto
* feat: /billing-usage API for exporting usage in air-gapped deployments
* feat: Support for Google Cloud Storage gs\:// document url
* feat: timeout and fail jobs and batches when queued for GLOBAL\_QUEUE\_TIMEOUT\_SEC
* feat: BackendConfig support in Helm Chart for GCP
* feat: generate extract schema if no schema was provided
* feat: adding form schema for edit documentation
* feat: improve cold starts
* fix: fine-grained citation fixes
* feat: DOCX improvements
* feat: added schema token limits
* feat: add customizations to auth via environment variables
* feat: faster model inference optimizations
* fix: OCR image resizing improvements
* feat: implement fault-tolerant webhook delivery
* feat: fix table headers for html parsing
* feat: add secret metadata parameter to /job/{job_id} endpoint
* feat: include config when include\_metadata is enabled for job
endpoint
* feat: adding sheet color to output
* feat: clean up refs in extract output
* feat: excel table color mapping implementation
* feat: enhance merge tables
* feat: table feedback loop using the enrich table flag
* feat: add litellm proxy model for 'best'
* feat: long-polling with timeout (seconds) query param for `/job/{job_id}`
* fix: job type and add duration field in `/jobs` endpoint
* fix: Preserve all decimals in md tables
* feat: support for GCP
* feat: presentation detection and kv-disabling
* fix: Strike underline tuning
* feat: implement rtf
* fix: Fix offsets for tables extracted from excel sheets
* feat: add optional confidence fields to OCRWord and OCRLine
* feat: add source in `/jobs`
* feat: initial change detection implementation
* docs: clarify Excel citation coordinate system differences
* feat: Integrate spreadsheet agent for extract
* feat: Convert images to pdf for pdf\_url
* Fix: Allow Gemini to output `` and `` fields for key-value
* chore: update textract quota
* feat: option for multiplatform builds for onprem
* docs: Add Model Governance Policy to security section
* fix: Split regex fix for subcategory
* fix: Add retries to spreadsheet agent
* fix: Merging splits in the new format
* fix: Merge array\_extract citations based on extract results
* Add chart extraction documentation page
* feat: ship both small/large models in built images
* feat: allow changing default use\_gpu\_ocr config value based on env var
* feat: handle multipart/form-data content type errors on /split endpoint
* Latency fix: Agentic unicode changes
* fix: sanitize html file upload path to s3
* Add strict typing for SplitResult.splits
* feat: Helm chart and values for GPU-based OCR deployment
* feat: enhanced DOCX change tracking with improved underline detection and formatting accuracy
* fix: optimized model server initialization to reduce startup time and improve processing performance
* fix: resolved document conversion hangs caused by separate executor
processes for improved reliability
* feat: added cancel\_all endpoint for on-prem deployments to cancel all running jobs at once
* feat: enhanced extraction with schema key normalization and improved page range references in citations
* feat: added global timeout overrides for better performance control and reliability
* fix: resolved document conversion hangs with global timeout implementation
* fix: improved change tracking validation with proper error handling
* feat: enhanced DOCX metadata extraction for improved change tracking capabilities
* fix: improved Excel citation handling when OCR data is not available
* feat: added on-prem licensing alerts when connection to license.reducto.ai fails
* feat: implemented timeout functionality for improved processing performance and reliability
* fix: improved authentication on /upload and /cancel endpoints for better security
* feat: enhanced multilingual OCR text embedding with support for Latin, CJK, Cyrillic, and Devanagari scripts using custom Unifont font
* feat: file-based authentication system for on-prem deployments with Kubernetes secret mounting support
* feat: automatic file cleanup system with configurable retention windows (default 60 minutes) to manage storage usage
* fix: improved authentication reliability with retry logic for API validation calls
* fix: enhanced extraction pipeline to handle None extract\_outputs and improve data merging
* fix: native office conversion now skips files over 150MB and falls back to LibreOffice for better reliability
* fix: improved citation confidence handling when confidence values are null
* fix: batch processing improvements to keep batches alive for large documents
* feat: enhanced array extraction to work with non-array fields for improved data extraction flexibility
* feat: improved LLM error handling and timeout support for more reliable model calls
* feat: added support for OpenDocument Text (.odt) file uploads through existing LibreOffice conversion pipeline
* fix: improved block merging logic to properly update table content during document enrichment
* fix: added retry logic for database errors on job status requests to improve reliability
* fix: preserve empty blocks (such as figures with no content) in final document layout
* feat: default to big extract model
* feat: automatic file cleanup
* fix: support for azure openai
* feat: add hidden sheet/row/column filtering for Excel processing
* feat: Fix jsonbbox and citations for excel
* fix: pdf processing timeout
* fix: Fix DOCX to PDF conversion error status code from 500 to 400
* feat: enable change tracking capability
* fix: remove the large figure filter in dfine layout model postprocessing
* feat: Split blocks for array\_extract on excel to separate pages
* feat: Persist the full result for url results to persist bucket
* feat: Query jobs by user-id and fair queueing docs
* fix: Edit conditionals so full tables aren't returned in citations
* fix: Fix job type error when cancelling a job
* fix: on-prem changelog auth on light and dark mode
* fix: Rename file to include guessed extension if one isn't already included
* fix: handle empty bbox arrays in layout postprocess calculations
* feat: update replicated helm chart
* feat: surface all table citations in v2
* feat: internal webhook via IPC on job completion
* feat: expose table citations in extraction results
* feat: add persist config option that persists parsebatches and results
* fix: keep batch alive for large HTML documents
* fix: check for empty document\_url list in extract
* refactor: PDF editing with PyPDFForm for improved form handling
* feat: update extraction pipeline with improved array handling
* fix: f-string usage in logfire calls across the codebase
* fix: OpenAI vision LLM calls in LLM router
* fix: root level acroform rendering (preserve form values)
* feat: add onprem config option to enable figure summaries for all figures
* feat: add exclude\_configs query param to /jobs endpoint to reduce
response size
* fix: onprem CD now corrects the `/version` url to the latest version number

Enhanced support for OCR text embedding in PDFs with the `embed_text_metadata_pdf` flag. Added support for routing on-premise deployments to the v2 extraction pipeline and improved exception handling for subscription errors in Stripe usage logging. Fixed Pydantic AI Agent tools configuration in the document editing functionality. Added Azure OpenAI support and improved the table model with a new fallback mechanism. Enhanced webhook validation and added support for selective customer notifications via the target channels parameter.

Added support for LiteLLM Proxy configuration via environment variables:

* `LITELLM_PROXY_URL`: URL of the LiteLLM Proxy
* `LITELLM_PROXY_FAST_MODEL`: Fast model to route to via the proxy
* `LITELLM_PROXY_ACCURATE_MODEL`: Accurate model to route to via the proxy

When using the proxy configuration:

* Both fast and accurate models must be defined if using the proxy URL
* Existing LiteLLM routing options are overridden when proxy settings are active

This enables easier integration with centralized proxy setups for model routing and observability.

Added a `/wipe` endpoint to the On-Prem API that wipes the database of all parse jobs, batches, and tasks. It is only available to on-prem customers and is intended as a fail-safe. Make sure it is not exposed to end users; it should be reachable from the backend only. Please let us know if you'd like this disabled or removed in your deployment.

This release includes query optimizations that significantly reduce CPU usage of the Postgres DB at high document volumes (e.g. > 1k pg/min). In Google Cloud environments, we improved skew detection by updating our thresholds to detect skew more reliably in certain cases. Some additional bug fixes were made for users who specify an LLM provider preference.
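
For the LiteLLM Proxy environment variables listed earlier in these notes, the rule that both models must be defined whenever the proxy URL is set can be checked before a deployment starts. The following is a minimal sketch of such a preflight check, not Reducto's own validation logic: the function name `resolve_litellm_proxy` and the returned dictionary keys are hypothetical, and only the three environment variable names come from the release notes.

```python
import os
from typing import Mapping, Optional

# Variable names taken from the release notes above.
URL_VAR = "LITELLM_PROXY_URL"
FAST_VAR = "LITELLM_PROXY_FAST_MODEL"
ACCURATE_VAR = "LITELLM_PROXY_ACCURATE_MODEL"


def resolve_litellm_proxy(env: Mapping[str, str]) -> Optional[dict]:
    """Hypothetical preflight helper (not part of Reducto).

    Returns None when no proxy URL is configured, a dict of settings when
    the configuration is complete, and raises ValueError when the URL is
    set but one or both model variables are missing.
    """
    url = env.get(URL_VAR, "").strip()
    if not url:
        # No proxy configured: existing LiteLLM routing options stay in effect.
        return None

    fast = env.get(FAST_VAR, "").strip()
    accurate = env.get(ACCURATE_VAR, "").strip()

    # Both models are required once the proxy URL is defined.
    missing = [name for name, value in ((FAST_VAR, fast), (ACCURATE_VAR, accurate)) if not value]
    if missing:
        raise ValueError(f"{URL_VAR} is set but these variables are missing: {', '.join(missing)}")

    return {"url": url, "fast_model": fast, "accurate_model": accurate}


if __name__ == "__main__":
    settings = resolve_litellm_proxy(os.environ)
    if settings is None:
        print("LiteLLM Proxy not configured")
    else:
        print(f"LiteLLM Proxy configured at {settings['url']}")
```

Running a check like this in a CI step or an init container surfaces an incomplete proxy configuration before any pods start, rather than at the first model call.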