Hybrid VPC deployment provides a balance between data sovereignty and operational simplicity. Your data stays in your cloud account while Reducto manages all compute infrastructure.Documentation Index
Fetch the complete documentation index at: https://docs.reducto.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
In a Hybrid VPC deployment:- Data stays in your cloud account: All documents, intermediate artifacts, and results are stored in your storage
- Compute runs on Reducto’s infrastructure: GPU processing and model inference are handled by Reducto
- Stateless by design: Objects have a configurable lifecycle, ensuring no data persists beyond processing
- Multiple storage providers: AWS S3, Azure Blob Storage, and Box are supported
AWS S3
Cross-account IAM role with ExternalId protection. Optional PrivateLink for private-only API access.
Azure Blob Storage
Cross-tenant service principal access with RBAC. Standard Azure security model.
Box
Box enterprise app with Client Credentials Grant. Ideal for organizations already using Box for document management.
Google Cloud Storage
Cross-project service account access. Standard GCP IAM model.
Key benefits
| Benefit | Description |
|---|---|
| Data sovereignty | Storage remains in your cloud account |
| No GPU management | Offload model inference to Reducto’s optimized GPU cluster |
| Cost efficiency | Avoid provisioning and maintaining GPU capacity |
| Fast auto-scaling | Scale to zero when idle, scale up on demand |
| Reduced DevOps burden | Faster iteration, no infrastructure maintenance |
Architecture
Data flow
- You upload documents to your storage (or use Reducto’s
/uploadendpoint) - You call Reducto API with a reference to your document
- Reducto uses your configured credentials to access the document
- Processing occurs on Reducto’s compute infrastructure
- Results and artifacts are written back to your storage
- Objects expire automatically based on your lifecycle configuration
Choosing a Storage Provider
AWS S3 — Recommended for AWS-native organizations
AWS S3 — Recommended for AWS-native organizations
Best choice if your organization already uses AWS. Provides cross-account IAM role assumption with ExternalId protection against confused deputy attacks. Optional AWS PrivateLink keeps all traffic off the public internet. Terraform module provided for automated setup.
Azure Blob Storage — For Azure-native organizations
Azure Blob Storage — For Azure-native organizations
Best choice if your organization uses Azure. Uses cross-tenant service principal with RBAC role assignments. Terraform configuration provided for automated setup.
Box — For Box-first document workflows
Box — For Box-first document workflows
Best choice if your organization already manages documents in Box. Uses Box enterprise app authentication (Client Credentials Grant). No Terraform provider available — setup is done through the Box Admin Console.
Google Cloud Storage — For GCP-native organizations
Google Cloud Storage — For GCP-native organizations
Best choice if your organization uses GCP. Uses cross-project service account access with IAM bindings. Contact Reducto for setup guidance.
Document Handoff
There are multiple ways to provide documents to Reducto APIs, regardless of which storage provider you use:- Upload Endpoint
- Direct URI (S3 only)
Use Reducto’s
/upload endpoint to upload documents directly. Files are automatically stored in your configured storage:For PrivateLink connections (AWS only), specify the region-specific hybrid endpoint as
base_url:- US:
https://hybrid.platform.reducto.ai - EU:
https://hybrid.eu.platform.reducto.ai - AU:
https://hybrid.au.platform.reducto.ai
Integration Contract
After setting up your storage infrastructure, provide the following values to Reducto:- AWS S3
- Azure
- Box
| Value | Description |
|---|---|
bucket_name | S3 bucket name |
region | AWS region (e.g., us-east-1) |
role_arn | IAM role ARN for Reducto to assume |
external_id | ExternalId for secure role assumption |
privatelink_endpoint_id | VPC Endpoint ID (if using PrivateLink) |
Multi-Region Setup
For organizations needing storage in multiple regions for latency or compliance requirements, see the provider-specific setup guides linked above. Each provider supports region-specific configurations that Reducto routes automatically based on the deployment area (US, EU, AU).Security
All storage integrations follow least-privilege principles:- AWS: ExternalId prevents confused deputy attacks; IAM policy limits access to S3 operations only
- Azure: RBAC role assignment scoped to the specific storage account/container
- Box: App access restricted to the configured folder; enterprise admin approval required
- All providers: Automatic data cleanup via configurable lifecycle policies