The parsing response format is optimized for flexibility with retrieval augmented generation.
If you are just looking for a markdown representation of a given document, you can disable chunking altogether and just use response['result']['chunks'][0]['content'].
See the entire response format in the API Reference.

Attribute breakdown

result
object

Code snippet example

The Parse output can have two types: either full or url. The full type contains all the content and block information. Reducto swaps to using type=url when the request response size is close to exceeding the maximum allowed HTTP response size (6 MB for our current infra). Set force_url_result to true to consistently return URLs.
{
  "job_id": "abcdef123456",
  "duration": 5,
  "pdf_url": null,
  "studio_link": "https://studio.reducto.ai/job/xyz",
  "usage": {
    "num_pages": 1,
    "credits": 0.5
  },
  "result": {
    "type": "full",
    "chunks": [
      {
        "content": "Detailed content for full LLM usage.",
        "embed": "Summarized content optimized for embeddings.",
        "enriched": "Additional semantic enrichment details.",
        "enrichment_success": true,
        "blocks": [
          {
            "type": "Header",
            "bbox": {
              // All bbox values normalized to [0,1] range
              "left": 0.1, // Distance from left edge
              "top": 0.05, // Distance from top edge
              "width": 0.8, // Width as % of page width
              "height": 0.1, // Height as % of page height
              "page": 1, // Current page number (1-indexed)
              "original_page": 10 // Original doc page number
            },
            "content": "Text content",
            "image_url": null, // presigned url to download figure/table image
            "confidence": "High",
            "logprobs_confidence": 0.92
          } ...
        ] ...
      } ...
    ]
  }
}