Best Practices: Latency Optimization
When processing high-volume documents or building real-time applications, latency becomes a critical factor. This guide provides the most impactful settings to reduce latency.
Many latency-sensitive settings involve trade-offs with accuracy for complex documents. See the Advanced Options guide for detailed explanations of each setting.
Quick Reference
Use this checklist when optimizing for latency:
Advanced Options
- Use
extraction_lightfor simple document types (verify accuracy with evaluation sets) - Turn off model reasoning insights (
modelReasoningInsightsEnabled: false) - only needed for debugging - Disable advanced multimodal (
advancedMultimodalEnabled: false) - unless processing scans/handwritten content - Turn off bounding box citations (
citationsEnabled: false) - removes spatial location references
Extraction Chunking Options
- Limit page ranges if data is on specific pages
- Use
confidenceortake_firstmerging instead ofintelligent - Use
large_array_heuristicsarray strategy if processing large arrays
Parser Configuration
- Use
documentchunk type for non-array extraction to skip merging entirely - Disable figure parsing - unless documents contain important charts/diagrams
- Disable agentic OCR - unless processing handwritten/poor quality scans
Workflow
- Split into parallel extractors if you have both simple fields and complex arrays
Light Extraction
The biggest change you can make to reduce latency is selecting Extraction Light instead of the default Extraction Performance.

Extraction Light is faster and cheaper, but removes support for advanced visual features like figure parsing and signature detection. See the Extraction Light Changelog for details.
Disabling Advanced Options
Each of these options adds processing overhead. Disable what you don’t need:
Chunking Optimizations

For non-array extraction: Set chunk type to document to skip intelligent merging entirely—this is the fastest option.
For large array extraction: Use large_array_heuristics array strategy with smaller chunk sizes.
Merging strategy: Switch from intelligent to confidence, take_first, or take_last to avoid extra processing overhead.

Disable Advanced Parsing Options

- Figure parsing - Disable unless documents contain important charts/diagrams
- Signature detection - Disable unless signature verification is needed
- Agentic OCR - Disable unless processing handwritten or poor-quality scans
Related Topics
- Learn about Advanced Options for detailed explanations of each setting
- Understand Field Names and Prompt Crafting for schema optimization
- Explore Evaluation Sets to validate accuracy when optimizing for speed

