Parsing Error Handling | Extend Documentation

When an error occurs, the Parse API returns a structured error with the following fields:

Field	Type	Description
`code`	string	A specific error code that identifies the type of error
`message`	string	A human-readable description of the error
`retryable`	boolean	Indicates whether retrying the request might succeed
`requestId`	string	A unique identifier for the request, useful for troubleshooting

How failures surface: sync vs. async

Where an error shows up depends on which endpoint you use.

Synchronous (/parse) waits for processing to finish, so every failure — both request validation and document processing — comes back as an HTTP error response with the fields above. The HTTP status codes below apply to this endpoint.
Asynchronous (/parse_runs) returns 200 with a parse run as soon as the job is accepted. Request-level problems (invalid request, auth) still return an HTTP error at creation time, but document processing failures are not HTTP errors — the run instead reaches status: "FAILED" with a failureReason and failureMessage (one of the custom error codes below), which you read when you poll or receive the webhook.

Production integrations should use the asynchronous /parse_runs endpoint and check status on the returned run. See Move to production with async processing.

Custom error codes

We provide custom error codes so your system can tell exactly what happened. On the sync endpoint they appear in the error body’s code field; on the async endpoint they appear as the run’s failureReason. Most are client errors related to the file provided for parsing and are not retryable.

Error Code	Description	Retryable
`INVALID_CONFIG_OPTIONS`	Invalid combination of options in the incoming config.	❌
`UNABLE_TO_DOWNLOAD_FILE`	The system could not download the file from the provided URL, likely means your presigned url is expired, or malformed somehow.	❌
`FILE_TYPE_NOT_SUPPORTED`	The file type is not supported for parsing.	❌
`FILE_SIZE_TOO_LARGE`	The file exceeds the maximum allowed size.	❌
`CORRUPT_FILE`	The file is corrupt and cannot be parsed.	❌
`OCR_ERROR`	An error occurred in the OCR system. This is a rare error code and would indicate downtime, so requests can be retried. We’d suggest applying a retry with backoff for this error.	✅
`PASSWORD_PROTECTED_FILE`	The file is password protected. Retry the request with the file’s password using `file.settings.password`. See Password-Protected Files.	❌
`FAILED_TO_CONVERT_TO_PDF`	The system could not convert the file to PDF format.	❌
`FAILED_TO_GENERATE_TARGET_FORMAT`	The system could not generate the requested target format.	❌
`INTERNAL_ERROR`	An unexpected internal error occurred. We’d suggest applying a retry with backoff for this error as it likely a result of some outage.	✅

HTTP status codes

These HTTP status codes are returned by the synchronous /parse endpoint. On the asynchronous /parse_runs endpoint, only request-level errors (400, 401, 403) are returned at creation time — processing failures surface on the run as status: "FAILED". We generally recommend relying on the custom error codes above for programmatic handling.

400 Bad Request

Returned when:

Required fields are missing (e.g., file)
url is not provided in the file object
The provided url is invalid
The config contains invalid values (e.g., unsupported target format or chunking strategy)
The file type is not supported
The file size is too large

401 Unauthorized

Returned when:

The API token is missing
The API token is invalid

403 Forbidden

Returned when:

The authenticated workspace doesn’t have permission to use the parse functionality
The API token doesn’t have sufficient permissions

422 Unprocessable Entity

Returned when:

The file is corrupt and cannot be parsed
The file is password protected
The file could not be converted to PDF
The system failed to generate the target format

500 Internal Server Error

Returned when:

An OCR error occurs
A chunking error occurs
Any other unexpected error occurs during parsing

Handling errors

Error handling differs between the two endpoints: with sync you catch a thrown HTTP error, while with async you also check the run’s terminal status. Pick the tab that matches how you call Parse.

Synchronous (/parse)

Asynchronous (/parse_runs)

Catch the HTTP error and inspect its status code and body.

Python

TypeScript

Java

Go

1 from extend_ai import Extend
2 from extend_ai.core.api_error import ApiError
3 
4 client = Extend(token="your_api_key")
5 
6 try:
7     response = client.parse(file={"url": "https://example.com/document.pdf"})
8 except ApiError as e:
9     # e.body holds the structured error: { code, message, retryable, requestId }
10     body = e.body or {}
11     if body.get("retryable"):
12         ...  # retry with exponential backoff
13     print(f"parse failed ({e.status_code}): {body.get('code')} — {body.get('message')}")
14     raise

For retryable errors (OCR_ERROR, INTERNAL_ERROR, or any 5xx), retry with exponential backoff. Always log the requestId when contacting support.