“Splitters” break down multi-page documents into separate, organized sections. These can be configured via either the UI or the API.
You can consume splitter outputs via webhooks, track updates to your splitter, and more. See Webhook Events for details.
You can view full details of the SplitConfig in our API reference.
When working with the SplitConfig, you can configure several key aspects, such as:
The identifierKey field on each classification allows you to define a natural-language rule for extracting a unique identifier from subdocuments of that type. For example, you might extract an invoice number, contract ID, or receipt number.
When provided, the splitter extracts the specified identifier and includes it in the output’s identifier field. It also helps the splitter determine where to separate subdocuments of this type.
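As a rough illustration, a classification carrying an identifierKey could be assembled as shown here. Only identifierKey comes from this page. The helper build_classification, the type and description fields, and the splitClassifications wrapper key are assumptions made for this sketch, so check the SplitConfig API reference for the actual shape.

```python
import json


def build_classification(type_name, description, identifier_key=None):
    """Build one classification entry (hypothetical shape, not the confirmed schema)."""
    entry = {"type": type_name, "description": description}
    # Attach identifierKey only when a rule is given, so other types stay unchanged.
    if identifier_key:
        entry["identifierKey"] = identifier_key
    return entry


split_config = {
    # "splitClassifications" is an assumed wrapper key for this sketch.
    "splitClassifications": [
        build_classification(
            "invoice",
            "Vendor invoices with line items and totals",
            identifier_key="The invoice number shown near the top of the first page",
        ),
        build_classification("other", "Pages that do not match another type"),
    ]
}

print(json.dumps(split_config, indent=2))
```

The rule is written in natural language rather than as a regular expression, matching the description of identifierKey above.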
identifierKey is supported on splitting_light >= 1.3.0 and splitting_performance >= 1.5.0. If passed on older versions, it is accepted but ignored. It replaces the deprecated advancedOptions.splitIdentifierRules field, which is likewise accepted but ignored on those newer versions.
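Because older versions accept identifierKey but silently ignore it, a client-side check before relying on the identifier field may be useful. The sketch below encodes the two minimum versions stated above. The function names are hypothetical, and it assumes version strings are plain dotted numbers such as "1.3.0"; any other version format would need separate handling.

```python
# Minimum versions that honor identifierKey, taken from the text above.
MIN_IDENTIFIER_KEY_VERSION = {
    "splitting_light": (1, 3, 0),
    "splitting_performance": (1, 5, 0),
}


def parse_version(version):
    """Turn a dotted numeric string like '1.3.0' into a 3-part tuple of ints."""
    parts = [int(piece) for piece in version.split(".")]
    # Pad short versions such as '1.3' so tuple comparison behaves as expected.
    parts += [0] * (3 - len(parts))
    return tuple(parts)


def supports_identifier_key(processor, version):
    """Return True if this processor version honors identifierKey."""
    minimum = MIN_IDENTIFIER_KEY_VERSION.get(processor)
    if minimum is None:
        # Unknown processor: this page makes no claim, so be conservative.
        return False
    return parse_version(version) >= minimum


if __name__ == "__main__":
    for name, ver in [("splitting_light", "1.2.9"), ("splitting_performance", "1.5.0")]:
        print(name, ver, supports_identifier_key(name, ver))
```

Comparing integer tuples instead of raw strings avoids ordering mistakes such as treating "1.10.0" as older than "1.5.0".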