Extractors, classifiers, and splitters let you restrict which pages of the source document are read and billed.
Use either pageRanges or fixedPageLimit, preferring pageRanges, never both.
[] to process the full document (subject to global limits)Equivalent to pageRanges: [{ "start": 1, "end": 25 }].
All rules that apply to pageRanges also apply to the synthetic range generated from fixedPageLimit.
pageRanges (or the range derived from fixedPageLimit) is applied before chunking starts:
chunkingStrategy and pageChunkSize refer to the virtual page numbersExample:
Resulting chunks:
pageRanges over fixedPageLimitpageRanges for non-contiguous or trailing pages