Base model versions are the fundamental building blocks of our processing and parsing system. They control the underlying base configuration for each processor and parser, including critical components like:
These base configurations are essential in determining the overall performance, accuracy, and capabilities of our processing system across different use cases.
For parse processors specifically, the selected parse version controls the model versions and pipeline defaults used to transform files into structured document blocks. This includes the OCR engine, layout model, reading order model, and the VLMs used to process specific block types such as tables, key-value pairs, forms, charts, figures, formulas, and barcodes.
We employ a semantic versioning system for our base processors. This system allows us to communicate the nature and impact of changes clearly. Our version numbers follow the format: MAJOR.MINOR.PATCH
Major version updates are reserved for changes that can potentially affect accuracy, latency, or reliability across every use case. The expectation is that a major version update would have a guaranteed impact for all users of the processor, and any consumer should expect to see changes in their output when upgrading to a new major version.
These include changes such as:
Minor version updates are for changes that could cause potentially significant changes to specific use cases, but not necessarily all use cases.
Examples include:
signature type or enum typePatch updates are for changes that are not expected to meaningfully impact functionality in any way outside of very few isolated cases.
These might include:
Understanding our versioning system is crucial for developers and users of our base processors. It allows you to:
We strive to provide detailed changelogs for each version update, ensuring you have all the information needed to understand the changes and their potential impacts on your workflows.
If you have any questions or need further clarification on our versioning system, please reach out to us on Slack.