Getting Started
What is Extend?
Extend is your platform for building, iterating on, evaluating, and deploying AI-powered document processing infrastructure. We support a variety of tasks like document extraction, splitting, classification, and parsing. These tasks are performed by Processors, which can then be orchestrated within Workflows to handle your document processing use cases.
What do you want to do?
Extract structured data from documents (Recommended)
→ Quick Start: Build your first extraction processor
Split multi-page documents into organized sections
→ Configure a splitter
Just parse documents into clean, structured text
→ Parse documents
Categorize documents into different types
→ Configure a classifier
Orchestrate multiple processors into complex document pipelines
→ Create a workflow
Test and measure how well your processors perform
→ Overview of evaluation sets
Coding with an agent?
Use these raw markdown files to seed coding assistants with Extend platform context:
How do I use this?
- Download one of these files and save it in your project root as CLAUDE.md or AGENTS.md.
- Run your coding agent from that same project root so it can discover the file.
- If your agent does not auto-load root guidance, explicitly reference the file in your prompt (for example: “Use ./CLAUDE.md for project instructions.”).
- For Codex/agent-style tools, keep AGENTS.md in the working directory root where commands are executed.
- For Claude Code, keep CLAUDE.md at repository root and start the session from that directory.
- In web LLM chats (for example ChatGPT, Claude, Gemini), upload CLAUDE.md or AGENTS.md as an attachment at the start of the conversation.
- If attachments are not available, paste the file content into your first prompt and ask the model to follow it for the rest of the session.
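If you would rather script the first step than copy the file by hand, the following is a minimal sketch of one way to do it. It is not part of Extend. The function names, the "claude" and "agents" keys, and any download path you pass in are assumptions made for illustration. Only the target file names CLAUDE.md and AGENTS.md come from the steps above.

```python
import shutil
from pathlib import Path

# Target file names come from the steps above.
# The dictionary keys are hypothetical labels chosen for this sketch.
TARGET_NAMES = {
    "claude": "CLAUDE.md",  # Claude Code
    "agents": "AGENTS.md",  # Codex and other agent-style tools
}


def install_agent_context(downloaded: Path, project_root: Path, agent: str = "claude") -> Path:
    """Copy a downloaded context file into the project root under the expected name."""
    if agent not in TARGET_NAMES:
        raise ValueError(f"unknown agent {agent!r}; expected one of {sorted(TARGET_NAMES)}")
    if not downloaded.is_file():
        raise FileNotFoundError(downloaded)
    project_root.mkdir(parents=True, exist_ok=True)
    target = project_root / TARGET_NAMES[agent]
    shutil.copyfile(downloaded, target)  # overwrites an existing file of the same name
    return target


def fallback_prompt(target: Path) -> str:
    """Build the explicit reference to use when the agent does not auto-load the file."""
    return f"Use ./{target.name} for project instructions."
```

For example, calling install_agent_context with the path of the downloaded file, Path("."), and "agents" would write ./AGENTS.md into the current directory. Passing the returned path to fallback_prompt gives a sentence you can paste into your first prompt if the agent does not pick the file up on its own.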

