Overview
Unsiloed AI provides a powerful API for processing unstructured documents. You can:- Parse documents into structured Markdown and JSON
- Extract data using custom schemas
- Classify documents by type
- Split multi-document files into separate documents
Prerequisites
Before you begin, you’ll need:- An Unsiloed AI account and API key
- A document to process (PDF, DOCX, PPTX, image, etc.)
- Python 3.7+ or Node.js 14+ (optional, for SDK usage)
Step 1: Get Your API Key
To get API access, sign up on Unsiloed AI. We’ll get you set up with an API key and help you get started.Keep your API key secure and never commit it to version control. Use environment variables to store it.
Step 2: Install the SDK (Optional)
We provide official SDKs for Python and JavaScript/TypeScript. You can also use the REST API directly.Step 3: Parse Your First Document
Choose your preferred language and run the example below:Step 4: Extract Structured Data
To extract specific fields from your document, define a JSON schema:Understanding the Response
Parsing Response
The parsing API returns structured chunks with markdown, segments, and metadata:Parsing Response Structure
Parsing Response Structure
Extraction Response
The extraction API returns extracted fields with confidence scores and bounding boxes:Extraction Response Structure
Extraction Response Structure
- Parsing: Returns chunks with markdown, HTML, segments, and layout information
- Extraction: Returns structured fields with confidence scores and precise bounding boxes
- Bounding boxes: Pixel-level coordinates for locating data in the original document
- Confidence scores: Model confidence (0-1) for each extracted field
- Page references: Page numbers where each field was found
Next Steps
Document Processing
Learn about advanced parsing options and strategies
Data Extraction
Deep dive into schema-based extraction
Classification
Classify documents by type
API Reference
Explore the complete API documentation
Common Use Cases
RAG & Semantic Search
RAG & Semantic Search
Parse documents into clean, hierarchical Markdown chunks optimized for embedding and retrieval. Preserve layout, tables, and images for accurate context.
Invoice Processing
Invoice Processing
Extract structured data from invoices with citations and confidence scores for validation workflows.
Contract Analysis
Contract Analysis
Parse legal documents while preserving structure, then extract key clauses and dates.
Document Classification & Routing
Document Classification & Routing
Classify documents to route them to appropriate processing pipelines.
API Base URL
All API requests should be made to:api-key header.

