Documentation Index
Fetch the complete documentation index at: https://docs.unsiloed.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
The Document Splitting feature analyzes PDF pages, classifies them into predefined categories, and creates separate PDF files for each category. This is ideal for processing mixed document batches like scanned files containing invoices, contracts, and reports.Splitting jobs are processed asynchronously. Submit a splitting job and poll the status endpoint to retrieve results when complete.
How It Works
Our document splitting system uses advanced AI to:- Analyze Each Page: Extract text and visual features from every page
- Classify Content: Categorize pages based on document type and content
- Generate Confidence Scores: Provide accuracy metrics for each classification
- Create Separate Files: Split the original PDF into category-specific documents
- Package Results: Deliver all split documents in a convenient ZIP file
Supported Categories
You can split documents into various categories:- Business Documents: Invoices, receipts, purchase orders, contracts
- Financial Documents: Bank statements, financial reports, tax forms
- Legal Documents: Contracts, agreements, legal notices, compliance forms
- Healthcare Documents: Medical records, insurance forms, lab reports
- HR Documents: Resumes, employment forms, payroll documents
- Academic Documents: Research papers, reports, transcripts
API Usage
Submit Split Job
Check Split Job Status
Response Format
Job Creation Response
Job Status Response (Completed)
Response Fields
Top-Level Fields:job_id(string): Unique identifier for the split jobstatus(string): Job status - “processing”, “completed”, or “failed”progress(string): Current processing status messagefile_url(string): URL to the original uploaded filefile_name(string): Name of the original fileparameters(object): Job parameters including classes and category descriptionserror(string|null): Error message if job failedquota_remaining(number|null): Remaining API quota
success(boolean): Whether the split operation succeededmessage(string): Success/failure messagefiles(array): Array of generated split files
name(string): Name of the split file (category-based)path(string): Relative path to the filetype(string): File type (always “file”)fileId(string): Unique identifier for the filefull_path(string): Full download URL for the split fileconfidence_score(number): Confidence score for the classification (0-1)

