OCR APIs

New

OCR is now split into dedicated endpoints and pages for PDF OCR and Image OCR.

What it does

Extract machine-readable text from scanned PDFs and images.

Control OCR behavior with `lang`, `dpi`, `psm`, and `oem` options.

Use in search indexing, invoice extraction, and document automation pipelines.

PDF OCR

POST /v1/pdf/ocr/parse

Input via file upload or public URL.

Image OCR

POST /v1/image/ocr/parse

Input via file upload, image URL, or base64 payload.

Quick cURL

curl -X POST https://pdfapihub.com/api/v1/pdf/ocr/parse \
  -H "CLIENT-API-KEY: your_api_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/scanned.pdf",
    "pages": "1-2",
    "lang": "eng",
    "dpi": 220
  }'

Explore OCR APIs

Use the dedicated pages for endpoint-level details and native sandbox testing.