OCR APIs

    New

    OCR is now split into dedicated endpoints and pages for PDF OCR and Image OCR.

    What it does

    Extract machine-readable text from scanned PDFs and images.

    Control OCR behavior with `lang`, `dpi`, `psm`, and `oem` options.

    Use it in search indexing, invoice extraction, and document automation pipelines.

    PDF OCR

    POST /v1/pdf/ocr/parse

    Input via file upload or public URL.

    Image OCR

    POST /v1/image/ocr/parse

    Input via file upload, image URL, or base64 payload.
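For the base64 input path, a minimal Python sketch of preparing the request body might look like the following. This page does not name the JSON field that carries the base64 data, so the helper takes it as the `base64_field` argument. Both the helper `build_image_ocr_payload` and the idea that `lang` is sent alongside the image are assumptions, to be checked against the Image OCR page.

```python
import base64


def build_image_ocr_payload(image_bytes, base64_field, lang="eng"):
    """Return a JSON-serializable body for POST /v1/image/ocr/parse.

    base64_field is the name of the field that carries the encoded image.
    It is NOT documented on this page, so the caller must supply it.
    """
    # Base64-encode the raw image bytes and convert to str for JSON.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    # Assumption: OCR options such as lang sit next to the image field.
    return {base64_field: encoded, "lang": lang}
```

Taking the field name as an argument keeps the sketch from guessing at the real schema; once the documented name is known, it can become a constant.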

    Quick cURL

    curl -X POST https://pdfapihub.com/api/v1/pdf/ocr/parse \
      -H "CLIENT-API-KEY: your_api_key_here" \
      -H "Content-Type: application/json" \
      -d '{
        "url": "https://example.com/scanned.pdf",
        "pages": "1-2",
        "lang": "eng",
        "dpi": 220
      }'
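The same request can be sketched in Python with only the standard library. The URL, header names, and body fields are copied from the cURL example above; the helper name `build_pdf_ocr_request` is illustrative, and the response format is not described on this page, so the sketch stops at building the request.

```python
import json
import urllib.request

# Endpoint taken from the cURL example above.
API_URL = "https://pdfapihub.com/api/v1/pdf/ocr/parse"


def build_pdf_ocr_request(api_key, pdf_url, pages="1-2", lang="eng", dpi=220):
    """Build (but do not send) a PDF OCR request mirroring the cURL example."""
    payload = {"url": pdf_url, "pages": pages, "lang": lang, "dpi": dpi}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "CLIENT-API-KEY": api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


# To send it (needs network access and a valid key):
# req = build_pdf_ocr_request("your_api_key_here", "https://example.com/scanned.pdf")
# with urllib.request.urlopen(req, timeout=60) as resp:
#     print(resp.read().decode("utf-8"))
```

Separating request construction from sending makes the payload easy to inspect or log before any OCR work is triggered.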

    Explore OCR APIs

    See the dedicated PDF OCR and Image OCR pages for endpoint-level details and native sandbox testing.