📤 Extract Invoice Data from PDF – Automated with AI & API

Need to automatically extract invoice data from PDF files and use it in structured workflows? Our 7-PDF Invoice Extractor with integrated FastAPI does exactly that – delivering structured invoice data as JSON or ZUGFeRD-compliant XML, ready for ERP, DMS, accounting, shops, or inventory systems.

Automatically extract invoice data from PDF

Your Benefits at a Glance

  • 🔍 Fully automated data extraction from PDF invoices
  • 🔄 Output as JSON or ZUGFeRD XML (DIN EN 16931)
  • 🧠 AI-powered field recognition
  • 🖼️ OCR (Tesseract) support for scanned PDFs
  • 🔐 REST API with token-based authentication via FastAPI

📄 What data is extracted?

The extraction is fully automated and includes:

  • ✔ Invoice number, date
  • ✔ Sender, recipient (VAT ID, IBAN, address)
  • ✔ Line items with quantity, price, tax rate
  • ✔ Totals: net, tax, gross

🌍 Multilingual PDF support

  • 🇩🇪 German
  • 🇬🇧 English
  • 🇵🇱 Polish
  • 🇫🇷 French
  • 🇮🇹 Italian
  • 🇪🇸 Spanish

Language is detected automatically – no setup required.

🚀 CURL example: send PDF & get JSON

curl -X POST https://generator.7-pdf.de/extract-invoice/ \
-H "accept: application/json" \
-H "Authorization: Bearer [[ T O K E N - C O D E ]]" \
-H "Content-Type: multipart/form-data" \
-F "file=@/path/to/invoice.pdf"

Response (JSON excerpt):

{
  "success": true,
  "invoice_number": "RE-2025-1003",
  "xml": "<?xml version="1.0" ?>\n<rsm:CrossIndustryInvoice ...",
  "validation_file": "passed",
  "validation_string": "passed"
}

⚙️ Easy integration into existing systems

  • 📤 Compatible with CURL / Postman / Python / PHP
  • 📁 Scan-to-folder with script automation
  • 📦 Pass to ERP, DATEV, inventory, or accounting tools

📚 Get started now – activate your API token

Activate API Access

Top