Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Ingestion service

Capabilities

Performance

Additional costs

Image-based PDFs

Multi-Column Layouts

Complex Tables Detection

Image Content Extraction

On-Prem deploymentDeployment

Default

(error)

(error)

(error)

(error)

(tick)

10-15s per page

None

Docling

🟡

(tick)

🟡

(error)

(tick)

10-20s per page

Azure infra Costs

MDI

(tick)

(tick)

(tick)

(error)

(error)

10-20s per page

1.6 cents per page

MDI with Image Content Extraction

(tick)

(tick)

(tick)

(tick)

(error)

20-30s per page

3 cents per page

Assumption:

  • 1.6 cents for MDI

  • 1.4 cents for 5k tokens (vision model GPT4o) per image per page (assuming 1 image per page)

...