Home Learn About Book a call
Learn · Technology

AI vs OCR for logistics documents

AI and OCR look similar on a vendor slide — both take a PDF in and return data out. In logistics operations the two behave very differently, and the difference is the reason many document-automation projects quietly failed in 2018-2022 and are being re-done in 2024-2026. This guide explains where OCR still fits, where it breaks on real shipping documents, and what "document AI" actually changes.

The short version

OCR reads characters. AI reads meaning. Running OCR on a Bill of Lading gives you a page of words in reading order. Running a document AI on the same B/L gives you { shipper, consignee, containers[], incoterms, freight_terms } — each field correctly labelled even when the carrier's form has no matching label on the page. OCR is useful, but it is a component, not a product. The complete solution needs the AI layer on top.

Why the distinction matters in logistics specifically

Logistics documents are the worst-case scenario for traditional OCR and the best-case scenario for document AI. Five features compound:

What each technology is good at

OCR is good at
Fixed-layout forms, printed text, standard fonts. Producing low-cost text from scans at high volume. Being a building block inside a larger pipeline. Historical archive digitisation.
OCR is bad at
Variable layouts, handwriting, language-dependent fields, reasoning across fields, detecting missing data, understanding abbreviations or domain jargon, resolving entities ("MSC" → Mediterranean Shipping Company).
Document AI is good at
Variable layouts, multilingual content, handwriting, cross-document reasoning, detecting missing fields, correcting small errors in context, choosing the right action when the document is ambiguous.
Document AI is bad at
Explaining why it chose an answer without help. Being cheap on very high volume simple extractions (OCR+rules can still win on cost for stable templates).

How they handle a real B/L — worked example

Imagine an MSC Bill of Lading scanned to PDF. The shipper block is top-left, the consignee block is top-right, the container list is a table in the middle, freight terms are in the bottom panel.

OCR returns: a flat text blob with all words in roughly reading order. "SHIPPER / EXPORTER NEDLLOYD MACHINERY BV BLAAK 16 ROTTERDAM NETHERLANDS CONSIGNEE VDR EXP 5501 BUSAN KOREA PORT OF LOADING ROTTERDAM PORT OF DISCHARGE BUSAN..." — you get every character but no structure.

Document AI returns:

{
  "carrier": "MSC",
  "mbl_number": "MEDUH1234567",
  "shipper": {
    "name": "NEDLLOYD MACHINERY BV",
    "address": "Blaak 16, 3011 TA Rotterdam, Netherlands"
  },
  "consignee": {
    "name": "VDR EXP 5501",
    "address": "Busan, Korea"
  },
  "pol": "NLRTM",
  "pod": "KRPUS",
  "containers": [
    { "number": "MSKU1234567", "type": "40HC", "seal": "FX889142" }
  ],
  "incoterms": "CIF BUSAN",
  "freight_terms": "prepaid"
}

The structured output is what the TMS actually wants. Getting from OCR output to the structured output — with acceptable accuracy on 24 different carrier templates — is exactly the problem that document AI solves and that rule-based OCR cannot.

Why so many OCR logistics projects failed 2018-2022

Between 2018 and 2022 many forwarders bought OCR-based "document automation" products. A typical story: a pilot on MSC and Maersk shows 90% field accuracy, project goes into production, the forwarder adds CMA CGM — new template, rules break, accuracy drops to 60%, ops team re-keys everything. A year later the tool is a shelfware line item on the IT budget.

The technical reason: OCR plus rules is fragile to layout. Every new template needed a new rule set. Vendor support queues backed up. Maintenance of the rule set became a full-time job nobody budgeted for.

The organisational reason: the product was sold as "automation" but delivered "partial automation requiring constant rule updates". The promise-to-delivery gap drove internal champions out and killed budgets.

What changed in 2023-2026

Two technical shifts changed the maths. First, large language models (GPT-4 class and beyond) became good enough to reason about document layout and semantics in a single pass, without per-template rules. Second, vision-language models (Claude Vision, GPT-4V, Gemini Vision and open alternatives) started handling scanned PDFs end-to-end — no separate OCR step needed for most cases. What used to require a team of engineers maintaining rules for 24 carrier templates now requires a prompt, a schema and test data.

The commercial shift: forwarders who tried OCR and got burned are now the fastest to adopt AI, because they have a concrete baseline to measure against. AI projects in 2026 tend to run pilots against the OCR tool that failed, and the lift is usually visible in two weeks.

Does AI entirely replace OCR?

No. OCR lives on inside modern pipelines for three reasons:

Practical builders combine the two: fast OCR for the routine cases, AI for the variable and hard ones, confidence thresholds deciding which pipeline handles a given document. The vendor marketing term for this is "hybrid extraction"; the reality is just "pick the cheaper tool that works for each case".

The evaluation playbook — how to compare

When assessing a document-extraction vendor, ask for five numbers on your own documents (not the vendor's curated sample):

  1. Field-level accuracy per critical field (shipper, consignee, container, Incoterm, freight terms). Aggregate document accuracy hides gaps.
  2. Straight-through-processing rate: what percentage of documents need zero human touch.
  3. Latency per document, p50 and p95. Ops cares as much about p95 as average.
  4. New-template onboarding time: how long from "first MSC B/L" to "90% accuracy on MSC B/Ls".
  5. Maintenance cost: who fixes the rules/prompts when accuracy drops, and what does it cost.

If a vendor can't give you these five numbers on a real sample, you are buying a brochure, not a product.

Where Logentic fits

Logentic uses document AI (Claude-class vision-language models) fronted by lightweight OCR where it helps. More importantly: extraction is only the first step. Our agent Alex then validates the extracted fields against the booking, fills the shipment in the TMS, surfaces exceptions to the human operator, and closes the loop with the customer. OCR vendors sell extraction; Logentic sells "the task is done". The distinction is the point.

Related reading

What is a Bill of Lading? · CMR waybill guide · AI Transport Management System · European freight forwarders: the AI gap

Frequently asked questions

Is document AI the same as "IDP" (Intelligent Document Processing)?

IDP is an older umbrella term that historically meant "OCR plus rules plus some machine learning for classification". Modern document AI built on vision-language models is a step-change above what the IDP label used to mean. Some incumbent IDP vendors have added LLM capabilities; others still sell the 2018-era stack with an AI-washed badge.

Can AI read a bad scan better than a human?

On average, yes, for routine fields. Humans still edge out AI on unusual handwriting and heavily damaged scans. The practical result: AI handles the ~95% of documents that are fine, humans handle the 5% that aren't. Throughput per human operator rises sharply.

Does AI get confused by two-column PDFs?

Modern vision models handle multi-column layouts well. Traditional OCR often misread two-column layouts by running the text left-to-right across both columns, which scrambled the output.

How do I start comparing OCR and AI vendors for my forwarder?

Pick 20 real documents representative of your traffic (mix of carriers, templates, languages, including some bad scans). Send them to each vendor with the same schema and measure the five metrics above. Two weeks of effort saves 12 months of the wrong procurement.

See AI extraction on your documents

Send 3 of your real Bills of Lading or CMRs; we run them through Logentic live. Book a 45-minute demo.

Book a free call →