#document AI

All articles tagged with "document AI"

Benchmark Finds AI Systems Often Answer Correctly but Cite the Wrong Evidence

A new benchmark called CiteVQA shows that leading AI models frequently give accurate document answers while failing to identify the actual supporting passage, a gap researchers call attribution hallucination.

Key Takeaways

CiteVQA measures both answer correctness and citation correctness in long documents.
A correct answer with a wrong citation receives no credit under the benchmark’s strict metric.

DT Editorial Team·May 25, 2026·via the-decoder.com

ByteDance Study Finds Long-Document AI Learns Better From Questions Than From Transcribing Text

Researchers from ByteDance Seed and HKUST report that question-answer training improved long-document performance in multimodal models, while pure text-recognition training actually made results worse.

Key Takeaways

Researchers compared OCR-style training with question-answer supervision for long documents.
The study reports that pure text-recognition training worsened performance.

DT Editorial Team·May 25, 2026·via the-decoder.com