🔴 LIVE — Updated every 10 minutes
👤 -- reading now 🌡 Nairobi
Breaking
HomeTechnologyPixelRAG beats text parsers on accuracy…
Technology

PixelRAG beats text parsers on accuracy and cuts AI agent token costs 10x

VentureBeat Jun 12, 2026 5h ago ⏱ 1 min read 👁 3 views
PixelRAG beats text parsers on accuracy and cuts AI agent token costs 10x
Image via VentureBeat
📋 Article Summary
201 words
Most enterprise RAG pipelines start the same way: a text parser converts web pages and documents into plain text so they can be chunked and indexed for retrieval. That conversion step destroys retrieval signals — and according to new research,… Most enterprise RAG pipelines start the same way: a text parser converts web pages and documents into plain text so they can be chunked and indexed for retrieval. That conversion step destroys retrieval signals — and according to new research, it's responsible for the majority of wrong answers.A research team from UC Berkeley, Princeton University, EPFL and Databricks published a paper this week introducing PixelRAG, a system that skips that conversion entirely. Instead of parsing pages into text, PixelRAG renders them as screenshots, indexes those images and feeds retrieved tiles directly to a vision-language model reader. Tested across 30 million screenshot tiles covering all of Wikipedia, it outperforms text-based RAG across six benchmarks, improving accuracy by up to 18.1% over text-based baselines.Parsers are the wrong place to look for fixes, according to the research team."Improving parsers is an endless process because every website requires special handling," Yichuan Wang, lead author and UC Berkeley doctorate student, told VentureBeat.  "Our goal was…
Continue Reading
Full story on VentureBeat
Read Full Story →
🔗 Clicking will take you to venturebeat.com
Share this story: WhatsApp X/Twitter Facebook
👁 People Also Read
Theker just raised $85M to build the factory robot that doesn’t specialize in anything
Technology

Theker just raised $85M to build the factory robot that doesn’t specialize in anything

Unlike humanoid robots designed around a fixed form — think Boston Dynamics — Theker's machines are built to be reconfigured.

Read
Oracle warns of security bug that hackers abused to breach 100+ companies
Technology

Oracle warns of security bug that hackers abused to breach 100+ companies

The tech giant warned of a security flaw that a cybercrime gang said it's exploiting as part of a mass-hacking…

Read
Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing
Technology

Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing

The decision comes as India emerges as the world’s largest GCC market.

Read
KE
Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights
Technology

Microsoft’s open-source SkillOpt automatically upgrades AI agent skills without touching model weights

Agent skills have become an important part of real-world AI applications, providing a mechanism — a set of instructions saved…

Read