Technology

New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget

VentureBeat Jun 18, 2026 3h ago ⏱ 1 min read 👁 9 views

Image via VentureBeat

📋 Article Summary

204 words

Imagine your engineering team just deployed an AI agent to search through internal company documents and answer employee questions. It works perfectly in development, but in production, it consistently hallucinates or misses key constraints. Fixing this is rarely a simple… Imagine your engineering team just deployed an AI agent to search through internal company documents and answer employee questions. It works perfectly in development, but in production, it consistently hallucinates or misses key constraints. Fixing this is rarely a simple patch. It requires a tedious, trial-and-error process of tweaking chunking strategies, retrieval methods, and system prompts simultaneously. Because these adjustments are entangled, it becomes nearly impossible to attribute which specific tweak actually solved the problem. To address this challenge, researchers at Renmin University of China and Microsoft Research introduced Arbor, a framework that upgrades AI-driven research and optimization from a sequence of trial-and-error guesses into a cumulative learning process. Arbor organizes hypotheses, experiments, and insights into a tree that helps the system learn from prior failures to make smarter, verified improvements over time.In practical tests, Arbor delivered more than 2.5 times the verifiable performance gains of standard AI coding agents across real-world engineering tasks while operating under the same resource budget. For…

This is a summary. Read the complete story on VentureBeat below.

Full story on VentureBeat

Read Full Story →

🔗 Clicking will take you to venturebeat.com

New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget

How to turn off AI in your Google Docs

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

AWS enters the context layer race with a graph that learns from agents, not manual curation

Roelof Botha joins SpaceX’s board of directors