🔴 LIVE — Updated every 10 minutes
👤 -- reading now 🌡 Nairobi
Breaking
HomeTechnologyOn-device AI agents hit a hard…
Technology

On-device AI agents hit a hard memory limit. Apple’s new architecture routes around it.

VentureBeat Jun 9, 2026 4h ago ⏱ 1 min read 👁 4 views
On-device AI agents hit a hard memory limit. Apple’s new architecture routes around it.
Image via VentureBeat
📋 Article Summary
201 words
On-device AI models have stayed small because the entire weight set has to live in DRAM, capping practical parameter counts well below what server-side deployments use. Enterprise architects evaluating agentic workloads have had to choose between capable cloud-dependent models and… On-device AI models have stayed small because the entire weight set has to live in DRAM, capping practical parameter counts well below what server-side deployments use. Enterprise architects evaluating agentic workloads have had to choose between capable cloud-dependent models and limited on-device ones. Apple's third-generation foundation models, announced at WWDC26, break that constraint by moving the weight set off DRAM entirely.The AFM 3 family was developed in collaboration with Google and spans five models: two on-device and three server-based, all running within Apple's Private Cloud Compute boundary. The server-side models, including AFM 3 Cloud Pro for agentic tool use and complex reasoning, run on Nvidia GPUs in Google Cloud. The on-device architecture is Apple's own. AFM 3 Core Advanced is a 20-billion-parameter model that stores weights in NAND flash rather than DRAM."Instead of forcing the entire model into DRAM, the full model is stored in flash memory," Apple's research team wrote. "Because NAND-to-DRAM bandwidth is too slow to swap weights…
Continue Reading
Full story on VentureBeat
Read Full Story →
🔗 Clicking will take you to venturebeat.com
Share this story: WhatsApp X/Twitter Facebook
👁 People Also Read
KE
Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information
Technology

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

A joint research collaboration between researchers at the University of Illinois at Urbana-Champaign (UIUC), UC Berkeley, and the open source…

Read
Following Anthropic, OpenAI files confidentially for IPO
Technology

Following Anthropic, OpenAI files confidentially for IPO

ChatGPT-maker OpenAI has filed confidentially for an initial public offering, the company said Monday in a blog post. The filing…

Read
Apple’s WWDC AI demos looked more real after $250M false ad settlement
Technology

Apple’s WWDC AI demos looked more real after $250M false ad settlement

The vibe of Apple's 2026 WWDC keynote felt like a spouse proudly listing all the honey-do-list items tackled. One subtle…

Read
KE
Mercor’s Brendan Foody calls out Sequoia over ‘dual-pricing’ valuation tricks
Technology

Mercor’s Brendan Foody calls out Sequoia over ‘dual-pricing’ valuation tricks

Sequoia is just one of the top firms that sells same equity at two different prices.

Read