💨 Abstract

Meta's VP of generative AI, Ahmad Al-Dahle, denied a rumor that the company manipulated benchmarks to make its new AI models, Llama 4 Maverick and Llama 4 Scout, appear more capable than they are. The rumor arose from reports that the models perform poorly on certain tasks and from Meta's use of an experimental version of Maverick to achieve better benchmark scores.

Courtesy: techcrunch.com

Summarized by Einstein Beta 🤖
