
when I heard leaked benchmark of @xAI 's Grok 4 scoring 45 % on HLE (Humanities Last Exam) 🤯
Market Brief
Daily market recaps with key events, stock movements, and global influences
46 posts • GPT (4.1 mini)
Published
More breaking stories on DeepNewz — updated live.
when I heard leaked benchmark of @xAI 's Grok 4 scoring 45 % on HLE (Humanities Last Exam) 🤯
Grok 4’s leaked benchmark just dropped a bombshell. The chart shows xAI’s model hitting a jaw-dropping 45% on HLE (Humanities Last Exam) — an exam so brutal it was designed to keep LLMs humble: •2,500 expert-written questions across 100+ disciplines •14 % multimodal (text +
the Grok 4 benchmark chart (leaked version) is just beautiful Did @xAI really hit 45% on HLE (Humanities Last Exam) 🤯 Because the HLE test is so hard. It (HLE) holds 2,500 expert-written questions spanning more than 100 subjects, including math, physics, computer science and