Testing Grok 4 On Math

Hosted on MSN

Grok 4.2 tops logic tests over Claude and ChatGPT

A new OmniCalculator report finds xAI’s Grok 4.2 outperforming both Claude and ChatGPT in logic and problem-solving, cutting answer instability to nearly half of legacy models. Claude 4.6 still leads ...

Bleeping Computer

xAI is testing Grok 4.20 to take on GPT-5, may launch this month

Elon Musk-owned xAI is testing Grok 4.20, a new model update to Grok 4, which already competes with GPT-5 in some benchmarks, such as ARC-AGI 2. GPT-5 is one of the best models for coding, and it ...

Hosted on MSN

Grok tops logic tests as Claude leads in writing

New benchmarks from OmniCalculator show Grok 4.2 outperforming rivals in logic and problem-solving, while Claude 4.6 excels in writing quality and tone. ChatGPT remains the most popular AI chatbot ...

NextBigFuture

XAI Grok 4 is Third on AI Leaderboard

XAI Grok 4 ranks third on the LMarena leaderboard. Google Gemini and OpenAI O3 rank ahead of Grok 4. Grok-4 was tested with real-world prompts across domains like coding, math, as well as creative ...

NextBigFuture

XAI Grok 4.20 and OpenAI GPT 5.2 Are Solving Significant Previously Unsolved Math Proofs

A Mathematician with early access to XAI Grok 4.20, found a new Bellman function for one of the problems he had been working on with my student N. Alpay. Not an Erdős problem, but original research.

SiliconANGLE

Elon Musk’s xAI sets AI benchmark records with new reasoning-optimized Grok 4 model

Elon Musk’s xAI Holdings Corp. has debuted a new large language model, Grok 4, that’s optimized for reasoning tasks such as generating code. The LLM’s late Wednesday launch followed a turbulent week ...

Geeky Gadgets

Grok 4.2 vs Gemini 3.0 : Speedier Code, Video Smarts & Improved Reasoning

Is Grok 4.2 the most intelligent coding model we’ve seen yet? With its release in January 2026, this AI powerhouse has already sparked conversations across the tech world. In this comparison, World of ...

Geeky Gadgets

Grok 4.2 Quiet Trials Show Sharper UI, Cleaner Code, Plus Playable Games

What if the future of AI could not only dream up stunning web designs but also code them into reality with unmatched precision? In this overview, Universe of AI explores how Grok 4.2, codenamed ...

Ars Technica

Musk’s Grok 4 launches one day after chatbot generated Hitler praise on X

On Wednesday night, Elon Musk unveiled xAI’s latest flagship models Grok 4 and Grok 4 Heavy via livestream, just one day after the company’s Grok chatbot began generating outputs that featured ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results