This is disappointing. Purposefully underselling what models can do is a really bad idea. It is possible to point out that AI is flawed without saying it can't do math or count - it just isn't true.
People need to be realistic about the capabilities of models to make good decisions.

While it’s true one can elicit poor performance on basic math questions from frontier models like GPT-5, IMO this kind of thing (in NYTimes) is likely to mislead readers about their math capabilities.

Daniel Litt

Twitter

I think the urge to criticize companies for hype blends into a desire to deeply undersell what models are capable of.
Cherry-picking errors is a good way of showing odd limitations to an overenthusiastic X crowd, but not a good way of making people aware that AI is a real factor.

A problem is that X discussions about AI are often really discussions about the timeline & approaches to AGI, something that is implicitly understood here.
The broader public doesn't get that context, and instead assumes the discussion is about whether AI is going to matter or not.