Selected:

Can large language models figure out the real world?
August 26, 2025
New test could help determine whether AI systems that make accurate predictions in one area understand it well enough to apply that ability to a different area.

Despite its impressive output, generative AI doesn’t have a coherent understanding of the world
November 6, 2024
Researchers show that even the best-performing large language models don’t form a true model of the world and its rules, and can thus fail unexpectedly on similar tasks.