OpenAI and Google outdo the mathletes, but not each other | TechCrunch
AI models from OpenAI and Google DeepMind achieved gold-medal scores at the 2025 International Math Olympiad, illustrating both rapid advances in AI and how evenly matched the two companies are.
LLMs bow to pressure, changing answers when challenged: DeepMind study
We show that LLMs - Gemma 3, GPT-4o and o1-preview - exhibit a pronounced choice-supportive bias that reinforces and boosts their estimate of confidence in their answer, resulting in a marked resistance to changing their mind.
xAI launches Grok 3 AI, claiming it is capable of 'human reasoning'
xAI launched its Grok 3 models, which it says offer significantly enhanced capabilities and reasoning features intended to improve user experience and research efficiency.