Good Models Math - Search News

This DeepSeek demo shows how good the Chinese AI model is at math and reasoning

DeepSeek's AI models rival top Silicon Valley offerings, excelling in some complex tasks. The models use inference-time compute, breaking queries into smaller, manageable tasks. DeepSeek's DeepThink ...

NextBigFuture

AI Large Language Model Math Breakthroughs

AI large language models have been especially weak on math. There are now several papers from Google Deep Mind, Alibaba and other universities where AI large language models are at Math Olympiad ...

VentureBeat

Alibaba claims no. 1 spot in AI math models with Qwen2-Math

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now If you haven’t heard of “Qwen2” it’s ...

InfoQ

Alibaba Releases Two Open-Weight Language Models for Math and Voice Chat

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Dany Lepage discusses the architectural ...

TechRepublic

OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It?

A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of OpenAI’s claim about its gold-medal score. OpenAI’s latest model has achieved a gold-level score at the ...

Ars Technica

New study shows why simulated reasoning AI models don’t yet live up to their billing

There’s a curious contradiction at the heart of today’s most capable AI models that purport to “reason”: They can solve routine math problems with accuracy, yet when faced with formulating deeper ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

VentureBeat

Microsoft’s GRIN-MoE AI model takes on coding and math, beating competitors in key benchmarks

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Microsoft has unveiled a groundbreaking artificial intelligence model, ...

TechCrunch

AI models are starting to crack high-level math problems

Over the weekend, Neel Somani, who is a software engineer, former quant researcher, and a startup founder, was testing the math skills of OpenAI’s new model when he made an unexpected discovery. After ...

American Enterprise Institute

Why AI Struggles with Basic Math (and How That’s Changing)

Large Language Models (LLMs) have ushered in a new era of artificial intelligence (AI) demonstrating remarkable capabilities in language generation, translation, and reasoning. Yet, LLMs often stumble ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results