WINNER: MathGPT breaks records, beats ChatGPT, Microsoft AI models
MathGPT has surpassed Microsoft's "ToRA 13B", the model previously ranked #1 in benchmarks assessing mathematical aptitude
Mathpresso, the maker of QANDA, the largest AI-driven learning platform in Asia, has revealed that its large language model, MathGPT, has beaten OpenAI and Microsoft models to set a new world record in maths.
According to Interesting Engineering, MathGPT has surpassed Microsoft's "ToRA 13B", the model that held the previous record, to rank #1 in benchmarks that assess mathematical aptitude, such as "MATH" (12,500 challenging math questions) and "GSM8K" (8,500 elementary school arithmetic problems).
MathGPT also outperformed OpenAI's GPT-4 on the MATH benchmark.
As part of a strategic cooperation with KT, QANDA and Upstage began developing MathGPT together in November of last year. QANDA supplied Upstage with learning data from 10 million searches a day, including learning level, context, and interaction.
KT also invested $8 million in Mathpresso in September of last year to support LLM development.
Upstage refined the natural-language-based model to support logical inference and trained it using its own specialised solution for preventing hallucinations.
ChatGPT, by contrast, is trained on large amounts of textual data rather than on domain-specific learning data such as expert knowledge. As a result, it is prone to hallucinations, in which it produces plausible-sounding responses that convey false information.