TL;DR

This study evaluates the mathematical problem-solving abilities of the o1-preview and gpt-4o models on Korean College Scholastic Ability Test questions and compares their performance with that of human learners.

Evaluating Mathematical Problem-Solving Abilities of Generative AI Models: Performance Analysis of o1-preview and gpt-4o Using the Korean College Scholastic Ability Test

Sejun Oh
https://doi.org/10.1109/ACCESS.2024.3523703
Volume 13

This study used questions from the Korean College Scholastic Ability Test to evaluate the mathematical problem-solving abilities of the latest generative AI models, o1-preview and gpt-4o. The models were tested on 92 questions from the mathematics sections of the 2023 and 2024 tests, and their performance was compared with that of human learners. The results showed that the o1-preview...
