OpenAI

OpenAI launches o3 and o4-mini AI reasoning models with enhanced capabilities

April 17, 2025 at 4:57:12 AM

TL;DR OpenAI has launched two new AI reasoning models, o3 and o4-mini, enhancing ChatGPT's capabilities. The o3 model excels in complex queries and visual tasks, making fewer errors than o1, and is strong in programming. The o4-mini model is smaller and optimized for cost-effective reasoning, performing well in math and coding. Both models improve instruction following and provide more personalized responses, aiming to compete with other AI companies.

OpenAI launches o3 and o4-mini AI reasoning models with enhanced capabilities

OpenAI has launched two new AI reasoning models, o3 and o4-mini, which are designed to enhance ChatGPT's capabilities significantly. These models are trained to think longer before responding and can utilize all available tools within ChatGPT, including web searches, file analysis with Python, visual reasoning, and image generation. This advancement allows them to produce detailed answers efficiently, typically within a minute, making them adept at solving complex, multi-faceted problems.

Key Features of OpenAI o3

OpenAI o3 is the most powerful model in this release, excelling in areas such as coding, math, science, and visual perception. It sets new state-of-the-art (SOTA) benchmarks on various tests, including Codeforces and SWE-bench. Notably, o3 reduces major errors by 20% compared to its predecessor, o1, particularly in programming, business, and creative ideation. Early testers praised its analytical capabilities and its ability to generate and evaluate hypotheses in fields like biology and engineering.

Key Features of OpenAI o4-mini

OpenAI o4-mini is a smaller, cost-effective model that delivers impressive performance in math, coding, and visual tasks. It is recognized as the best-performing model on AIME 2024 and 2025 benchmarks and outperforms o3-mini in non-STEM tasks and data science. Its efficiency allows for higher usage limits, making it suitable for high-volume queries that require reasoning.

Improvements and Competitive Landscape

Both models have been rated by external experts as having better instruction-following capabilities and providing more useful, verifiable responses than previous versions. They also feature enhanced conversational abilities, referencing memory and past interactions for personalized responses. This release is part of OpenAI's strategy to maintain a competitive edge against other AI companies like Google, Meta, and Anthropic, as the field increasingly focuses on reasoning models to enhance performance.