Google has introduced Gemini 2.5, a new family of AI reasoning models that pause to "think" before answering. The first model in the series, Gemini 2.5 Pro Experimental, leads several widely used benchmarks and shows marked gains in reasoning and coding.
Gemini 2.5 is designed to solve complex problems with greater accuracy. Reasoning here means more than classification and prediction: the model analyzes information, draws logical conclusions, and makes informed decisions. Gemini 2.5 builds on previous models and applies improved post-training techniques to strengthen that reasoning.
Gemini 2.5 Pro
Gemini 2.5 Pro Experimental is Google's most advanced model for complex tasks and tops the LMArena leaderboard by a significant margin. It performs strongly on coding, math, and science benchmarks. It is available now in Google AI Studio and the Gemini app and will soon be accessible on Vertex AI, with pricing details to follow.
Gemini 2.5 Pro achieves state-of-the-art results on advanced reasoning benchmarks without costly test-time techniques. It scored 18.8% on Humanity’s Last Exam, a dataset written by subject-matter experts to probe the frontier of human knowledge and reasoning; that score is for models running without tool use.
Advanced Coding
Coding performance has improved significantly, particularly in building web applications and in transforming and editing code. On SWE-Bench Verified, a standard benchmark for agentic coding, Gemini 2.5 Pro scored 63.8% with a custom agent setup. The model can also generate executable code from minimal prompts.
Building on the Best of Gemini
Gemini 2.5 retains the strengths of previous models, including native multimodality and a long context window. The current version has a 1-million-token context window, with plans to expand to 2 million tokens. This lets the model take in large datasets and work on complex problems that span multiple information sources.
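To give a feel for what a 1-million-token window means in practice, here is a minimal Python sketch that estimates whether a set of documents would fit. It is not based on any Gemini API. The 4-characters-per-token ratio is a rough rule of thumb for English text, and the function names and the `reserved_for_output` parameter are hypothetical, introduced only for illustration. Actual token counts depend on the model's tokenizer.

```python
# Context-window sizes stated in the article, in tokens.
CURRENT_WINDOW = 1_000_000
PLANNED_WINDOW = 2_000_000

# Assumption: about 4 characters per token for English text.
# This is a heuristic, not the model's real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate from character count, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_window(documents: list[str],
                   window: int = CURRENT_WINDOW,
                   reserved_for_output: int = 8_192) -> bool:
    """Return True if all documents plus an output reserve fit in the window.

    reserved_for_output is a hypothetical budget kept free for the reply.
    """
    total = sum(estimate_tokens(doc) for doc in documents)
    return total + reserved_for_output <= window
```

Under these assumptions, a 4-million-character corpus is estimated at 1 million tokens. With the output reserve, it would not fit the current window but would fit the planned 2-million-token one (`fits_in_window(docs, window=PLANNED_WINDOW)`).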
Developers can experiment with Gemini 2.5 Pro in Google AI Studio today, with Vertex AI availability to follow. Google encourages feedback as it continues to improve Gemini's capabilities.