For over 26 years, Google has pursued its mission to organize the world’s information and make it universally accessible and useful. This vision has driven continuous innovation, particularly in artificial intelligence, pushing the boundaries of what AI can achieve in organizing and interpreting information.
When Google unveiled Gemini 1.0 in December, it was a breakthrough—the first AI model built to be natively multimodal. Gemini 1.0 and its successor, Gemini 1.5, significantly advanced AI’s ability to process and understand information across text, video, images, audio, and code. Millions of developers leveraged these models, reshaping existing products and inspiring new ones like NotebookLM, celebrated for its utility and user appeal.
Yesterday, Google proudly launches Gemini 2.0, a revolutionary step forward in the AI landscape. Designed to usher in a new era of “agentic” AI, Gemini 2.0 features capabilities that extend beyond understanding information—it can take meaningful actions with user supervision, offering ground-breaking possibilities for AI-assisted tasks.
With advances in multimodality—including native image and audio output—and enhanced tool integration, Gemini 2.0 aims to realize the vision of a universal assistant. Developers and trusted testers can begin exploring its potential today, with its Flash experimental model now accessible to all Gemini users. Additionally, the newly launched “Deep Research” feature, leveraging advanced reasoning and long-context capabilities, enables users to tackle complex topics and compile detailed reports, setting a new standard for research assistance.
No product has been more impacted by AI than Google Search. AI Overviews, now serving 1 billion users, enable entirely new question formats, making them among the most popular features in Search. With Gemini 2.0’s advanced reasoning, these Overviews will soon address more intricate topics, including advanced math equations, multimodal queries, and coding challenges. Limited testing has begun, with broader availability expected early next year.
Gemini 2.0 is underpinned by Google’s decade-long investment in AI innovation. It’s powered entirely by Trillium, Google’s sixth-generation TPUs, now available for customer use. This foundational technology accelerates the training and inference processes, enabling developers to build with unmatched efficiency.
Gemini 2.0 is more than an AI model; it’s the cornerstone of an agentic revolution. This new wave of AI includes native capabilities for multimodal reasoning, long-context understanding, complex instruction following, and real-time tool use. These enhancements empower developers to build dynamic, interactive applications across various domains.
Google DeepMind’s research continues to explore agentic possibilities through ground-breaking prototypes:
- Project Astra: A universal AI assistant integrating multimodal understanding, enhanced memory, and real-world tool use for applications like Google Search and Maps. Trusted testers are already trialling this on Android and wearable devices.
- Project Mariner: A browser-focused research prototype showcasing the ability to reason and interact with web elements to perform complex tasks. This project, still in its infancy, demonstrates AI’s potential to assist in navigating digital environments safely.
- Jules: A developer-centric AI agent embedded within GitHub workflows, designed to tackle issues, plan, and execute coding tasks under supervision.
Google’s commitment to safety and ethics underpins every step of Gemini 2.0’s development. From internal reviews to AI-assisted red-teaming approaches, the focus remains on mitigating risks while enhancing reliability. Features like privacy controls and advanced prompt-injection defences are integral to these efforts, ensuring AI agents act responsibly and transparently.
The launch of Gemini 2.0 marks a pivotal moment in AI innovation, setting the stage for a future where intelligent agents assist across digital and physical realms. As these technologies evolve, Google remains steadfast in its mission to develop AI responsibly, making its transformative capabilities accessible to all.
Main Image: Google