advertisement
Google Begins Experimentation Of Gemini 2.0 Flash
Google has announced the launch of Gemini 2.0 Flash and its associated research prototype. The tech giant believes that this is a step forward in the field of artificial intelligence as the new model is designed for developers and showcases several experimental agent-based applications.
Building upon the success of its predecessor, 1.5 Flash, Gemini 2.0 Flash offers enhanced performance with similarly rapid response times. Notably, it outperforms 1.5 Pro on key benchmarks while operating at twice the speed. Beyond speed improvements, 2.0 Flash introduces new capabilities, including support for multimodal inputs like images, video, and audio, as well as multimodal outputs such as generated images and multilingual text-to-speech audio. It also integrates with external tools like Google Search and code execution environments.
This model is currently available to developers in an experimental phase through the Gemini API in Google AI Studio and Vertex AI, with broader availability planned for January.
advertisement
Beyond the core model improvements, the announcement highlights the development of AI agents. These agents represent a new approach to interacting with technology, focusing on task completion and proactive assistance. Several research prototypes demonstrate this concept:
- Project Astra: This project explores the concept of a universal AI assistant capable of understanding multiple languages, utilizing tools like Google Search and Maps, and maintaining context over longer conversations. It has seen improvements in dialogue, tool use, memory, and latency.
- Project Mariner: This prototype focuses on browser-based agent interaction. It aims to understand and interact with web content, including text, images, and code, to complete tasks within the browser.
- Jules: This is an AI-powered code agent designed to assist developers within a GitHub workflow. It can analyze issues, develop plans, and execute code under developer supervision.
- Gaming Agents: These agents are designed to interact with video games, offering real-time suggestions and utilizing Google Search for game-related information. There’s also exploration of Gemini 2.0’s spatial reasoning for robotics applications.
A strong emphasis is placed on responsible AI development. The development team is actively addressing safety and security concerns through various measures, including internal reviews, red teaming, safety training, and collaboration with external experts. Specific examples include mitigations against unintentional data sharing in Project Astra and protection against prompt injection in Project Mariner.
The release of Gemini 2.0 Flash and the associated agent prototypes marks an important advancement in AI. The focus on performance, multimodality, and agent-based interaction, combined with a commitment to responsible development, positions Gemini as a key player in the ongoing evolution of artificial intelligence.