Explore the key novelties and announcements from Google I/O 2024:
Gemini 1.5 Pro
The upgraded generative AI, Gemini 1.5 Pro, can now analyze longer documents, codebases, videos, and audio recordings. Revealed in a private preview, it can handle up to 2 million tokens, doubling its previous capacity and becoming the largest input model available commercially.
Gemini Live
Gemini Live allows users to have "in-depth" voice chats with Gemini on their smartphones. Users can interrupt and ask questions, with Gemini adapting to their speech in real time. It can also respond to surroundings via photos or videos from smartphone cameras.
Gemini on Android
Gemini will soon replace Google Assistant on Android, offering deep integration with the mobile OS and Google apps. Users will be able to drag and drop AI-generated images into Gmail, Google Messages, and other apps. For YouTube will feature "Ask this video" to extract specific information from videos.
Gemini Nano
Google is integrating Gemini Nano, its smallest AI model, into the Chrome desktop client starting with Chrome 126. This will enable developers to leverage the on-device model for their own AI applications.
Veo
Veo is an AI model that creates 1080p video clips up to one minute long from a text prompt. It can produce various visual and cinematic styles, including landscapes and time lapses, and edit pre-existing footage.
Firebase Genkit
The new Firebase Genkit is designed to simplify the development of AI-powered applications in JavaScript/TypeScript, with Go support on the way. This open-source framework, under the Apache 2.0 license, helps developers quickly integrate AI into new and existing projects.
Ask Photos
Launching later this summer, the experimental "Ask Photos" feature in Google Photos, powered by Gemini AI, will allow users to search their photo collections using natural language queries.