Google's Gemini 2.5 Computer Use model, now in preview, can operate web and mobile interfaces by browsing, clicking, and typing. It performs strongly in web and Android environments and is used for UI testing and workflow automation.
Google has launched Gemini 2.0, a new multimodal generative AI model designed to handle online tasks, with features such as native tool use and complex instruction following. The initial release, Gemini 2.0 Flash, is available to developers globally, with a broader rollout expected in early 2025. Gemini 2.0 is positioned to compete with AI products such as ChatGPT and GitHub Copilot, offering capabilities including text-to-speech and native image generation. Google is also testing Project Mariner, an experimental Chrome extension that can navigate web pages and carry out tasks on them, and Jules, a coding assistant. The company says it is implementing security measures to protect against potential cyber threats.