Google (GOOGL) has unveiled its Gemini 2.5 Computer Use model, a specialized model built on Gemini 2.5 Pro. It lets AI agents interact directly with websites and applications, and developers can access it through the Gemini API in Google AI Studio and Vertex AI. Instead of calling structured APIs, the model operates the user interface itself: it clicks, types, scrolls, and fills in forms much as a person would. It is designed primarily for web browsers but also shows promise for mobile applications.
The model takes the user's request together with a screenshot of the current screen and a history of recent actions, then generates the next action to perform, such as a button click or a form entry. To mitigate risks, Google has implemented safety measures, including checks on each action before it is carried out and controls that developers can configure. Teams at Google have already used the model for software testing, which has reportedly sped up that work considerably. The Gemini 2.5 Computer Use model is now in public preview, and developers can try it through demos and tools. Analysts maintain a Strong Buy consensus on GOOGL stock, suggesting a positive outlook.
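The cycle described above (request, screenshot, and action history in; one UI action out, checked before it runs) can be illustrated with a minimal Python sketch. This is not the Gemini API: every name here (`Action`, `run_agent`, `propose_action`, `is_allowed`, and the scripted `fake_model`) is a hypothetical stand-in, and the model call is replaced by a stub so the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """One UI action proposed by the model (hypothetical shape, not the real API)."""
    kind: str         # e.g. "click", "type", "scroll", or "done"
    target: str = ""  # description of the element to act on
    text: str = ""    # text to enter, if any


def run_agent(
    request: str,
    take_screenshot: Callable[[], bytes],
    propose_action: Callable[[str, bytes, List[Action]], Action],
    is_allowed: Callable[[Action], bool],
    execute: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Repeat: observe the screen, ask for an action, check it, then run it."""
    history: List[Action] = []
    for _ in range(max_steps):
        screenshot = take_screenshot()
        # Assumed stand-in for the model call: request + screenshot + history in,
        # one proposed action out.
        action = propose_action(request, screenshot, history)
        if action.kind == "done":
            break
        # Pre-action safety check: stop before executing anything disallowed.
        if not is_allowed(action):
            break
        execute(action)
        history.append(action)
    return history


def demo() -> List[Action]:
    """Drive the loop with a scripted fake model instead of a real one."""
    script = [
        Action("click", "search box"),
        Action("type", "search box", "gemini"),
        Action("done"),
    ]

    def fake_model(request: str, screenshot: bytes, history: List[Action]) -> Action:
        # The next scripted step is selected by how many actions have run so far.
        return script[len(history)]

    return run_agent(
        request="Search for gemini",
        take_screenshot=lambda: b"",             # placeholder screenshot bytes
        propose_action=fake_model,
        is_allowed=lambda a: a.kind != "purchase",
        execute=lambda a: None,                  # a real agent would drive a browser here
    )


if __name__ == "__main__":
    for step in demo():
        print(step)
```

In a real integration, `propose_action` would wrap a call to the model and `execute` would drive a browser; the sketch only shows how the action history feeds back into each new request and where a safety check can sit before execution.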