Google is expanding its Gemini platform with experimental modes, including the recently highlighted “gem pix” for image editing. New descriptions point to three primary modes. The first, Agent Mode, will feature autonomous exploration, planning, and execution, putting Gemini in competition with tools like ChatGPT. It is aimed at executing complex tasks rather than answering simple prompts.
The second mode, Gemini Go, is designed for collaborative ideation and may integrate with existing tools like Canvas for brainstorming. The third, Immersive View, promises visual answers to questions, potentially using image generation for dynamic explanations.
Not all of these modes will necessarily ship as separate features, but Agent Mode has its own icon, suggesting it will remain standalone. Google’s strategy appears aimed at transforming Gemini from a simple chat assistant into a comprehensive platform for creativity, research, and visual learning.