Author: Carl Franzen
Google’s native multimodal AI image generation in Gemini 2.0 Flash impresses with fast edits, style transfers
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Google’s latest open source AI model Gemma 3 isn’t the only big news from the Alphabet subsidiary today. No, in fact, the spotlight may have been stolen by Google’s Gemini 2.0 Flash with native image generation, a new experimental model available for free to users of Google AI Studio and to developers through Google’s Gemini API. It marks the first time a major U.S. tech company has shipped multimodal image generation directly within a model to consumers. Most other AI image…
OpenAI unveils Responses API, open source Agents SDK, letting developers build their own Deep Research and Operator
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI is rolling out a new suite of APIs and tools designed to help developers and enterprises build AI-powered agents more efficiently atop some of the very same technology powering its own first-party AI agents Deep Research (which scours the internet independently to develop richly researched, well organized and cited reports) and Operator (its tool for controlling a web browser cursor autonomously based on a user’s text instructions and performing actions like finding sports tickets or making reservations). Now, with access…