The Parrot Can Talk AND Draw: Multimodal is here to stay
In all the chatter about 4o's new image mode, you might have missed Gemini 2.5 dropping plus 4o's new advanced voice mode—the unifying thread is multimodal, and it's a big deal!
Breathe. A new day, several new AI updates to keep up with!
Today, I want to break down two major developments quickly: Google’s Gemini 2.5 and the newly improved voice mode in GPT-4o. Both are possible because of advancements in multimodal AI. I’m working on a much more in-depth piece for Gemini 2.5, but it’s a big enough deal I want to call it out her…
Keep reading with a 7-day free trial
Subscribe to Nate’s Substack to keep reading this post and get 7 days of free access to the full post archives.