Sunday Rundown #69: Meta Connect & Sexy Gollum
Sunday Bonus #29: Creating NotebookLM "Audio Overviews" without a source.
Happy Sunday, friends!
Welcome back to the weekly look at generative AI that covers the following:
Sunday Rundown (free): this week’s AI news + a fun AI fail.
Sunday Bonus (paid): a goodie for my paid subscribers.
Let’s get to it.
🗞️ AI news
Here are this week’s AI developments.
👩💻 AI releases
New stuff you can try right now:
Google released two upgraded Gemini models—Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002—that are simultaneously cheaper, faster, and better versions of their predecessors.
Google’s already awesome NotebookLM can now natively parse audio files and YouTube videos as sources.
Meta announced a slew of AI developments at Meta Connect:
A new live voice model, which will eventually feature the voices of famous people.
The ability to talk about and edit your photos in Meta AI using natural language.
Automatic translations of the audio in your Reels.
More AI tools for businesses.
More AI-powered image and theme creation features in Meta products.
New Llama 3.2 family of multimodal models.
OpenAI finally rolled out the Advanced Voice Mode in ChatGPT to Plus and Teams users…but not in the UK, EU, and a few other European countries. Classic.(There are also five new voices to pick from.)
Runway now lets you generate vertical videos using Gen-3 Alpha Turbo.
Suno Pro and Premier users can now crop songs to adjust their start and end times.
🔬 AI research
Cool stuff you might get to try one day:
Alibaba Group presented an impressive framework MIMO that can synthesize any reference image into an existing video.
Allen Institute for AI introduced a Molmo family of open multimodal models that are competitive with or outperform the best existing vision models.
Duolingo teased lots of AI-powered experiences at its Duocon 2024 event.
Microsoft launched a preview of a “Correction” feature in Azure AI that can allegedly detect and fix LLM hallucinations in real time.
📖 AI resources
Helpful stuff that teaches you about AI:
185 real-world gen AI use cases from the world's leading organizations - a collection by Google.
AI Data Drop: 3 Key Insights from Real-World Research on AI Usage - takeaways from Copilot usage at 58 companies by Microsoft.
‘Advanced Voice’ ChatGPT Just Happened … But There's 3 Other Stories You Probably Shouldn’t Ignore [VIDEO] - another great deep dive by AI Explained.
The Future Of AI Agents [VIDEO] - a funny and inspiring talk by HubSpot’s Dharmesh Shah about where AI agents are headed, with cool practical demos.
🔀 AI random
Other notable AI stories of the week:
A few big top-level OpenAI developments:
The company will restructure into a “for-profit benefit corporation,” with Sam Altman potentially gaining equity for the first time.
CTO Mira Murati announced her resignation, making her the latest senior executive to leave the company in the past year.
…so did Chief Research Officer Bob McGrew.
...and VP of Research Barret Zoph.
Snapchat is partnering with Google Cloud to power its My AI chatbot.
🤦♂️ AI fail of the week
Come on, Gollum! You know it’s poor taste to outshine the bride! (Final version here.)
Anything to share?
Sadly, Substack doesn’t allow free subscribers to comment on posts with paid sections, but I am always open to your feedback. You can message me here:
💰 Sunday Bonus #29: How to create a NotebookLM podcast without using sources
As I made abundantly clear last week, I’m a fan of NotebookLM’s new “Audio Overview” feature that turns your sources into natural-sounding podcasts.
But did you know you can use this feature to create a podcast about almost any topic…even if you don’t have underlying sources?
I didn’t, either.
Well, turns out you can.
Here’s how.