Sunday Rundown #71: MiniMax & Masochistic Pumpkin
Sunday Bonus #31: Getting the most out of Advanced Voice Mode as a free user.
Happy Sunday, friends!
Welcome back to the weekly look at generative AI that covers the following:
Sunday Rundown (free): this week’s AI news + a fun AI fail.
Sunday Bonus (paid): a goodie for my paid subscribers.
Let’s get to it.
🗞️ AI news
Here are this week’s AI developments.
👩‍💻 AI releases
New stuff you can try right now:
Google released its Imagen 3 text-to-image model to all Gemini users. (But it’s been available in beta since August. I even pitted it against FLUX.1.)
MiniMax (Hailuo AI) finally brought image-to-video capabilities to its video model. I updated my image-to-video showdown post to include the model. Spoiler alert: It’s incredible! (Try it for free.)
OpenAI’s Advanced Voice Mode for ChatGPT is now available to free users, capped at 15 minutes per month.
Suno introduced a “Replace Section” feature that lets you select and edit specific parts of your music track. Available to Pro and Premier users.
Chinese researchers unveiled Pyramid Flow, an open-source video model capable of text-to-video and image-to-video generation. (Try it for free.)
🔬 AI research
Cool stuff you might get to try one day:
Apple released a paper on its Depth Pro model, which can generate a high-resolution 3D depth map of a given image in a fraction of a second.
📖 AI resources
Helpful stuff that teaches you about AI:
“Cursor Team: Future of Programming with AI” - a super detailed deep dive into coding with AI by the Lex Fridman Podcast.
“Machines of Loving Grace: How AI Could Transform the World for the Better” - a rosy vision of AI’s potential future by Anthropic CEO Dario Amodei. (Here’s my NotebookLM podcast if you prefer a condensed audio version.)
“The State of AI 2024” Report (Fully Read w/Highlights) - another great breakdown by AI Explained.
🔀 AI random
Other notable AI stories of the week:
Adobe introduced a free Content Authenticity Web App that lets creators authenticate their work, get proper attribution, and protect it from AI scraping.
OpenAI partnered with Hearst to integrate content from Hearst’s 20 magazine brands—like Cosmopolitan, ELLE, and Runner’s World—into its products.
🤦‍♂️ AI fail of the week
Wait…did you just independently change my “No” prompt to “Yes”? (Final version.)
💰 Sunday Bonus #31: How to make the most of your 15 minutes of Advanced Voice Mode
Free ChatGPT users, rejoice!
You now have access to the awesome “Advanced Voice Mode” that can respond instantly, pick up on your tone, mimic emotions, and more.
Here’s a great YouTube playlist from OpenAI, showcasing its many potential applications.
But you only get 15 free minutes a month, which isn’t a whole lot.
So how do you use that time as efficiently and productively as possible?
Here are some ideas: