ChatGPT Plus Upgrades: Are They Worth It?

I test-drive three recent updates to ChatGPT Plus to see if they're any good.

Oct 12, 2023

Happy Thursday, net ninjas!

If you’ve been following my 10X AI posts, you’ll know that OpenAI has been busy giving ChatGPT Plus users a bunch of new toys to play with over the past several weeks.

I finally got to try most of them, and I’m ready to report my observations.

I already covered the Code Interpreter (now “Advanced Data Analysis”), so go read that post if that’s your thing.

Today, I’ll be looking at:

DALL-E 3
Browse with Bing
ChatGPT Vision

I’ll share my impressions of what works well and what isn’t quite up to scratch. I’ll also compare the ChatGPT Plus features to their free Bing counterparts.

Ready?

No?

How about now?

Great, let’s go!

It’s all a wee bit fragmented, innit?

One overall observation before I move on.

While it’s great that ChatGPT Plus is getting cool add-ons, what’s decidedly less great is that they’re not integrated at all.

To use any of the new features, you must select one—and only one—from this ever-expanding list when starting a new chat:

The “Default” model is the one that has “Vision”

So far so good, but what if I want to use “Browse with Bing” to find a PDF link for “Advanced Data Analysis” to immediately analyze?

Can’t do that in the same chat.

Want ChatGPT Vision to view an image and then ask DALL-E 3 to create a painting based on it?

No can do: They’re two separate features.

You can work around this by, for example, copy-pasting the output from one chat into another. But that’s obviously not particularly streamlined.

I expect that once all of these features drop the “Beta” tag, they’ll be rolled into the default GPT-4 model.

For now, you’ll just have to live with this hodge-podge situation.

Let’s go ahead and look at the individual features.

1. DALL-E 3

First off, we’ve got DALL-E 3.

What is it?

This is the ChatGPT implementation of OpenAI’s latest text-to-image model, DALL-E 3 [1]. It lets ChatGPT generate images directly inside your conversation.

You activate it by starting a new chat and selecting DALL-E 3 (Beta) from the dropdown list:

Now you’ll be able to ask ChatGPT to make images while chatting with it.

What’s good?

There are many things to like about DALL-E 3 in ChatGPT Plus.

1. Can prompt itself

If you give ChatGPT a basic prompt like this…

“Portrait of a chimpanzee”

…it won’t just stick to your initial input. Instead, ChatGPT will create more elaborate prompts to feed to DALL-E 3 in order to generate images with varying styles and compositions.

As a result, you’ll get a set of very different pictures to pick from:

Four diverse chimpanzee portraits by DALL-E 3

ChatGPT can even prompt itself from scratch based purely on your ongoing chat:

Conversation about Space Shuttles with ChatGPT. ChatGPT draws 4 space shuttle images.

So now you can have images of a space shuttle to accompany ChatGPT’s narrative, in case you’ve never seen one before:

This self-prompting ability is great in many situations:

If you’re new to prompting and don’t know where to start
If you’re looking for an illustration of a concept you’re discussing
If you want to get inspiration for artistic styles and directions to explore

For most casual users, this alone is a big deal.

2. More aspect ratios

Right now, Bing only outputs images in a square format (1024x1024 pixels).

The ChatGPT version lets you pick from the following three options:

Square (1024x1024)
Wide (1792x1024)
Tall (1024x1792)

So you can generate landscape and portrait pictures by simply asking for “wide” or “tall” images, respectively.
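
If you’re wondering what those pixel sizes mean as proportions, here’s a minimal Python sketch that reduces each one to its simplest ratio. The `SIZES` dictionary and the `aspect_ratio` helper are names I made up for illustration; they aren’t part of ChatGPT or any OpenAI tool.

```python
from math import gcd

# The three sizes listed above (width, height), in pixels.
SIZES = {
    "square": (1024, 1024),
    "wide": (1792, 1024),
    "tall": (1024, 1792),
}


def aspect_ratio(width: int, height: int) -> str:
    """Reduce width:height to lowest terms (hypothetical helper)."""
    divisor = gcd(width, height)
    return f"{width // divisor}:{height // divisor}"


for name, (width, height) in SIZES.items():
    print(f"{name}: {aspect_ratio(width, height)}")
# square: 1:1
# wide: 7:4
# tall: 4:7
```

So “wide” and “tall” are 7:4 and 4:7, respectively, which is slightly less elongated than the 16:9 you might know from video.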

At the moment, you can’t ask ChatGPT for other aspect ratios, but that might change in the future.

3. Convenience

I’m sure many people will find it handy that they can create images directly in ChatGPT without having to switch over to separate software.

It also allows for a back-and-forth interaction where ChatGPT creates an initial set of images, the user asks for refinements, ChatGPT generates more images, and so on.

This gets us closer to the way you might work with a human artist in the real world, as they gradually refine their first draft based on your feedback.

4. Fewer restrictions than Bing

Bing operates with a system of “tokens” that you can spend to generate images.

Once you run out of tokens, you’ll have to wait until they refresh before you can create more.

There are ongoing indications that Microsoft has lowered the number of allocated tokens from 100 per day to only 25 per week for some users.

With ChatGPT Plus, you can generate an unlimited number of images [2].

Then there are the now-famous “doggy” content restrictions, where Bing, seemingly at random, outright blocks content it deems unsafe and displays a cartoon dog instead:

"Unsafe image content detected" result in Bing — “‘Silly turtle caricature’?! You’re one sick fucking maniac!”

ChatGPT doesn’t block images nearly as frequently. When it does, it can at least explain the reasoning behind it and help you find an alternative approach.

Here’s the “silly turtle caricature” that was too controversial for Bing, courtesy of ChatGPT:

Silly turtle caricature by ChatGPT DALL-E 3

So is DALL-E 3 in ChatGPT Plus always the better option? Well…

What’s bad?

Paradoxically, some of the things that are great about DALL-E 3 in ChatGPT can also become a nuisance in certain circumstances.

1. ChatGPT: The unwanted middleman

ChatGPT’s self-prompting is very useful if you don’t know what you’re going for.

But what if you want to be very deliberate and intentional with your prompts?

Then it might just get in the way.

ChatGPT is pre-prompted to create its own detailed descriptions, so it’ll often add elements you didn’t specify or describe the scene in a way you might not have intended.

ChatGPT prompting itself to create four different images of mansions

You can avoid this by asking ChatGPT to literally use your prompt as is without adding its own details, but even that might fail:

ChatGPT shortening the prompt but still adding own details — You were so close.

I often had to fight with ChatGPT to get it to use the prompt exactly as written.

And once it does follow your prompt to the letter, another problem pops up.

2. ChatGPT does not use “seeds” to vary the image

“Seeds” are used by most text-to-image models to control the random starting point from which the AI image is generated. What this does in practice is:

Ensures that the same text prompt results in different images when you run it multiple times, by allocating a different starting seed to each.
(For a visual reference, see any of the 4-image Midjourney grids I use to illustrate articles like this one.)
Lets you recreate a specific image by using the same text prompt and specifying the same seed.
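
To make the idea concrete, here’s a toy Python sketch of how a seed controls randomness. It’s an illustration only: `toy_generate` is a hypothetical stand-in I made up, and it returns a handful of random “pixel” values rather than a real image. It says nothing about how DALL-E 3, Bing, or Midjourney work internally.

```python
import random


def toy_generate(prompt: str, seed: int, size: int = 16) -> list[int]:
    """Hypothetical stand-in for an image model.

    Returns `size` pseudo-random "pixel" values (0-255) that depend only on
    the prompt text and the seed. This is not a real text-to-image model.
    """
    # random.Random accepts a string seed and is deterministic for it.
    rng = random.Random(f"{prompt}|{seed}")
    return [rng.randrange(256) for _ in range(size)]


prompt = "mansion on a hill"

# Same prompt, same seed: the "image" is reproduced exactly.
print(toy_generate(prompt, seed=7) == toy_generate(prompt, seed=7))  # True

# Same prompt, different seeds: compare the two results.
print(toy_generate(prompt, seed=1) == toy_generate(prompt, seed=2))
```

The takeaway: if a tool reuses one seed for every image, an identical prompt gives identical output. Variety has to come from changing either the seed or the prompt.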

ChatGPT does not use random seeds.

If you force it to follow your prompt word-for-word, it will spit out four identical images (meaning they all use the same starting seed under the hood):

Four identical mansion images in ChatGPT Plus

Even if you explicitly ask ChatGPT to use different seeds, it’ll acknowledge the issue but still fail to actually produce a different result:

ChatGPT trying different seeds for images but failing — This goes on, but I’ll spare you the pain of watching poor ChatGPT suffer

Bing, on the other hand, actually varies the seed so you can continue re-rolling the same prompt until you get the image you like:

Four different mansion images in Bing with four separate seeds

3. Limited sense of orientation

Occasionally, if you ask for a tall image, ChatGPT will end up flipping your subject:

Two portrait images of a smiling woman, but one is flipped sideways

The first image is how you’d expect a tall portrait to look, while the second one has ChatGPT flipping the entire view, which clearly isn’t the intent.

But this is a minor quibble. You can always re-roll to get the result you’re after.

What’s the verdict?

For a beginner audience, the ChatGPT version of DALL-E 3 is easily the way to go. It knows how to prompt itself, understands the context of your chats, and can work with you iteratively.

Most people will probably also care little about “seeds” and prompt precision.

But then there’s the question of price.

Bing is free. ChatGPT Plus is $20 a month.

My tentative recommendation is:

If you only want to create a few straightforward images and don’t care about the square format, Bing will do just fine.
If you need help brainstorming and expanding your ideas or want the additional aspect ratios, go for ChatGPT Plus.