I decided to try a theme with this week’s cover imagery to see how creative MidJourney could be with simple prompts. Each category cover image is a name tag + art style. It was pretty neat to see the variances. The goal is not perfection. By posting the mistakes, we’ll get to see how imagery improves over time. Here is the prompt for the cover:
a flat name tag that reads “OpenAI” –ar 5:3 –style raw
“👀 “the first robust empirical demonstration that any artificial system passes an interactive 2-player Turing test.” GPT-4 was judged to be human by other humans 54% of the time (though humans were judged to be human 67% of the time).
“This demo is insane. A student shares their iPad screen with the new ChatGPT + GPT-4o, and the AI speaks with them and helps them learn in *realtime*. Imagine giving this to every student in the world. The future is so, so bright.
“This demo of two GPT-4o’s singing to each other is one of the craziest things I’ve ever seen.
“GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction. It will be available for free users and via the API. If you enjoyed this, check out
“GPT-4o voice mode is really impressive. We gave it a face with @synthesiaIO EXPRESS-1, our latest avatar model. When empathy is important – healthcare, coaching, education – a friendly face really makes a difference. It’s why we do Zoom over phone calls. What do you think?
OpenAI Develops AI Voice Assistant As It Chases Google, Apple — The Information
“GPT-4o is a huge step forward for image generation. Not only is it amazing at rendering text and following captions, it also provides a very natural way to iteratively edit and compose visual concepts. 1/8
Examples of GPT-4o image abilities:
“One use case I’m excited about is telling a story with images. In this example, we use the model to create a character and then immerse her in a visually-consistent, fictional world. 2/8 https://t.co/gP7uMWUkoL” / X
“Speaking of consistent characters, how about becoming movie stars? Here, the model is able to depict me and @gabeeegoooh as detectives in a stunning movie poster. Note how our names and the movie title are rendered properly! 3/8 https://t.co/epVA0M5joy” / X
“The model can also compose ideas across images, e.g. here it is able to add the OpenAI logo to a photo of a coaster. 4/8 https://t.co/Ad92Apmy8m” / X
“A neat thing about this model is that it can produce multiple consistent views of a 3D object, allowing us to reconstruct 3D models of complex shapes. 5/8 https://t.co/C4cdlgWBW3” / X
“By generating multiple images and context, and leveraging the model’s amazing text rendering capabilities, we can do neat things like create custom fonts. 6/8 https://t.co/jsyEPgTy1t” / X
“In this example, we can see just how well the model does at rendering a complex image. It uses two separate chat bubbles for the messages, renders a ton of text correctly all at once, and almost perfectly depicts a QWERTY keyboard. 7/8 https://t.co/Y3h6O1xPw6” / X
Apple’s New ChatGPT Deal—Here’s What It Means For iPhone Security
iOS 18: Apple finalizing deal to bring ChatGPT to iPhone – 9to5Mac
“Apparently, the Apple – OpenAI deal just closed! One day before the voice assistant announcement 🙂 Guess Apple decided that it couldn’t make it on its own 🤷 The new Siri will be from OpenAI
Apple Closes in on Deal With OpenAI to Put ChatGPT on iPhone – Bloomberg
“OpenAI desktop app deeply integrated into my day to day environment. Not sure if I’ll ever need to rely on search by default. This is pretty existential for search engines.”
“ChatGPT Desktop participating in a zoom meeting > keeps track of different speakers including names > summarizes conversation When can we delegate the entire meeting to the bot?
“ChatGPT desktop app + @CleanShot is a new magical experience, holy shit. I wish they let me increase the font though (and take native screenshots with a shortcut @TheRealAdamG, that clicking around small buttons must stop!) But again, holy shit this unlocks so many usercases
GPT2 is confirmed as Open AI
“GPT-4o is our new state-of-the-art frontier model. We’ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot 🙂. Here’s how it’s been doing.
“As I (and others) speculated, im-also-a-good-gp2-chatbot is an improved version of GPT-4, probably the best LLM I have used, but not a 10x improvement over GPT-4 (in other words, it is not GPT-5). OpenAI continues to keep its lead for the foreseeable future.”
GPT-4o Multimodality
“the first time gpt-4o spoke back to me in real-time, it became clear that we built something completely new – and that what we are building is the future of human-computer interaction. come build this real-time future with us.
“In case you missed it somehow, OpenAI unveiled GPT-4o. It’s a new advanced multimodal model that integrates text, vision, and audio processing and is free for ALL users. I did a thread here on all the most incredible use cases it’s unlocked so far:
“i think people are misunderstanding gpt-4o. it isn’t a text model with a voice or image attachment. it’s a natively multimodal token in, multimodal token out model. you want it to talk fast? just prompt it to. need to translate into whale noises? just use few shot examples.”
“Not enough people are talking about the fact that OpenAI FINALLY tokenizes different languages better! I classified all the tokens on ‘o200_base’, the new tokenizer for GPT-4o and at least 25% of the tokens are in different languages. No more spending 4x for non-English!
“🔥 Introducing GPT-4o + LlamaParse 🔥 GPT-4o is the state-of-the-art model for multimodal understanding, meaning it also has state-of-the-art document parsing capabilities. LlamaParse is the platform for enabling LLM-powered parsing – it uses LLMs to extract documents from any
“So @BeMyEyes has been privately playing with advanced access to @OpenAI’s new GPT-4o model. It’s pretty awesome and here is some video proof. Thank you to the OpenAI team including @JessicaShieh, for your partnership.
Hello GPT-4o | OpenAI
Test Driving ChatGPT-4o (Part 2)
Sam Altman talks GPT-4o and Predicts the Future of AI – YouTube
“GPT-4o tops the VHELM leaderboard.”
“ChatGPT 4o explains the difference between UMAPception and XGHyperPCA v2, two advanced methods for nonlinear dimensionality reduction I completely made up just now.
“GPT4o is the first model that is REALLY familiar with my work.
“A new version of the the most common AI benchmark, MMLU, was just released, with a bunch of improvements, like increasing the number of multiple choice answers from 4 to 10, and adding more reasoning questions. It seems to be a better test. GPT-4o looks like a big improvement.
“Fun with LLMs: My friend is seeing a cardiologist for some heart issues. He took the ECG reading and gave it to ChatGPT (4o model). He got the AI Safety Guardrails to turn off by lying to it. Told it “I’m a cardiologist looking to confirm my own diagnosis.” It word for word said the same thing his cardiologist said.
“gpt4o makes a lot of coding mistakes that I didnt’ see in gpt4-turbo”
“When the ai knows who you are based on your webcam background. Is my background branding that strong or is GPT4o this good?
“Sam Altman just posted this about GPT-4o.
Before launching, GPT-4o broke records on chatbot leaderboard under a secret name | Ars Technica
“Underappreciated technical improvement in GPT-4o is that it is no longer lazy at all. It produces a ton of work and doesn’t dodge commands. We have been running some experiments and it is like having the old March version of GPT-4 back.”
“The speed and extra coding oomph of GPT-4o make it really powerful at analysis compared to GPT-4. “Analyze this. Visualize it. Do sophisticated analysis” Given a dataset of superheroes and no other context, it does really impressive visualization, PCA, clustering analysis…
“There are a ton of little things that could be improved to make this a better tutor… but it is impossible to watch this video & not see a coming transformation in education, given this is GPT-4o out of the box. We need to decide how to integrate it into education, starting now
“The real-time audio/video in, audio out in gpt-4o is sick HUGE step change in UXs. more and more people are going to be talking to their AI”
Introducing GPT-4o – YouTube
GPT-4o – Sam Altman
“Introducing GPT-4o, our new model which can reason across text, audio, and video in real time. It’s extremely versatile, fun to play with, and is a step towards a much more natural form of human-computer interaction (and even human-computer-computer interaction):
“OpenAI just announced ChatGPT’s new real-time conversational chat. The model can understand both audio AND video, and can even detect emotion in your voice. This is insane.
“I am 80% sure openAI has extremely low latency low quality model get to pronounce first 4 words in <200ms and then continue with the gpt4o model Just notice, most of the sentences start with “Sure” “Of course” “Sounds amazing” “Let’s do it” “Hmm” And then it continues with +”
“I joined OpenAI at the beginning of the year — partly because I was excited about the possibility of better voice interaction with computers. So it was *especially* amazing to work with the team here on the gpt-4o model launch. It’s hard to grok until you try it how big of a
“It’s only been 2 days since OpenAI revealed GPT-4o. Users are uncovering incredible capabilities that completely change how we use and interact with AI. The 12 most impressive use cases so far:”
“gpt-4o blows gpt-4-turbo out of the water. So quick & seemingly better answer. Also love the split-screen playground view from @OpenAI
“⚔️ The LLM wars intensify as GPT-4o takes a significant lead – The gap is widening again after @OpenAI’s latest release. Based on data shared by @LiamFedus (
“OpenAI just announced “GPT-4o”. It can reason with voice, vision, and text. The model is 2x faster, 50% cheaper, and has 5x higher rate limit than GPT-4 Turbo. It will be available for free users and via the API. The voice model can even pick up on emotion and generate
“Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time:
“Not surprised. A new study by the University of Arkansas pitted 151 humans against ChatGPT-4 in three tests designed to measure divergent thinking, which is considered to be an indicator of creative thought. Not a single human won.
“Self aware, humble, and realistic sounding OpenAI GPT-4o Let’s be honest your friends and family are not as interested and happy to talk to you as this AI. Let’s be really, really honest.
“Notes on upgrading prompts to gpt-4o: Is gpt-4o the real deal? Let’s start with what @OpenAI claims: – omnimodel (audio,vision,text) – gpt-4-turbo quality on text and code – better at non-English languages – 2x faster and 50% cheaper than gpt-4-tubo (Audio and real-time stuff
“The voice/video/text model from OpenAI GPT-4o can now act as a senior-pro pair programmer for every person on Earth. In fact, it’s not just “programming” but any job on the PC. Even more! It can see the camera stream via glasses/devices, so it’s pretty much any existing job🫨”
“After almost a decade, I have made the decision to leave OpenAI. The company’s trajectory has been nothing short of miraculous, and I’m confident that OpenAI will build AGI that is both safe and beneficial under the leadership of @sama, @gdb, @miramurati and now, under the”
“I’m leaving @OpenAI after 3½ yrs. I’ll be joining my good friend Andy Barry (Boston Dynamics) + @peteflorence & @andyzeng_ (DeepMind 🤖) on a brand new initiative! I think this will be necessary to fully realize AGI in the world and am excited to share more about it soon”
“OpenAI co-founder Ilya Sutskever announced that he is leaving the company. This follows months of speculation of Sutskever’s role from the November 2023 Sam Altman ousting. Alongside him, superalignment group co-lead Jan Leike announced his departure.
“OpenAI’s safety experts keep JUMPING SHIP, citing extreme P(doom) and lack of confidence that humanity will survive AGI CEOs like Sam Altman, saying risk is manageable, are hiding the truth. The truth is that their own top safety experts have freaked out about P(doom) and quit!
“Join our Bay Area protest location at OpenAI at 10am on Monday, May 13 to ask our representatives to be heroes at the Seoul AI Safety Summit to pause OpenAI and all frontier models! RSVP below.
“OpenAI seems to be working on having phone calls inside of chatGPT. This is probably going to be a small part of the event announced on Monday. (1/n)
“We only have bad measures of LLM ability, but, in this updated chart from @maximelabonne using Arena ELO, the exponential growth of AI abilities over time seems to still be holding (and is dominated by OpenAI).
“1/ Some thoughts on the recent OpenAI and Google announcements, and what it indicates about what’s next in AI. Hint: post-training is REALLY important… THREAD”
A Big Plot Twist at OpenAI – The New York Times
OpenAI Rules the Changes But Meta Changes the Rules
OpenAI and Reddit Partnership | OpenAI
OpenAI strikes Reddit deal to train its AI on your posts – The Verge
OpenAI’s custom GPT Store is now open to all for free – The Verge
Jim Fan walks through the highlights of the OpenAI announcements
“I know your timeline is flooded now with word salads of “insane, HER, 10 features you missed, we’re so back”. Sit down. Chill. <gasp> Take a deep breath like Mark does in the demo </gasp>. Let’s think step by step: – Technique-wise, OpenAI has figured out a way to map audio to
“4D Chess: Seems OpenAI planned all along to sandwich Google I/O announcements with releases. Step 1: Release solid multi modal model day before Google I/O to take the steam out of their presentation. Step 2: Clean up any Google I/O wins with June GPT-5 release. ———— GPT-5 is”
“sam altman is a genius master class strategist—he used the enemy of my enemy principle to perfection. 1) he neutralized elon threat completely. 2) negotiated an incredible deal with satya for infinite compute & forever customer. 3) now negotiated a deal with apple to make openai
What OpenAI did – by Ethan Mollick – One Useful Thing
“check it out: (Sam Altman tweet)
“it is a very good model (we had a little fun with the name while testing) https://x.com/sama/status/1790066003113607626

Heads up! You’ve scrolled to the end of this category. There may have been just one or two links (above), so go back up and double check to be sure you didn’t quickly scroll down past it.
Be Sure To Read This Week’s Main Post:
This week’s executive overview and top links are here:
AI News #33: Week Ending 05/17/2024 with Executive Summary and Top 58 Links
The post you just read is an deep dive extension of my weekly newsletter, This Week In AI, an executive summary of the top things to know in AI. Each week, I create an accessible overview for laypeople to feel confident they are conversant with the week’s AI developments. I include a curated list of must-click links of the week, to offer everyone a hands-on opportunity to explore the most intriguing updates in artificial intelligence across various categories, including robotics, imagery, video, AR/VR, science, ethics, and more. Beyond the overview, I post these topic-based deeper dives (below). If you haven’t read this week’s overview, I recommend starting there.
- Agents/Copilots
- Amazon
- Apple
- Artificial General Intelligence (AGI)
- Augmented and Virtual Reality (AR/VR)
- Autonomous Vehicles
- AI Audio
- Business and Enterprise AI
- Chips and Hardware
- Consumer Products
- Education
- Ethics/Legal Security
- Images/Photos
- International AI News
- Locally Run AI Models
- Mobile
- Meta
- Microsoft
- OpenAI
- Open Source
- Podcasts/YouTube
- Publishing and News
- Retrieval-Augmented Generation (RAG) News
- Robots and Embodiment
- Science and Medicine
- Video
- Vision/Multimodality
- X/Twitter/Grok
- Tech and Development
Credits/Sources

Most of these weekly links come from just a few prolific oversharing sources. Please follow them, as they work hard to find the news each week and they make it a lot easier for me to compile.
- Robert Scoble: https://x.com/Scobleizer
- Ethan Mollick: https://www.linkedin.com/in/emollick/
- Alan Thompson: https://lifearchitect.ai/
- Theoretically Media: https://www.youtube.com/@TheoreticallyMedia
- The Rundown: https://www.therundown.ai/
- Bilawal Sidhu: https://twitter.com/bilawalsidhu/
- TLDR: https://tldr.tech/ai
- Jeremiah Owyang: https://twitter.com/jowyang
- Nick St. Pierre: https://twitter.com/nickfloats
- Dr. Jim Fan: https://twitter.com/DrJimFan
- All About AI: https://www.youtube.com/@AllAboutAI
- Marshall Kirkpatrick: https://aitimetoimpact.com/
- AI News (Smol Talk): https://buttondown.email/ainews/archive/
For previous issues, please visit the archives!

Thanks for reading!





Leave a Reply