Image created with OpenAI GPT-Image-1. Image prompt: TikTok LIVE phone-screen POV, floating hearts & spinning album art, ArtTutorial canvas on easel with paint-splat stickers, featuring auto-generating art canvas timelapse; soft-glow studio lighting, photoreal 8k

Ring Video Descriptions deliver real-time, Gen AI descriptions of what’s happening https://www.aboutamazon.com/news/devices/ring-video-descriptions-gen-ai

Adobe has quietly rolled out an AI camera app for iOS. Built by legends Marc Levoy (computational photography OG) and Florian Kainz (multi-academy award winner and creator of night sight mode on pixel). Delivers SLR-like image quality with manual controls all on device. https://x.com/bilawalsidhu/status/1936424884017717511

Black Forest Labs – Frontier AI Lab https://bfl.ai/announcements/flux-1-kontext-dev

Black Forest Labs just crossed 20,000 followers on @huggingface after the release of the weights of FLUX Kontext dev. Let’s go open image AI! https://x.com/ClementDelangue/status/1938633511562281192

BOOOM! Live on Inference Providers use BLAZINGLY FAST Flux Kontext – only on Hugging Face ⚡ https://x.com/reach_vb/status/1938593855512715441

Day zero support for Flux kontext dev on Chipmunk! Great work @austinsilveria!”” / X https://x.com/realDanFu/status/1938300379613347942

N8N: Build INSANE AI Image Agents 🤯. https://x.com/JulianGoldieSEO/status/1935850085205569836

11/ TL;DR MJ has dropped a decent model with unlimited video gen for $60/month. if you’re all about the MJ aesthetics, doing abstract generations (sans text), and don’t mind using MMAudio or similar to add audio in post, this is a great addition to the toolkit. https://x.com/bilawalsidhu/status/1935528281031462945

MidJourney Video 1/ First off. It’s fun to just click through your MJ catalog and see it come to life. – No text to video; only image to video for now – Works with MJ or uploaded imagery – Can choose high/low motion + auto or custom prompt – Can extend clips 4x – SD output at 24 fps; no upscaling https://x.com/bilawalsidhu/status/1935527429768163481

MidJourney Video 4/ Pretty good with motion graphics and AR visuals. Lack of text rendition (a weakness for MJ in general) comes through. Would not recommend for titles. But you can still get some beautiful abstract visuals (e.g head locked AR gen on the left, multi-monitor generations on right) https://x.com/bilawalsidhu/status/1935527747725709668

MidJourney Video 6/ Muzzle flashes look pretty good. But I had a very hard time getting shell casings to work properly. Let me know if you find a good prompt to achieve this, because I couldn’t. https://x.com/bilawalsidhu/status/1935527929569755261

MidJourney Video 7/ MJ video nails that high end unreal engine “”rendered”” look. Fisheye lens distortion and sweeping camera move is nice. But notice how wonky all the cars in the scene look. As it stands, I don’t think we’ll be pulling any 3d objects or scenes out of this video model. https://x.com/bilawalsidhu/status/1935527993830686966

MidJourney Video 8/ Some generations get this weird “”unsharp mask”” look (a technique for sharpening) as the generation progresses. Lmk if you spot it too. https://x.com/bilawalsidhu/status/1935528060612395421

MidJourney Video 10/ MJ video does seem like an amazing tool to make abstract visual elements you composite elsewhere. I hope they remove the extend duration limit (20s / 4 times max) because it could be an amazing tool for screensavers, music videos and concert visuals. https://x.com/bilawalsidhu/status/1935528210588213670

MidJourney Video 2/ Fast generation time. Works well for that wide angle vlogging style. You can extend any clip 4x. Two great examples below. Of course, it’s begging for dialogue. Sure, you can add the facial performance in post – but it won’t look half as good. Veo 3 has spoiled us here. https://x.com/bilawalsidhu/status/1935527555404271877

MidJourney Video 3/ MJ video does okay in my handshake test (homie on the left really went in hard lmao) Physics is a weakness of this model — doesn’t matter if it’s soft-body or rigid-body subject matter. Might get slightly better as user ratings roll in, but still far behind the SOTA. https://x.com/bilawalsidhu/status/1935527672484179979

MidJourney Video 5/ The dinosaur test comes next. Movement looks decent, but the rest of the physics in the scene are all over the place. The slipping tanks in the background reminds me a bit of Sora. Relative scale and relative motion is pretty wonky. https://x.com/bilawalsidhu/status/1935527810564767970

MidJourney Video 9/ Testing fluid simulations here. Not only is it pretty far from SOTA, sometimes I get generations with this stop motion-like choppy FPS look (e.g. wine glass on right). https://x.com/bilawalsidhu/status/1935528134113407408

The new Hailou 02 AI video model really does seem to have made huge strides in the “”gymnastics problem”” where fast flipping motions lead to distortion Here are the first three results of the “”a man in elaborate robes does a backflip while holding two pool noodles”” (a hard test!) https://x.com/emollick/status/1936091679850705019

Image Edit is heating up in the Arena – 3 new models have been added! ✨ Flux-Kontext-Max by BFL ✨ Bagel by ByteDance ✨ Step1X-edit by StepFun This brings the Image Edit Arena to a total of 7, with more coming! Upload an image and test them out, let’s see what you think of https://x.com/lmarena_ai/status/1936100445585539482

3DGH: 3D Head Generation with Composable Hair and Face https://c-he.github.io/projects/3dgh/

Higgsfield’s first high-aesthetic photo model Higgsfield Soul https://higgsfield.ai/soul

Install `diffusers` from source and start using Kontext from @bfl_ml 🧨 Use your favorite optims, too 🙂 Training is also supported (@linoy_tsaban and yours truly) 🤗 https://x.com/RisingSayak/status/1938267936378208655

RT @bfl_ml: High quality image editing no longer needs closed models We release FLUX.1 Kontext [dev] – an open weights model for proprieta…”” / X https://x.com/ClementDelangue/status/1938260818602430788

In case there is any ambiguity: DINOv2 is 100% a product of dumb hill-climbing on ImageNet-1k knn accuracy (and linear too) Overfitting an eval can be bad. But sometimes the reward signal is reliable, and leads to truly good models. It’s about finding a balance”” / X https://x.com/TimDarcet/status/1936831019908243507

Inside Disney’s Campaign to Protect Darth Vader from AI – Bloomberg https://www.bloomberg.com/news/newsletters/2025-06-22/inside-disney-s-campaign-to-protect-darth-vader-from-ai

Big open AI day! @bfl_ml just released the best open weights image editing model, comparable to GPT4-o @Google released Gemma 3n, the first model under 10B with @lmarena_ai score of 1300+ All on @huggingface of course. Let’s get open-source AI to dominate all modalities and”” / X https://x.com/ClementDelangue/status/1938283910980325670

Try on looks and discover your style with Doppl https://blog.google/technology/google-labs/doppl/

Disney and Universal filed a lawsuit against image generation company Midjourney, accusing it of training its models on their copyrighted content and reproducing it without permission. The studios claim Midjourney system generated unauthorized images of characters like https://x.com/DeepLearningAI/status/1937314755066171580

Google – Very good image generator so far: Pretty good “”horse riding on top of an astronaut going through a drive through”” & “”pile of old photographs, half show a UFO crashlanded in black and white, the others show pictures of cute dogs”” … but still can’t do clock hands outside of 10:10 https://x.com/emollick/status/1937668606214758534

Retrieving text chunks is the bread and butter of RAG. We’ve released a recent feature in LlamaCloud that allows you to not only retrieve text, but image elements from documents as well. 1️⃣ You can index, embed, and retrieve embedded figures (charts, pictures) within a PDF and https://x.com/jerryjliu0/status/1936451556293104067

Wow! OmniGen 2 is quite amazing – State of the Art in Image edits – Apache 2.0 licensed 🔥 Bonus: can also do in context generation, text to image, visual understanding and image edits Play with directly on the demo below and models on the hub 🤗 https://x.com/reach_vb/status/1937514552163197419

Introducing Design System on Magicpath. What if AI generation could follow a specific set of design rules? Now it can. Truly magic. ✨ https://x.com/skirano/status/1937591554044055697

Introducing Higgsfield Canvas: a state-of-the-art image editing model. Paint products directly onto your image with pixel-perfect control. Say hi to your new go-to for product placement, editing, and layout! 👋🏻 Comment Canvas to get the full guide in the DM. https://x.com/higgsfield_ai/status/1935042830520697152

The lifelike feel of @midjourney ‘s videos are in a class of their own. Some beautiful clips 👇 https://x.com/rohanpaul_ai/status/1936646300130308291

Most of the value in AI video won’t be captured by the creation tools. It’ll accrue to the platforms — X, YT, IG, etc. — where that content is distributed, ranked, and monetized. Can’t imagine it playing out any other way.”” / X https://x.com/bilawalsidhu/status/1935854180310434255

📸 Live Photo is ON! Kling AI now supports saving as Live Photos! Turn your favorite Kling creations into dynamic wallpapers — right on your phone. #klingai #livephoto #wallpaper https://x.com/Kling_ai/status/1937343208515924465

Summer’s here, and the waves are calling! 🌊 With SurfSurf Effect, surf anytime, anywhere—no limits, just pure summer fun. 🌞 Don’t leave your fur friend behind — let’s hit the waves and ride into adventure! 🏄‍♂️💥 #surfsurf #klingeffects #klingai https://x.com/Kling_ai/status/1937393240225063042

Midjourney’s new animation features continue to be compelling to play with because they really do let you make things that don’t feel like standard AI videos. Here I made some vast and strange machines. https://x.com/emollick/status/1935775887607447687

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading