Image created with OpenAI GPT-Image-1. Image prompt: mid‑1990s web‑browser screenshot, CRT glow, 256‑color dithering — Pop‑up JS alert “Welcome to my homepage!!!” overlapping tab — dancing musical‑note GIF captioned “ByteDance AI” — crisp pixel edges, screen‑door scan‑lines, phosphor glow

ByteDance released Tar 1.5B and 7B: image-text in, image-text out models 👏 They have an image tokenizer unified with text, and they de-tokenize using either of two models (LLM and diffusion). The model is actually a full LLM (Qwen2); the tokenizer converts images into tokens 🤯 https://x.com/mervenoyann/status/1942539723089621055
