Image created with gemini-3.1-flash-image-preview with claude-sonnet-4-5. Image prompt: Photorealistic 4K image of a full-height server rack standing in a frozen winter bay at dusk, encased in transparent blue ice with visible blinking green LEDs inside, ethernet cables frozen into the ice around it, sunset gradient sky from deep blue to warm orange reflecting on ice surface, landscape format, bold sans-serif title text ‘Local’ across top, National Geographic aesthetic, no CGI glow, hyperreal ice textures showing purples and golds from dusk light.

people sleep on last week’s open multimodal releases > GLM-OCR: sota OCR model > MiniCPM-o-4.5: Gemini 2.5-flash level Omni model that runs on your phone > InternS1: efficient generalist VLM outperforming on science tasks all allow commercial use freely 🔥”” https://x.com/mervenoyann/status/2021233480957304913

In sum, through an extensive (and costly) validation process, we have demonstrated that GPT-5 mini performs very well at recovering the ground truth data. It is clearly better than highly trained graduate students at this specific information retrieval task.”” At 1000x less cost”” https://x.com/emollick/status/2021689359309664645

Leave a Reply

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading