“Scott Alexander’s simple “AI Art Turing Test” proves AI is creative. 

AI’s math problem: FrontierMath benchmark shows how far technology still has to go | VentureBeat

“Starting to see the first serious economic analysis attempts to grapple with what AGI might mean. I appreciate that this piece embraces scenarios, we don’t know if or when AGI might happen. But wow that wages graph is something else. 

“”Write me a murder mystery short story. make sure there is non-obvious foreshadowing. make the ending tell us something about dialectics” GPT-4o has gotten better, but Claude is still winning, I think. Gemini experimental and Grok are still pretty far out. 

ChatGPT Defeated Doctors at Diagnosing Illness – The New York Times

“LLMs can win Maths Olympiad, yet still fail to answer dumb questions like “which number is bigger, 9.11 or 9.9?” @karpathy called it Jagged Intelligence. Research by MIT & Berkeley found that zeroing out the “Bible verse neurons” improved 9.8 vs 9.11 accuracy by 21% in llama-3.1 

““We are now generating intelligence at scale.” The remarks from Jensen Harris at tonight’s @ComputerHistory honoring of its 2024 fellows. A different side of Jensen. 

“For nuanced evaluations of complex generations, people now rely on LLM as judges… but which LLM should you use? Try the new Judge Arena, and compare model-judges on your prompts! 

What if AI doesn’t just keep getting better forever? – Ars Technica

Why are we using LLMs as calculators 

Learning high-accuracy error decoding for quantum processors | Nature

AI could cause ‘social ruptures’ between people who disagree on its sentience | Artificial intelligence (AI) | The Guardian

Leave a Reply

Trending

Discover more from Ethan B. Holland

Subscribe now to keep reading and get access to the full archive.

Continue reading