Ethan B. Holland

Over 54,900 manually organized AI links and counting

AGI (Artificial General Intelligence): AI News Week Ending 11/15/2024

November 15, 2024

“I love seeing a new eval with such low pass rates for frontier models. It feels like waking up to a fresh blanket of snow outside, completely untouched.” / X

I love seeing a new eval with such low pass rates for frontier models. It feels like waking up to a fresh blanket of snow outside, completely untouched. https://t.co/YQd3fRQFnA
— Noam Brown (@polynoamial) November 10, 2024

New Paper Co-authored by Tepper School Researchers Articulates How Large Language Models Are Changing Collective Intelligence Forever – Tepper School of Business – Carnegie Mellon University

https://www.cmu.edu/tepper/news/stories/2024/september/collective-intelligence-and-llms.html

AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably | Scientific Reports

https://www.nature.com/articles/s41598-024-76900-1

Needle Threading

Can LLMs Follow Threads Through Near-Million-Scale Haystacks?

https://needle-threading.github.io

“let me give the accelerationist blackpill on this. transformer/llm architecture is likely not the path. infinite compute isn’t either but whoever has it will win once the right architecture emerges. there is no agreed upon definition of intelligence or agi at this point, and” / X

https://twitter.com/aphysicist/status/1855628802941931871

““The AGI bubble is bursting a little bit,” said @mmitchell_ai. It’s become clear, she said, that “different training approaches” may be needed to make AI models work really well on a variety of tasks—an idea a number of AI experts echoed to Bloomberg News.

https://twitter.com/hardmaru/status/1856914058869707001

“The AGI paradox most are missing. When productivity approaches infinity and deflation becomes extreme — GDP will paradoxically register as catastrophic decline. Our core economic metrics assume scarcity — we’ll need entirely new frameworks to measure progress in an age of” / X

https://twitter.com/bilawalsidhu/status/1855250895287496966

Benchmark scores of Claude, ChatGPT and Gemini over time : r/singularity

Benchmark scores of Claude, ChatGPT and Gemini over time
byu/Jean-Porte insingularity

OpenAI, Google and Anthropic Are Struggling to Build More Advanced AI – Bloomberg

https://www.bloomberg.com/news/articles/2024-11-13/openai-google-and-anthropic-are-struggling-to-build-more-advanced-ai?embedded-checkout=true

“TBH AI can ace most engineering interview questions but it can’t really do real world engineering jobs quite yet! Closing that gap is what 2025 is all about!” / X

https://twitter.com/bindureddy/status/1856425643036291267

“Really think the speculation about an “end of scaling” seems very premature given (1) AI lab insiders are mostly universally bullish on scaling and (2) scaling inference works & has barely been exploited… but also a more linear improvement path for AI would still be disruptive” / X

https://twitter.com/emollick/status/1856701705544401091

“At least for non-experts: “AI-generated poems are now ‘more human than human’… participants are more likely to judge that AI-generated poems are human-authored, compared to actual human-authored poems…. participants rate AI-generated poems more highly than human-written poems”

https://twitter.com/emollick/status/1857433241294094380

“The AI slowdown is a non-story The biggest reason AI is “slowing down” is that there is nowhere else to go 🤷‍♀️ If you begin to saturate on benchmarks, nothing is left to do. 100/100 is the highest score you can get…” / X

https://twitter.com/bindureddy/status/1856784739312833016

“Yesterday I was talking with an AI professor, who taught many of today’s AI leaders, who I won’t name (for a reason I will get to in a second). He laid out that LLM’s won’t get us to AGI. A new architecture is needed, he told me. One built round human cognitive abilities. He” / X

https://twitter.com/Scobleizer/status/1857437122954924110