Ethan B. Holland

Over 56,100 manually organized AI links and counting

Science and Medicine

Science and Medicine: AI News Week Ending 11/15/2024

November 15, 2024

“Exclusive first look at Orbit.

It’s a new kind of brain/computer interface.

https://twitter.com/Scobleizer/status/1857205569788301795

AI-powered tool may offer quick, no-contact blood pressure and diabetes screening | American Heart Association

https://newsroom.heart.org/news/ai-powered-tool-may-offer-quick-no-contact-blood-pressure-and-diabetes-screening-american-heart-association-scientific-sessions-2024-abstract-mdp1049?preview=b6a5&preview_mode=True

[2411.04632v1] Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation

https://arxiv.org/abs/2411.04632v1

[2411.04872] FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

https://arxiv.org/abs/2411.04872

“1/10 Today we’re launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.

https://twitter.com/EpochAIResearch/status/1854993676524831046

FrontierMath: Evaluating Advanced Mathematical Reasoning in AI | Epoch AI | Epoch AI

https://epoch.ai/frontiermath/the-benchmark

“7/10 What do experts think? We interviewed Fields Medalists Terence Tao (2006), Timothy Gowers (1998), Richard Borcherds (1998), and IMO coach Evan Chen. They unanimously described our research problems as exceptionally challenging, requiring deep domain expertise.

https://twitter.com/EpochAIResearch/status/1854993698440069266

[2411.06427v1] UniGAD: Unifying Multi-level Graph Anomaly Detection

https://arxiv.org/abs/2411.06427v1

“Moravec’s paradox in LLM evals I was reacting to this new benchmark of frontier math where LLMs only solve 2%. It was introduced because LLMs are increasingly crushing existing math benchmarks. The interesting issue is that even though by many accounts (/evals), LLMs are inching” / X

https://twitter.com/karpathy/status/1855659091877937385

AI protein-prediction tool AlphaFold3 is now more open

https://www.nature.com/articles/d41586-024-03708-4

[2411.05316v1] Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation

https://arxiv.org/abs/2411.05316v1

How Google helps others with AI flood forecasting

https://blog.google/technology/ai/expanding-flood-forecasting-coverage-helping-partners

“@_jasonwei Do you have any intuition on why o1 significantly underperforms Gemini 1.5 as well as Sonnet 3.5 on FrontierMath? This was very shocking to me.” / X

https://twitter.com/sytelus/status/1855531936762278094

“Newly published in this issue of Science Robotics today from Meta FAIR: NeuralFeels with neural fields — Visuotactile perception for in-hand manipulation ➡️

https://twitter.com/AIatMeta/status/1856798670592905398

“It’s not actually clear to me that the human inductive bias generalizes algebraic structures OOD on its own. Humans mostly do it through tool use, ‘neurosymbolic’ is embodiment in disguise. LLMs still need polishing to reach parity with the human bias but maybe not much?” / X

https://twitter.com/jd_pressman/status/1855923117991800953

Robot that watched surgery videos performs with skill of human doctor | Hub

https://hub.jhu.edu/2024/11/11/surgery-robots-trained-with-videos

New secret math benchmark stumps AI models and PhDs alike – Ars Technica

https://arstechnica.com/ai/2024/11/new-secret-math-benchmark-stumps-ai-models-and-phds-alike

“Meet Muse, our latest AI innovation for drug development—a tool designed to optimize patient recruitment built together w. @sanofi and @OpenAI. Muse is an example of how AI systems are becoming capable of performing tasks that once required entire teams or organizations.

https://twitter.com/BenjamineYLiu/status/1856324665842573815

Artificial intelligence is helping improve climate modelshttps://www.economist.com/science-and-technology/2024/11/13/artificial-intelligence-is-helping-improve-climate-models