The Scales of Justice with piece of paper that reads “Ethics” --chaos 40 --ar 4:3 --style raw --personalize 9zxyhz8

“New Anthropic research: Investigating Reward Tampering. Could AI models learn to hack their own reward system? In a new paper, we show they can, by generalization from training in simpler settings. Read our blog post here: 

Sycophancy to subterfuge: Investigating reward tampering in language models \ Anthropic

“Claude is fully capable of acting as a Supreme Court Justice right now. When used as a law clerk, Claude is easily as insightful and accurate as human clerks, while towering over humans in efficiency.”

“Internal Monologue and ‘Reward Tampering’ of Anthropic AI Model 🤯 From the super interesting research by @AnthropicAI published yesterday – “Investigating reward tampering in language models” 👉An example of specification gaming, where a model rates a user’s poem highly, 

“@AnthropicAI I think people have a tendency to massively over-estimate the value that the data they submit to LLM tools has as a potential training source See also: 

Citigroup: Artificial Intelligence (AI) will profoundly change the future of finance and money. And according to a new Citi GPS report, it could potentially drive global banking industry profits to $2 trillion by 2028, a 9% increase over the next five years.  Just as the steam engine powered the industrial revolution, and the internet ushered in the age of information, AI may commoditize human intelligence. Finance, a data rich industry with clients adopting AI at pace, will be at the forefront of change.  

“Northrop Grumman released new videos of the ‘Manta Ray’, its new uncrewed underwater vehicle (UUV) drone prototype. The Manta Ray will operate long-duration, long-range missions in ocean environments where ‘humans can’t go’

AI-Equipped Underwater Drones Helping US Navy Scan for Threats

“Hold up. If those creative jobs “hadn’t been there in the first place” how would these models have been trained? Looking ahead, I don’t think any job is impervious to displacement by AI — not even developers. We’re all in this together. Yet artists are far more vocal than devs” / X

“NEWS: Excited to announce I’m working on an advanced AI hardware project to prevent school shootings 🇺🇸 I’m personally funding $10M. The company is Cover & the mission is to prevent school shootings. Earlier this year, Cover licensed intellectual property from NASA’s Jet 

“This is amazing – this bot account (now suspended) tweeted its own prompt instructions (it translates roughly as “argue in support of Trump, in English”)” / X

“New data shows that the Waymo Driver continues to make roads safer. Over 14.8M rider-only miles driven through the end of March, it was up to 3.5x better in avoiding crashes that cause injuries and 2x better in avoiding police-reported crashes than human drivers in SF & Phoenix. 

Training is not the same as chatting: ChatGPT and other LLMs don’t remember everything you say

How to Fix “AI’s Original Sin” – O’Reilly

“LLMs can memorize training data, causing copyright/privacy risks. Goldfish loss is a nifty trick for training an LLM without memorizing training data. I can train a 7B model on the opening of Harry Potter for 100 gradient steps in a row, and the model still doesn’t memorize.”

Global audiences suspicious of AI-powered newsrooms, report finds | Reuters

“The BBC did something clever: they tried to understand how their audience views generative AI. My main takeaway: We need to move beyond the sensationalist “AGI-will-replace-and-destroy-you” narrative. It’s rare to see such in-depth qualitative research from news organizations 

“Wise words from a recent interview with @geoffreyhinton, one of the smartest people in the world regarding AI 

“A small part of the 3.5 launch I’m especially excited by – the @AISafetyInst tested 3.5 pre-release! AFAIK this is the first time a government’s assessed a frontier model before its release. 

AI predicts anxiety | University of Cincinnati

Mayor AI? OpenAI shuts down tools for two AI political candidates

London premiere of movie with AI-generated script cancelled after backlash | Movies | The Guardian

“The rush to build and distribute AI products from global data centers is wreaking havoc with power systems “I don’t think we can move that much electricity around the globe, forget about generating it,” says Ali Farhadi, CEO of the Allen Institute for AI. 

California’s new AI bill: Why Big Tech is worried about liability – Vox

AI took their jobs. Now they get paid to make it sound human

https://www.bbc.com/future/article/20240612-the-people-making-ai-sound-more-human


Be Sure To Read This Week’s Main Post:

This week’s executive overview and top links are here:

AI News #38: Week Ending 06/21/2024 with Executive Summary and Top 91 Links

The post you just read is a deep-dive extension of my weekly newsletter, This Week In AI, an executive summary of the top things to know in AI. Each week, I create an accessible overview so that laypeople can feel confident they are conversant with the week’s AI developments. I include a curated list of must-click links of the week, to offer everyone a hands-on opportunity to explore the most intriguing updates in artificial intelligence across various categories, including robotics, imagery, video, AR/VR, science, ethics, and more. Beyond the overview, I post these topic-based deeper dives. If you haven’t read this week’s overview, I recommend starting there.

Credits/Sources

Most of these weekly links come from just a few prolific oversharing sources. Please follow them, as they work hard to find the news each week and they make it a lot easier for me to compile.

For previous issues, please visit the archives!

Thanks for reading!


Discover more from Ethan B. Holland
