Ethan B. Holland

Over 50,200 manually organized AI links and counting

OpenAI News: Week Ending 10/11/2024

October 11, 2024

“We’re releasing a new benchmark, MLE-bench, to measure how well AI agents perform at machine learning engineering. The benchmark consists of 75 machine learning engineering-related competitions sourced from Kaggle.

https://twitter.com/OpenAI/status/1844429536353714427

“We benchmarked the @OpenAI’s DevDay Eval product and @bespokelabsai’s Minicheck for hallucination detection. Minicheck is the current best hallucination detector on @guardrails_ai Hub. OpenAI: – Accuracy: 69.19% – F1: 0.7564 – High recall, lower precision Minicheck: – Accuracy:

https://twitter.com/ShreyaR/status/1843784773346701640

“I’m leaving OpenAI after over 2 years of wild ride. Alongside @barret_zoph , @LiamFedus , @johnschulman2 , and many others I got to build a “low key research preview” product that became ChatGPT. While we were all excited to work on it, none of us expected it to be where it is” / X

https://twitter.com/Luke_Metz/status/1844161466032914645

“speaking of chatgpt, was trying to figure out the perfect walled garden i someday wanted to build. the “edit this area” of the image gen tool is so helpful for brainstorming ideas quickly. after 10 minutes of playing around, i ❤️

https://twitter.com/sama/status/1845510942949253156

Orchestrating Agents: Routines and Handoffs | OpenAI Cookbook

https://cookbook.openai.com/examples/orchestrating_agents

“Reminder: 1. LLM’s, transformers…etc not state secrets any longer. 2. Apple not in OpenAI consortium = Gemini or Anthropic or both as partners. 3. If you believe AI is “core tech” then Apple will not let a third-party control their destiny no matter the cost. (See chips as” / X

https://twitter.com/Dagk/status/1844253188972568673

“After running these additional experiments, we were impressed by a few things: 1) OpenAI o1 models show a consistent improvement over Anthropic and Google models on our long context RAG Benchmark up to 128k tokens. (3/5)” / X

https://twitter.com/DbrxMosaicAI/status/1844492163293511890

The OpenAI Talent Exodus Gives Rivals an Opening | WIRED

https://www.wired.com/story/openai-departures-research-rivals-artificial-intelligence

“2) Despite lower performance than the SOTA OpenAI and Anthropic models, Google Gemini 1.5 models have consistent RAG performance at extreme context lengths of up to 2 million tokens. (4/5)” / X

https://twitter.com/DbrxMosaicAI/status/1844492164501471261

“Why @OpenAI does not prioritize API Revenue and focuses on consumer products (ChatGPT), my thoughts. 🤔 Why not API: > Open models will be equally good, and enterprises might prefer more control > Models will become smaller and cheaper to run -> less revenue/margin > Other

https://twitter.com/_philschmid/status/1844339615915704747

Uber to launch AI assistant powered by OpenAI’s GPT-4o to help drivers go electric | Reuters

https://www.reuters.com/technology/artificial-intelligence/uber-launch-ai-assistant-powered-by-openais-gpt-4o-help-drivers-go-electric-2024-10-08

“The new Realtime API with web crawling is mind-blowing! Talk in realtime with any website. Powered by the OpenAI Realtime API and @firecrawl_dev 🔥 Check it out:

https://twitter.com/nickscamara_/status/1842243883842904529

“Interesting observation by altimeter — OpenAl revenue exceeds Google at the time of their IPO

https://twitter.com/bilawalsidhu/status/1843463461902397539

Microsoft’s AI Story Is Getting Complicated – WSJ

https://www.wsj.com/tech/ai/microsofts-ai-story-is-getting-complicated-ebe63ac9

OpenAI reducing dependency on Microsoft data centers, The Information reports – TipRanks.com

https://www.tipranks.com/news/the-fly/openai-reducing-dependency-on-microsoft-data-centers-the-information-reports

OpenAI Leaders Say Microsoft Isn’t Moving Fast Enough to Supply Servers — The Information

https://www.theinformation.com/articles/openai-eases-away-from-microsoft-data-centers

“Launch GPT4 Chat Interface in just 3 lines of code! It can’t get simpler than this 😀. Being covered by popular publications as we speak! import gradio as gr import openai_gradio gr.load( name=’gpt-4-turbo’, src=openai_gradio.registry,).launch() This and more in Gradio 5 —

https://twitter.com/Gradio/status/1843698665472368665

Before Mira Murati’s surprise exit from OpenAI, staff grumbled its o1 model had been released prematurely | Fortune

https://fortune.com/2024/10/01/openai-sam-altman-mira-murati-gpt-4o-o1-chatgpt-turbulent-year

OpenAI gets $4 billion revolving credit line on top of latest funding

https://www.cnbc.com/2024/10/03/openai-gets-4-billion-revolving-credit-line-on-top-of-latest-funding.html

OpenAI partners with Cosmopolitan and Elle publisher Hearst

https://www.engadget.com/ai/openai-partners-with-cosmopolitan-and-elle-publisher-hearst-180517248.html?src=rss

OpenAI’s GPT Store Has Left Some Developers in the Lurch | WIRED

https://www.wired.com/story/openai-gpt-store

“openai-gradio a Python package that makes it very easy for developers to create web apps that are powered by @OpenAI API in a few lines of code pip install openai-gradio

https://twitter.com/_akhaliq/status/1843363506697187626

OpenAI Projections Imply Losses Tripling to $14 Billion in 2026 — The Information

https://www.theinformation.com/articles/openai-projections-imply-losses-tripling-to-14-billion-in-2026

OpenAI and Hearst Content Partnership | OpenAI

https://openai.com/index/hearst

OpenAI Funding Fuels Wave of Big AI Deals — The Information

https://www.theinformation.com/articles/openai-funding-fuels-wave-of-big-ai-deals

“BREAKING: Looks like OpenAI is entering the arena against Perplexity… citations are now in GPT-4o 👀

https://twitter.com/thomasschulzz/status/1844062893723250940?s=46

The Race to Block OpenAI’s Scraping Bots Is Slowing Down | WIRED

https://www.wired.com/story/open-ai-publisher-deals-scraping-bots

Generative AI’s Act o1: The Reasoning Era Begins | Sequoia Capital