
Science & Technology

The leaderboard ‘you can’t game,’ funded by the companies it ranks | Equity Podcast

Artificial intelligence models are multiplying fast, and competition is stiff. With so many players crowding the space, which one will be the best — and who decides that? Arena, formerly LM Arena, has emerged as the de facto public leaderboard for frontier LLMs, influencing funding, launches, and PR cycles. In just seven months, the startup went from a UC Berkeley PhD research project to being valued at $1.7 billion. 

Watch as Equity host Rebecca Bellan catches up with Arena co-founders Anastasios Angelopoulos and Wei-Lin Chiang about how their platform became the go-to leaderboard for frontier AI models, and how they’re trying to build a neutral benchmark even as companies like OpenAI, Google, and Anthropic back the project.

Subscribe to Equity on YouTube, Apple Podcasts, Overcast, Spotify and all the casts. You can also follow Equity on X and Threads at @EquityPod.

Chapters:

00:00 Intro

03:00 How Arena’s leaderboard works, and why it’s different from static benchmarks

07:00 Reproducibility concerns and how to scale

08:45 Can Arena stay independent while taking money from the labs it ranks?

11:15 Diversity, fraud prevention, and abuse mitigation

18:15 Arena’s “data moat”

19:20 Agent benchmarking and expert leaderboards

21:40 Open sourcing data

22:45 How do Arena’s rankings shape AI development?

24:15 Outro


CNET

Android 17 REVEALED: Pause Point Is My Favorite Feature

Android 17 is out, and if you have a Google Pixel phone, you can try it right away. Android 17 comes with a bunch of new features, including a new green-screen selfie tool, a Pause Point tool and loads of video and photo improvements for Instagram.

00:00 – Introduction to Android 17
00:11 – Screen Reactions Feature
00:57 – Instagram Video Stabilization
01:27 – Pause Point: Mindful App Usage
02:05 – Smart Enhance for Photos and Video
02:35 – Sound Separation and Audio Extraction
04:07 – How to Download and Conclusion


Science & Technology

Black Founders Had a Great Fundraising Quarter…With a Catch

On one hand, US-based, Black-founded startups have already raised $643M, 70% of what was raised in the entirety of last year.

But dig a little deeper into the numbers and you’ll find that, in the words of Crunchbase’s head of research, “…data has shown a persistent decline in funding to Black-founded companies that outpaces the overall decline in startup funding.”

CNET

The US Government Doesn’t Want You to Buy This Car

Xpeng brought Mashable reporter Amanda Yeo to China to experience the new VLA 2.0 autonomous driving model inside its P7 electric vehicle.

0:00 The Car the US Government Doesn’t Want You to Buy
0:18 Meet XPENG: China’s High-Tech Tesla Rival
0:39 How VLA 2.0 Autonomous Driving Works
1:43 Stress Testing Self-Driving in Hectic Traffic
2:21 The Challenge of “Corner Cases” in Autonomy
2:43 Hands-Free Self-Parking Demo
3:00 Heads-Up Display and Interior Tech
3:24 XPENG’s Personal Flying Machines
4:22 Why Chinese EVs are Banned in the US
