The Download: rethinking AI benchmarks, and the ethics of AI agents

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology.

The way we measure progress in AI is terrible

Every time a new AI model is released, it’s typically touted with claims of acing a series of performance benchmarks. OpenAI’s GPT-4o, for example, was launched in May with a compilation of results showing its performance topping every other AI company’s latest model in several tests. The problem is that these benchmarks are poorly designed, the results hard…
