Say WOW

Suborbital space tourism finally arrives | FCC prepares to run public C-band auction | The big four in the U.S. launch industry — United Launch Alliance, SpaceX, Blue Origin and Northrop Grumman — hope to be one of two providers that will receive five-year contracts later this year to launch national security payloads starting in 2022. | China’s launch rate stays high | The International Space Station is the largest ever crewed object in space.

Combining next-token prediction and video diffusion in computer vision and robotics

October 16, 2024

| No Comments

In the current AI zeitgeist, sequence models have skyrocketed in popularity for their ability to analyze data and predict what to do next. For instance, you’ve likely used next-token prediction models like ChatGPT, which anticipate each word (token) in a sequence to form answers to users’ queries. There are also full-sequence diffusion models like Sora, which convert words into dazzling, realistic visuals by successively “denoising” an entire video sequence. Researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have proposed a simple change to the diffusion training scheme that makes…

This content is for Member members only.
Log In Register

Future

Combining next-token prediction and video diffusion in computer vision and robotics

What’s on BrandMoiAhora

Be Up to date at all times

Be Part of a Groove Society