Diego Martí Monsó

FILTER
Selected:

Combining next-token prediction and video diffusion in computer vision and robotics

October 18, 2024

A new method can train a neural network to sort corrupted data while anticipating next steps. It can make flexible plans for robots, generate high-quality video, and help AI agents navigate digital environments.