Yilun Du


Combining next-token prediction and video diffusion in computer vision and robotics

October 18, 2024

A new method can train a neural network to sort corrupted data while anticipating next steps. It can make flexible plans for robots, generate high-quality video, and help AI agents navigate digital environments.

In this conceptual painting, a computer produces several different images in response to the prompt "A horse in a yellow flower field".

3 Questions: How AI image generators could help robots

October 28, 2022

Yilun Du, a PhD student and MIT CSAIL affiliate, discusses the potential applications of generative art beyond the explosion of images that put the web into creative hysterics.