Ching-Yun “Irene” Ko

Training LLMs to self-detoxify their language

April 23, 2025

A new method from the MIT-IBM Watson AI Lab helps large language models steer their own responses toward safer, more ethical, value-aligned outputs.