


Model Editing and Unlearning

Model editing modifies a trained model, such as a large language model, to change specific facts encoded in the model. Machine unlearning adjusts a trained model to “remove” the influence of one or more datapoints (or classes of datapoints, such as all datapoints about bioweapons) that were used to train it, so that the model behaves like one that was trained without those datapoints.
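As a rough illustration of the unlearning criterion above, the sketch below uses a toy "model" that is just the mean of its training data. This is an assumption made for illustration, not a method from any of the papers listed on this page; the names train and unlearn are hypothetical. For this toy case the contribution of a datapoint can be removed exactly, so the unlearned model can be compared directly with a model retrained without that datapoint.

```python
# Toy "model": the mean of the training data.
# This is a hypothetical stand-in for a real learner, chosen because
# the effect of a single datapoint can be removed exactly.

def train(data):
    """Fit the toy model: return the mean of the training data."""
    return sum(data) / len(data)


def unlearn(model, n, forget):
    """Remove the points in `forget` from a mean fitted on `n` points.

    Recovers the sum of the training data from the mean, subtracts the
    forgotten points, and renormalises by the number of remaining points.
    """
    remaining = n - len(forget)
    if remaining <= 0:
        raise ValueError("cannot forget every training point")
    return (model * n - sum(forget)) / remaining


data = [2.0, 4.0, 6.0, 8.0]
forget = [8.0]

model = train(data)
unlearned = unlearn(model, len(data), forget)

# Reference model: retrain from scratch without the forgotten points.
retrained = train([x for x in data if x not in forget])

# The unlearned model matches the retrained one (up to rounding).
assert abs(unlearned - retrained) < 1e-12
```

For real models such as neural networks, no exact closed-form update of this kind is generally available, so retraining without the forgotten data serves as the reference that an unlearning method is compared against.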

Model Editing

In NLP

Machine Unlearning

Overviews

Key Papers

In NLP or LLMs

Theory Papers

ml/model_editing_and_unlearning.1751492652.txt.gz · Last modified: 2025/07/02 21:44 by jmflanig
