ml:model_editing_and_unlearning
Table of Contents
Model Editing and Unlearning
Model editing is where a model, such as a large language model, is “edited” to change the facts in the model. Machine unlearning is where a trained model is adjusted to “remove” one or more datapoints that were used to train the model, so that it behaves like a model that was trained without those datapoints. The datapoints to remove can either be specific datapoints from the training set, or classes of datapoints, such as all datapoints about bioweapons.
Model Editing
In NLP
See also Knowledge Editing.
Machine Unlearning
Overviews
- Paper lists
- For NLP or LLMs
- Liu et al 2024 - Rethinking Machine Unlearning for Large Language Models (This is also a survey paper.)
Key Papers
In NLP or LLMs
Theory Papers
Related Pages
ml/model_editing_and_unlearning.txt · Last modified: 2025/07/07 07:13 by jmflanig