User Tools

Site Tools


nlp:post-training

Post-Training

Post-training refers to the things done to a LLM after pre-training to improve its performance, such as supervised fine-tuning, RLHF, instruction tuning, etc. This is a critical step before releasing the LLM. Typically, this refers to things done by the company or group before releasing the LLM, not the things done afterwards to customize to a specific application. (For an example of this usage, see the GPT-4 technical report.)

Overviews

Papers

Context-Extension

Sub-Areas

nlp/post-training.txt · Last modified: 2025/10/07 06:24 by jmflanig

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki