nlp:human-in-the-loop
Differences
This shows you the differences between two versions of the page.
| nlp:human-in-the-loop [2024/03/13 23:20] – [RLHF] jmflanig | nlp:human-in-the-loop [2025/05/31 07:43] (current) – [RLHF] jmflanig | ||
|---|---|---|---|
| Line 11: | Line 11: | ||
| * Interactive AI Model Debugging and Correction (2022 Thesis) ([[https:// | * Interactive AI Model Debugging and Correction (2022 Thesis) ([[https:// | ||
| * InstructGPT: | * InstructGPT: | ||
| + | * [[https:// | ||
| + | * **[[https:// | ||
| ==== Classification ==== | ==== Classification ==== | ||
| Line 29: | Line 31: | ||
| === Overviews === | === Overviews === | ||
| - | * Quick overview: section 3 of [[https:// | + | * Quick overview: section 3 of [[https:// |
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| === Papers === | === Papers === | ||
| Line 37: | Line 45: | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | * InstructGPT: | + | * InstructGPT: |
| * Used PPO: [[https:// | * Used PPO: [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | * [[https:// | + | * [[https:// |
| - | for Language Model Training]] | + | * [[https:// |
| + | * [[https:// | ||
| + | |||
| + | === Crowdsourcing & Data Collection === | ||
| + | * [[https:// | ||
| ===== Conferences and Workshops ===== | ===== Conferences and Workshops ===== | ||
| Line 52: | Line 66: | ||
| ===== Related Pages ===== | ===== Related Pages ===== | ||
| + | * [[HCI and NLP]] | ||
| * [[Instruction-Tuning]] | * [[Instruction-Tuning]] | ||
| * [[Lifelong Learning]] | * [[Lifelong Learning]] | ||
nlp/human-in-the-loop.1710372037.txt.gz · Last modified: 2024/03/13 23:20 by jmflanig