nlp:alignment

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
nlp:alignment [2024/04/28 02:10] – [Posts] jmflanignlp:alignment [2025/06/03 00:26] (current) – [People] jmflanig
Line 3: Line 3:
 ===== Overviews ===== ===== Overviews =====
   * [[https://arxiv.org/pdf/2309.15025.pdf|Shen et al 2023 - Large Language Model Alignment: A Survey]]   * [[https://arxiv.org/pdf/2309.15025.pdf|Shen et al 2023 - Large Language Model Alignment: A Survey]]
 +  * [[https://arxiv.org/pdf/2310.19852|Ji et al 2023 - AI Alignment: A Comprehensive Survey]]
 +  * [[https://arxiv.org/pdf/2404.09932|Anwar et al 2024 - Foundational Challenges in Assuring Alignment and Safety of Large Language Models]]
  
 ===== Blog Posts, etc ===== ===== Blog Posts, etc =====
-  * [[https://www.lesswrong.com/posts/QBAjndPuFbhEXKcCr/my-understanding-of-what-everyone-in-technical-alignment-is|What Everyone in Alignment is Doing and Why]]+  * [[https://www.lesswrong.com/posts/QBAjndPuFbhEXKcCr/my-understanding-of-what-everyone-in-technical-alignment-is|2022 - What Everyone in Alignment is Doing and Why]]
  
 ===== Papers ===== ===== Papers =====
Line 17: Line 19:
  
 ===== People ===== ===== People =====
 +  * [[https://scholar.google.com/citations?user=kV9XRxYAAAAJ&hl=en|Sam Bowman]] 
 +  * [[https://scholar.google.com/citations?user=czyretsAAAAJ&hl=en|Dan Hendrycks]]
  
 ===== Related Pages ===== ===== Related Pages =====
 +  * [[AGI]]
   * [[Instruction-Tuning]] Instruction-tuning is often similar to alignment, but alignment is broader.  Instruction-tuning methods often falls under alignment.   * [[Instruction-Tuning]] Instruction-tuning is often similar to alignment, but alignment is broader.  Instruction-tuning methods often falls under alignment.
   * [[Language Model]]   * [[Language Model]]
 +  * [[LLM Safety]] 
 +  * [[ml:Mechanistic Interpretability]]
nlp/alignment.1714270237.txt.gz · Last modified: 2024/04/28 02:10 by jmflanig

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki