ml:mechanistic_interpretability
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| ml:mechanistic_interpretability [2025/05/16 05:27] – [Resources] jmflanig | ml:mechanistic_interpretability [2025/06/02 11:23] (current) – [Papers] jmflanig | ||
|---|---|---|---|
| Line 12: | Line 12: | ||
| * [[https:// | * [[https:// | ||
| * **[[https:// | * **[[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * **Induction Heads** | * **Induction Heads** | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| * **Explaining Neurons** | * **Explaining Neurons** | ||
| * [[https:// | * [[https:// | ||
| Line 24: | Line 29: | ||
| ===== Sparse Autoencoders ===== | ===== Sparse Autoencoders ===== | ||
| This section should maybe be moved into its own page. | This section should maybe be moved into its own page. | ||
| + | |||
| + | See also [[nlp: | ||
| * [[https:// | * [[https:// | ||
| Line 31: | Line 38: | ||
| ===== Resources ===== | ===== Resources ===== | ||
| * **Research Threads and Blogs** | * **Research Threads and Blogs** | ||
| - | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * **Companies** | * **Companies** | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| ===== Related Pages ===== | ===== Related Pages ===== | ||
| + | * [[nlp: | ||
| * [[nlp: | * [[nlp: | ||
| * [[nlp: | * [[nlp: | ||
| + | * [[nlp:LLM Safety]] | ||
| * [[Neural Network Psychology]] | * [[Neural Network Psychology]] | ||
| * [[nlp: | * [[nlp: | ||
| * [[Trustworthy AI]] | * [[Trustworthy AI]] | ||
| * [[nlp: | * [[nlp: | ||
ml/mechanistic_interpretability.1747373231.txt.gz · Last modified: 2025/05/16 05:27 by jmflanig