nlp:large_reasoning_models
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| nlp:large_reasoning_models [2025/06/01 20:51] – [Papers] jmflanig | nlp:large_reasoning_models [2025/10/10 09:05] (current) – [Papers] jmflanig | ||
|---|---|---|---|
| Line 10: | Line 10: | ||
| ===== Papers ===== | ===== Papers ===== | ||
| * [[https:// | * [[https:// | ||
| - | * OpenAI o1 | + | |
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 17: | Line 17: | ||
| * R1 replication on small datasets | * R1 replication on small datasets | ||
| * [[https:// | * [[https:// | ||
| - | * General papers | + | |
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 26: | Line 27: | ||
| * [[https:// | * [[https:// | ||
| * [[http:// | * [[http:// | ||
| - | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * **Concise Reasoning** | * **Concise Reasoning** | ||
| * Using RL | * Using RL | ||
| * [[https:// | * [[https:// | ||
| - | * **Parallel Thinking** | + | * **Parallel |
| * [[https:// | * [[https:// | ||
| - | | + | * [[https:// |
| + | * [[https:// | ||
| + | | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| + | * **[[https:// | ||
| * **Models** | * **Models** | ||
| * Phi-4-Reasoning: | * Phi-4-Reasoning: | ||
| Line 40: | Line 44: | ||
| ===== Related Pages ===== | ===== Related Pages ===== | ||
| - | * [[Reasoning Chains]] | + | * [[Reasoning]] |
| + | * [[Reasoning# | ||
| * [[ml: | * [[ml: | ||
| * [[Test-Time Scaling]] | * [[Test-Time Scaling]] | ||
nlp/large_reasoning_models.1748811074.txt.gz · Last modified: 2025/06/01 20:51 by jmflanig