nlp:large_reasoning_models
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| nlp:large_reasoning_models [2025/05/30 23:15] – [Papers] jmflanig | nlp:large_reasoning_models [2025/10/10 09:05] (current) – [Papers] jmflanig | ||
|---|---|---|---|
| Line 10: | Line 10: | ||
| ===== Papers ===== | ===== Papers ===== | ||
| * [[https:// | * [[https:// | ||
| - | * OpenAI o1 | + | |
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 17: | Line 17: | ||
| * R1 replication on small datasets | * R1 replication on small datasets | ||
| * [[https:// | * [[https:// | ||
| - | * General papers | + | |
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 26: | Line 27: | ||
| * [[https:// | * [[https:// | ||
| * [[http:// | * [[http:// | ||
| - | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | * Concise Reasoning | + | |
| * Using RL | * Using RL | ||
| * [[https:// | * [[https:// | ||
| - | * Models | + | * **Parallel and Collaborative Thinking** |
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * **Problems, Criticisms and Insights** | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * **[[https:// | ||
| + | * **Models** | ||
| * Phi-4-Reasoning: | * Phi-4-Reasoning: | ||
| + | * [[https:// | ||
| ===== Related Pages ===== | ===== Related Pages ===== | ||
| - | * [[Reasoning Chains]] | + | * [[Reasoning]] |
| + | * [[Reasoning# | ||
| * [[ml: | * [[ml: | ||
| * [[Test-Time Scaling]] | * [[Test-Time Scaling]] | ||
nlp/large_reasoning_models.1748646944.txt.gz · Last modified: 2025/05/30 23:15 by jmflanig