nlp:large_reasoning_models
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| nlp:large_reasoning_models [2025/05/29 07:26] – jmflanig | nlp:large_reasoning_models [2025/10/10 09:05] (current) – [Papers] jmflanig | ||
|---|---|---|---|
| Line 9: | Line 9: | ||
| ===== Papers ===== | ===== Papers ===== | ||
| - | * OpenAI o1 | + | |
| + | * **OpenAI o1** | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 16: | Line 17: | ||
| * R1 replication on small datasets | * R1 replication on small datasets | ||
| * [[https:// | * [[https:// | ||
| - | * General papers | + | |
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| + | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| Line 25: | Line 27: | ||
| * [[https:// | * [[https:// | ||
| * [[http:// | * [[http:// | ||
| - | * [[https:// | ||
| * [[https:// | * [[https:// | ||
| - | * Concise Reasoning | + | |
| * Using RL | * Using RL | ||
| * [[https:// | * [[https:// | ||
| - | * Models | + | * **Parallel and Collaborative Thinking** |
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * **Problems, Criticisms and Insights** | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * **[[https:// | ||
| + | * **Models** | ||
| * Phi-4-Reasoning: | * Phi-4-Reasoning: | ||
| + | * [[https:// | ||
| ===== Related Pages ===== | ===== Related Pages ===== | ||
| - | * [[Reasoning Chains]] | + | * [[Reasoning]] |
| + | * [[Reasoning# | ||
| * [[ml: | * [[ml: | ||
| * [[Test-Time Scaling]] | * [[Test-Time Scaling]] | ||
nlp/large_reasoning_models.1748503581.txt.gz · Last modified: 2025/05/29 07:26 by jmflanig