ml:systems_ml

Differences

This shows you the differences between two versions of the page.

ml:systems_ml [2024/09/07 00:45] – [Conferences and Workshops] jmflanig
ml:systems_ml [2025/07/18 06:26] (current) – [Papers] jmflanig
Line 1: Line 1:
 ====== Systems & ML ======
 Papers related to systems (making things efficient) and machine learning research.
 +
 +===== Papers =====
 +  * [[https://arxiv.org/pdf/2402.01869|2024 - InferCept: Efficient Intercept Support for Augmented Large Language Model Inference]]
 +  * [[https://arxiv.org/pdf/2410.08391|Horton et al 2024 - KV Prediction for Improved Time to First Token]]
  
 ===== Conferences and Workshops =====
Line 10: Line 14:
  
 ===== Related Pages =====
 +  * [[Efficient NNs]]
   * [[GPU Deep Learning]]
 +
  
ml/systems_ml.1725669900.txt.gz · Last modified: 2024/09/07 00:45 by jmflanig
