Table of Contents
Systems & ML
Papers
Conferences and Workshops
Related Pages
Systems & ML
Papers related to systems (making things efficient) and machine learning research.
Papers
2024 - InferCept: Efficient Intercept Support for Augmented Large Language Model Inference
Horton et al 2024 - KV Prediction for Improved Time to First Token
Conferences and Workshops
MLSys
Efficient Systems for Foundation Models (Workshop)
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
(Some ML systems papers go here)
SOSP
OSDI
Related Pages
Efficient NNs
GPU Deep Learning