====== Systems & ML ======

Papers related to systems (making things efficient) and machine learning research.

===== Papers =====

  * [[https://arxiv.org/pdf/2402.01869|2024 - InferCept: Efficient Intercept Support for Augmented Large Language Model Inference]]
  * [[https://arxiv.org/pdf/2410.08391|Horton et al 2024 - KV Prediction for Improved Time to First Token]]

===== Conferences and Workshops =====

  * [[https://mlsys.org/|MLSys]]
  * [[https://es-fomo.com/|Efficient Systems for Foundation Models (Workshop)]]
  * [[https://www.asplos-conference.org/|International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)]] (Some ML systems papers go here)
  * SOSP
  * OSDI

===== Related Pages =====

  * [[Efficient NNs]]
  * [[GPU Deep Learning]]