Skip to content

gpu-mode/awesomeMLSys

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

ML Systems Onboarding Reading List

This is a reading list of papers/videos/repos I've personally found useful as I was ramping up on ML Systems and that I wish more people would just sit and study carefully during their work hours. If you're looking for more recommendations, go through the citations of the below papers and enjoy!

Attention Mechanism

Performance Optimizations

Quantization

Long context length

Sparsity

  • Venom: Vectorized N:M Format for sparse tensor cores when hardware only supports 2:4
  • Megablocks: Efficient Sparse training with mixture of experts
  • ReLu Strikes Back: Really enjoyed this paper as an example of doing model surgery for more efficient inference

Distributed