Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Updated Dec 25, 2024 - Python
[TMLR 2024] Efficient Large Language Models: A Survey
Infrastructures™ for Machine Learning Training/Inference in Production.
Dive into machine learning systems, starting by reinventing the wheel.
Learn how to design and implement effective Machine Learning systems from start to finish.
Curated collection of papers in machine learning systems
Oort: Efficient Federated Learning via Guided Participant Selection
A curated list of high-quality papers on resource-efficient LLMs 🌱
Personal paper reading notes (covering cloud computing, resource management, systems, machine learning, deep learning, and other interesting topics).
Course Material for the UG Course COMP4901Y
Machine Learning Compiler Road Map
CSCE 585 - Machine Learning Systems
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
A C++ implementation of the scalar-valued autograd engine micrograd
A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.
[Long Term Support] [SIGCOMM 2023] Lightning: A Reconfigurable Photonic-Electronic SmartNIC for Fast and Energy-Efficient Inference
This is the course project for CSCE585: ML Systems. Students will build their machine learning systems on the provided infrastructure, Athena.
Assignments for Data Intensive Systems for Machine Learning Coursework