Awesome Online Machine Learning

Online machine learning is a subset of machine learning where data arrives sequentially. In contrast to the more traditional batch learning, online learning methods update themselves incrementally with one data point at a time.

Courses and books
Blog posts
Software
- Modelling
- Deployment
Papers

Courses and books

Machine Learning for Streaming Data with Python
IE 498: Online Learning and Decision Making
Introduction to Online Learning
Machine Learning the Feature — Gives some insights into the inner workings of Vowpal Wabbit, especially the slides on online linear learning.
Machine learning for data streams with practical examples in MOA
Online Methods in Machine Learning (MIT)
Streaming 101: The world beyond batch
Prediction, Learning, and Games
Introduction to Online Convex Optimization
Reinforcement Learning and Stochastic Optimization: A unified framework for sequential decisions — The entire book builds upon Online Learning paradigm in applied learning/optimization problems, Chapter 3 Online learning being the reference.
Big Data course at the CILVR lab at NYU — Focus on linear models and bandits. Some courses are given by John Langford, the creator of Vowpal Wabbit.
Machine Learning for Personalization — Course from Columbia by Tony Jebara, covers bandits.
An Introduction to Online Learning
Streaming Data Analytics - Course from Politecnico di Milano.

Blog posts

Software

See more here.

Modelling

River — A Python library for general purpose online machine learning.
dask
Jubatus
Flink ML - Apache Flink machine learning library
LIBFFM — A Library for Field-aware Factorization Machines
LIBLINEAR — A Library for Large Linear Classification
LIBOL — A collection of online linear models trained with first and second order gradient descent methods. Not maintained.
MOA
scikit-learn — Some of scikit-learn's estimators can handle incremental updates, although this is usually intended for mini-batch learning. See also the "Computing with scikit-learn" page.
Spark Streaming — Doesn't do online learning per say, but instead mini-batches the data into fixed intervals of time.
SofiaML
StreamDM — A machine learning library on top of Spark Streaming.
Tornado
VFML
Vowpal Wabbit

Deployment

KappaML
django-river-ml — a Django plugin for deploying River models
chantilly — a prototype meant to be compatible with River (previously Creme)

Papers

Linear models

Support vector machines

Neural networks

Three scenarios for continual learning (2019)

Decision trees

Unsupervised learning

Time series

Online Learning for Time Series Prediction (2013)

Drift detection

A Survey on Concept Drift Adaptation (2014)

Anomaly detection

Metric learning

Miscellaneous

Surveys

General-purpose algorithms

Maintaining Sliding Window Skylines on Data Streams (2006)
The Sliding DFT (2003) — An online variant of the Fourier Transform, a concise explanation is available here
Sketching Algorithms for Big Data

Hyperparameter tuning

ChaCha for Online AutoML (2021)

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.gitattributes		.gitattributes
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome Online Machine Learning

Courses and books

Blog posts

Software

Modelling

Deployment

Papers

Linear models

Support vector machines

Neural networks

Decision trees

Unsupervised learning

Time series

Drift detection

Anomaly detection

Metric learning

Graph theory

Ensemble models

Expert learning

Active learning

Miscellaneous

Surveys

General-purpose algorithms

Hyperparameter tuning

Evaluation

About

Releases

Packages

Contributors 7

License

online-ml/awesome-online-machine-learning

Folders and files

Latest commit

History

Repository files navigation

Awesome Online Machine Learning

Courses and books

Blog posts

Software

Modelling

Deployment

Papers

Linear models

Support vector machines

Neural networks

Decision trees

Unsupervised learning

Time series

Drift detection

Anomaly detection

Metric learning

Graph theory

Ensemble models

Expert learning

Active learning

Miscellaneous

Surveys

General-purpose algorithms

Hyperparameter tuning

Evaluation

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 7

Packages