Awesome Computer Vision

A curated list of awesome computer vision resources, inspired by awesome-php.

For a list of people in computer vision with their academic genealogy, please visit here

Contributing

Please feel free to send me pull requests or email ([email protected]) to add links.

Books

Computer Vision

Computer Vision: Models, Learning, and Inference - Simon J. D. Prince 2012
Computer Vision: Algorithms and Applications - Richard Szeliski 2010
Computer Vision: A Modern Approach (2nd edition) - David Forsyth and Jean Ponce 2011
Multiple View Geometry in Computer Vision - Richard Hartley and Andrew Zisserman 2004
Computer Vision - Linda G. Shapiro 2001
Vision Science: Photons to Phenomenology - Stephen E. Palmer 1999
Visual Object Recognition synthesis lecture - Kristen Grauman and Bastian Leibe 2011
Learning OpenCV: Computer Vision with the OpenCV Library - Gary Bradski and Adrian Kaehler

Machine Learning

Pattern Recognition and Machine Learning - Christopher M. Bishop 2007
Neural Networks for Pattern Recognition - Christopher M. Bishop 1995
Probabilistic Graphical Models: Principles and Techniques - Daphne Koller and Nir Friedman 2009
Pattern Classification - Richard O. Duda, Peter E. Hart, and David G. Stork 2000
Machine Learning - Tom M. Mitchell 1997
[Gaussian Processes for Machine Learning](http://www.gaussianprocess.org/gpml/) - Carl Edward Rasmussen and Christopher K. I. Williams 2005
Learning From Data - Yaser S. Abu-Mostafa, Malik Magdon-Ismail and Hsuan-Tien Lin 2012
Neural Networks and Deep Learning - Michael Nielsen 2014

Fundamentals

Linear Algebra and Its Applications - Gilbert Strang 1995

Courses

Computer Vision

Visual Object and Activity Recognition - Alexei A. Efros and Trevor Darrell (UC Berkeley)
Computer Vision - Steve Seitz (University of Washington)
Visual Recognition - Kristen Grauman (UT Austin)
Language and Vision - Tamara Berg (UNC Chapel Hill)
Convolutional Neural Networks for Visual Recognition - Fei-Fei Li and Andrej Karpathy (Stanford University)
Computer Vision - Rob Fergus (NYU)
Computer Vision - Derek Hoiem (UIUC)
Computer Vision: Foundations and Applications - Kalanit Grill-Spector and Fei-Fei Li (Stanford University)
High-Level Vision: Behaviors, Neurons and Computational Models - Fei-Fei Li (Stanford University)
Advances in Computer Vision - Antonio Torralba and Bill Freeman (MIT)

Computational Photography

Image Manipulation and Computational Photography - Alexei A. Efros (UC Berkeley)
Computational Photography - Alexei A. Efros (CMU)
Computational Photography - Derek Hoiem (UIUC)
Computational Photography - James Hays (Brown University)
Digital & Computational Photography - Frédo Durand (MIT)
Computational Camera and Photography - Ramesh Raskar (MIT Media Lab)
Computational Photography - Irfan Essa (Georgia Tech)
Courses in Graphics - Stanford University
Computational Photography - Rob Fergus (NYU)
Introduction to Visual Computing - Kyros Kutulakos (University of Toronto)
Computational Photography - Kyros Kutulakos (University of Toronto)

Machine Learning and Statistical Learning

Machine Learning - Andrew Ng (Stanford University)
Learning from Data - Yaser S. Abu-Mostafa (Caltech)
Statistical Learning - Trevor Hastie and Rob Tibshirani (Stanford University)
Statistical Learning Theory and Applications - Tomaso Poggio, Lorenzo Rosasco, Carlo Ciliberto, Charlie Frogner, Georgios Evangelopoulos, Ben Deen (MIT)
Statistical Learning - Genevera Allen (Rice University)
Practical Machine Learning - Michael Jordan (UC Berkeley)
Course on Information Theory, Pattern Recognition, and Neural Networks - David MacKay (University of Cambridge)

Optimization

Convex Optimization I - Stephen Boyd (Stanford University)
Convex Optimization II - Stephen Boyd (Stanford University)
Convex Optimization - Stephen Boyd (Stanford University)
Optimization at MIT - (MIT)
Convex Optimization - Ryan Tibshirani (CMU)

Papers

Conference papers on the web

CVPapers - Computer vision papers on the web
SIGGRAPH Papers on the web - Graphics papers on the web
NIPS Proceedings - NIPS papers on the web
Computer Vision Foundation open access
Annotated Computer Vision Bibliography - Keith Price (USC)
Calendar of Computer Image Analysis, Computer Vision Conferences - (USC)

Survey Papers

Tutorials and talks

Computer Vision

The Three R's of Computer Vision - Jitendra Malik (UC Berkeley) 2013
Applications to Machine Vision - Andrew Blake (Microsoft Research) 2008
The Future of Image Search - Jitendra Malik (UC Berkeley) 2008
Should I do a PhD in Computer Vision? - Fatih Porikli (Australian National University)

3D Computer Vision

3D Computer Vision: Past, Present, and Future - Steve Seitz (University of Washington) 2011
Reconstructing the World from Photos on the Internet - Steve Seitz (University of Washington) 2013

Internet Vision

The Distributed Camera - Noah Snavely (Cornell University) 2011
Planet-Scale Visual Understanding - Noah Snavely (Cornell University) 2014
A Trillion Photos - Steve Seitz (University of Washington) 2013

Computational Photography

Reflections on Image-Based Modeling and Rendering - Richard Szeliski (Microsoft Research) 2013
Photographing Events over Time - William T. Freeman (MIT) 2011
Old and New algorithm for Blind Deconvolution - Yair Weiss (The Hebrew University of Jerusalem) 2011
A Tour of Modern "Image Processing" - Peyman Milanfar (UC Santa Cruz/Google) 2010
Topics in image and video processing - Andrew Blake (Microsoft Research) 2007
Computational Photography - William T. Freeman (MIT) 2012

Learning and Vision

Where machine vision needs help from machine learning - William T. Freeman (MIT) 2011
Learning in Computer Vision - Simon Lucey (CMU) 2008
Learning and Inference in Low-Level Vision - Yair Weiss (The Hebrew University of Jerusalem) 2009

Object Recognition

Object Recognition - Larry Zitnick (Microsoft Research)
Generative Models for Visual Objects and Object Recognition via Bayesian Inference - Fei-Fei Li (Stanford University)

Graphical Models

Graphical Models for Computer Vision - Pedro Felzenszwalb (Brown University) 2012
Graphical Models - Zoubin Ghahramani (University of Cambridge) 2009
Machine Learning, Probability and Graphical Models - Sam Roweis (NYU) 2006
Graphical Models and Applications - Yair Weiss (The Hebrew University of Jerusalem) 2009

Machine Learning

A Gentle Tutorial of the EM Algorithm - Jeff A. Bilmes (UC Berkeley) 1998
Introduction To Bayesian Inference - Christopher Bishop (Microsoft Research) 2009
Support Vector Machines - Chih-Jen Lin (National Taiwan University) 2006
Bayesian or Frequentist, Which Are You? - Michael I. Jordan (UC Berkeley)

Optimization

Optimization Algorithms in Machine Learning - Stephen J. Wright (University of Wisconsin-Madison)
Convex Optimization - Lieven Vandenberghe (University of California, Los Angeles)
Continuous Optimization in Computer Vision - Andrew Fitzgibbon (Microsoft Research)
Beyond stochastic gradient descent for large-scale machine learning - Francis Bach (INRIA)

Deep Learning

A tutorial on Deep Learning - Geoffrey E. Hinton (University of Toronto)
Deep Learning - Ruslan Salakhutdinov (University of Toronto)
Scaling up Deep Learning - Yoshua Bengio (University of Montreal)
ImageNet Classification with Deep Convolutional Neural Networks - Alex Krizhevsky (University of Toronto)
The Unreasonable Effectiveness of Deep Learning - Yann LeCun (NYU/Facebook Research) 2014
Deep Learning for Computer Vision - Rob Fergus (NYU/Facebook Research)
High-dimensional learning with deep network contractions - Stéphane Mallat (École Normale Supérieure)

Software

External Resource Links

Computer Vision Resources - Jia-Bin Huang (UIUC)
Computer Vision Algorithm Implementations - CVPapers
Source Code Collection for Reproducible Research - Xin Li (West Virginia University)
CMU Computer Vision Page

General Purpose Computer Vision Library

Multiple-view Computer Vision

Feature Detection and Extraction

VLFeat
SIFT
- David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110.
BRISK
- Stefan Leutenegger, Margarita Chli and Roland Siegwart, "BRISK: Binary Robust Invariant Scalable Keypoints", ICCV 2011
SURF
- Herbert Bay, Andreas Ess, Tinne Tuytelaars, Luc Van Gool, "SURF: Speeded Up Robust Features", Computer Vision and Image Understanding (CVIU), Vol. 110, No. 3, pp. 346-359, 2008
FREAK
- A. Alahi, R. Ortiz, and P. Vandergheynst, "FREAK: Fast Retina Keypoint", CVPR 2012
AKAZE
- Pablo F. Alcantarilla, Adrien Bartoli and Andrew J. Davison, "KAZE Features", ECCV 2012

Low-level Vision

Stereo Vision

Optical Flow

Image Denoising

BM3D, KSVD

Super-resolution

Image Deblurring

Non-blind deconvolution

Blind deconvolution

Non-uniform Deblurring

Image Completion

Image Retargeting

RetargetMe

Alpha Matting

Image Pyramid

Edge-preserving image processing

Contour Detection and Image Segmentation

Interactive Image Segmentation

Video Segmentation

Camera calibration

Simultaneous localization and mapping

Single-view Spatial Understanding

Geometric Context - Derek Hoiem (CMU)
Recovering Spatial Layout - Varsha Hedau (UIUC)
Geometric Reasoning - David C. Lee (CMU)
RGBD2Full3D - Ruiqi Guo (UIUC)

Object Detection

Nearest Neighbor Search

General purpose nearest neighbor search

Nearest Neighbor Field Estimation

Visual Tracking

Saliency Detection

Attributes

Action Recognition

Egocentric cameras

Human-in-the-loop systems

Image Captioning

NeuralTalk

Optimization

Ceres Solver - Nonlinear optimization

Datasets

External Dataset Link Collection

CV Datasets on the web - CVPapers
Are we there yet? - Which paper provides the best results on standard dataset X?
Computer Vision Dataset on the web
Yet Another Computer Vision Index To Datasets
ComputerVisionOnline Datasets
CVOnline Dataset
CV datasets
visionbib

Low-level Vision

Stereo Vision

Optical Flow

Image Super-resolution

Single-Image Super-Resolution: A Benchmark

Intrinsic Images

Material Recognition

Multi-view Reconstruction

Multi-View Stereo Reconstruction

Saliency Detection

Visual Tracking

Visual Surveillance

VIRAT
CAM2

Saliency Detection

Change detection

ChangeDetection.net

Visual Recognition

Image Classification

Scene Recognition

Object Detection

Semantic labeling

Multi-view Object Detection

Fine-grained Visual Recognition

Pedestrian Detection

Caltech Pedestrian Detection Benchmark

Action Recognition

Image-based

Video-based

HOLLYWOOD2 Dataset

Image Deblurring

Sun dataset

Image Captioning

Resources for students

Resource link collection

Resources for students - Frédo Durand (MIT)
Advice for Graduate Students - Aaron Hertzmann (Adobe Research)
Graduate Skills Seminars - Yashar Ganjali, Aaron Hertzmann (University of Toronto)
Research Skills - Simon Peyton Jones (Microsoft Research)

Writing

Write Good Papers - Frédo Durand (MIT)
Notes on writing - Frédo Durand (MIT)
How to Write a Bad Article - Frédo Durand (MIT)
How to write a good CVPR submission - William T. Freeman (MIT)
How to write a great research paper - Simon Peyton Jones (Microsoft Research)
How to write a SIGGRAPH paper - SIGGRAPH ASIA 2011 Course
Writing Research Papers - Aaron Hertzmann (Adobe Research)
How to Write a Paper for SIGGRAPH - Jim Blinn
How to Get Your SIGGRAPH Paper Rejected - Jim Kajiya (Microsoft Research)
How to write a SIGGRAPH paper - Li-Yi Wei (The University of Hong Kong)
How to Write a Great Paper - Martin Hering-Bertram (Hochschule Bremen University of Applied Sciences)
How to have a paper get into SIGGRAPH? - Takeo Igarashi (The University of Tokyo)
Good Writing - Marc H. Raibert (Boston Dynamics, Inc.)
How to Write a Computer Vision Paper - Derek Hoiem (UIUC)

Presentation

Giving a Research Talk - Frédo Durand (MIT)
How to give a good talk - David Fleet (University of Toronto) and Aaron Hertzmann (Adobe Research)
Designing conference posters - Colin Purrington

Research

How to do research - William T. Freeman (MIT)
You and Your Research - Richard Hamming
Warning Signs of Bogus Progress in Research in an Age of Rich Computation and Information - Yi Ma (UIUC)
Seven Warning Signs of Bogus Science - Robert L. Park
Five Principles for Choosing Research Problems in Computer Graphics - Thomas Funkhouser (Cornell University)
How To Do Research In the MIT AI Lab - David Chapman (MIT)
Recent Advances in Computer Vision - Ming-Hsuan Yang (UC Merced)
How to Come Up with Research Ideas in Computer Vision? - Jia-Bin Huang (UIUC)
How to Read Academic Papers - Jia-Bin Huang (UIUC)

Time Management

Time Management - Randy Pausch (CMU)

Links

The Computer Vision Industry - David Lowe
awesome-deep-learning
awesome-machine-learning
Cat Paper Collection

Songs
