This is the repository for a community-led course on Computer Vision. Over 60 contributors from the Hugging Face Computer Vision community have worked together on the content for this course.
The course covers the following topics:
- Welcome
- Fundamentals
- Convolutional Neural Networks
- Vision Transformers
- Multimodal Models
- Generative Models
- Basic CV Tasks
- Video and Video Processing
- 3D Vision, Scene Rendering and Reconstruction
- Model Optimization
- Synthetic Data Creation
- Zero Shot Computer Vision
- Ethics and Biases
- Outlook
The result you have in front of you is as diverse as the community. A typical educational course is created by a small group of people who try to match each other's tone closely. We took a different road. While we followed a plan for which content to include, all authors were free to choose their own style. Other members of the community reviewed the content and either approved it or suggested changes.
The outcome is a truly unique course and proof of what a strong open-source community can achieve.
If you want to contribute content or suggest typo or bug fixes, head over to the Contribution Guidelines.
If you are curious about the Hugging Face Computer Vision Community, read on 🔽
Join us on Discord 👾
Join the Hugging Face Discord, take the open-source role, and join us in the #cv-community-project channel for discussions about the course. You can also check out the #computer-vision channel for more general discussions and questions about Computer Vision.