Computer Vision (CMU 16-385)

This course provides a comprehensive introduction to computer vision. Major topics include image processing, detection and recognition, geometry-based and physics-based vision, and video analysis. Students will learn the basic concepts of computer vision and gain hands-on experience solving real-life vision problems.

Basic Info
Mon/Wed 11:00am-12:20pm
Tepper 1403
Instructor: David Held
See the Course Info page for more info on policies and logistics.
Getting Started
To get started with the class you need to do just three things:
  1. Sign up for the course Piazza.
  2. Sign up for an account on this webpage. (The signup code is on Canvas.)
  3. Carefully read through the Course Info.
Fall 2024 Schedule
Aug 26 (Mon)
Aug 28 (Wed)
Sep 2 (Mon)
Assignment 1 out
Sep 4 (Wed)
Sep 9 (Mon)
Sep 11 (Wed)
Sep 16 (Mon)
Sep 18 (Wed)
Assignment 1 DUE
Assignment 2 out
Sep 23 (Mon)
Sep 25 (Wed)
Sep 30 (Mon)
Oct 2 (Wed)
Assignment 2 DUE
Assignment 3 out
Oct 7 (Mon)
Oct 9 (Wed)
Oct 14 (Mon)
NO CLASS (Fall Break)
Oct 16 (Wed)
NO CLASS (Fall Break)
Oct 21 (Mon)
Oct 23 (Wed)
Assignment 3 DUE
Assignment 4 out
Oct 28 (Mon)
Oct 30 (Wed)
Nov 4 (Mon)
Nov 6 (Wed)
Assignment 4 DUE
Assignment 5 out
Nov 11 (Mon)
Nov 13 (Wed)
Nov 18 (Mon)
Nov 20 (Wed)
Assignment 5 DUE
Assignment 6 out
Nov 25 (Mon)
Nov 27 (Wed)
Dec 2 (Mon)
Dec 4 (Wed)
Dec 6 (Fri)
Assignment 6 DUE
Assignments

Assignments will be released via Piazza. A list of assignments is available below. Reference material is available on the Lectures page.

(Due Sep 18) Programming Assignment 1: Image Filtering and Hough Transform
(Due Oct 2) Programming Assignment 2: Augmented Reality with Planar Homographies
(Due Oct 23) Programming Assignment 3: 3D Reconstruction
(Due Nov 6) Programming Assignment 4: Scene Recognition with Bag of Words
(Due Nov 20) Programming Assignment 5: Neural Networks for Recognition
(Due Dec 6) Programming Assignment 6: Video Tracking
Acknowledgments

The lecture notes have been pieced together from many different people and places. Special thanks to colleagues for sharing their slides: Matt O'Toole, Kris Kitani, Bob Collins, Srinivasa Narasimhan, Martial Hebert, Alyosha Efros, Ali Farhadi, Deva Ramanan, Yaser Sheikh, and Todd Zickler. Many thanks also to the following people for making their lecture notes and materials available online: Steve Seitz, Richard Szeliski, Larry Zitnick, Noah Snavely, Lana Lazebnik, Kristen Grauman, Yung-Yu Chuang, Tinne Tuytelaars, Fei-Fei Li, Antonio Torralba, Rob Fergus, David Claus, and Dan Jurafsky.