ECSE-626B: Statistical
Computer Vision
Time: MWF 11:35-12:25pm
Room: Wong 1030
Instructor: Tal Arbel
Teaching Assistant: TBA
Office: MC-425
Office Hrs: Mondays 2:30-3:30pm or by appointment
Phone: 398-8204
Course description:
Computer vision has always been concerned with solving ill-posed problems. For
example, given an image of a scene, we are asked to determine the underlying
structures that generated it. The problem is difficult in that information is
lost when the three dimensional world is projected onto a two dimensional
image. As a result, different scenes could give rise to the same image. To
illustrate the point, consider the example below. On the left, you can see a
toothpaste tube placed on a table. On the right is one particular
2D image of it acquired from a camera at a particularly unfortunate
viewpoint - facing the top of the tube. Note that several different objects
could give rise to that same image. Identifying the toothpaste tube from that
image is difficult even for humans.
In this course, we explore the conjecture that the world is uncertain and
therefore should be described through the language of probabilities. We will
describe several standard problems in computer vision and explore
probabilistic inference methods for their solution. The goal is to provide the
student with the necessary tools to be able to apply these techniques to other
computer vision research problems and to many other areas.
Background requirements
This course is designed for graduate students pursuing interests in the areas
of computer vision, robot vision and artificial intelligence (e.g. machine
learning, decision-making). It is not meant as an introductory course in
computer vision and, as such, does not provide a broad overview of the
field. That being said, we will describe and address a wide range of problems
in computer vision, but the particular focus will be on solutions based on
probability and information theory.
The course assumes some mathematical background in probability and
statistics, linear algebra, and calculus. Students should be
familiar with basic techniques in image processing and optimization. Finally,
students should be comfortable programming in Matlab.
Topics to be covered: (subject to change)
The course will describe a wide variety of difficult and open problems in
computer vision. These include (but are not limited to): optical flow
estimation, shape-from-shading, stereo vision, object recognition,
pose estimation, active vision, image registration and alignment, and face
recognition. We will explore probabilistic solutions to these problems
by studying the following:
Standard regularization and Bayesian regularization approaches to solving
ill-posed vision problems
Sequential Bayesian methods - Kalman filters, particle filters
Bayesian inference
Model-fitting - parametric and non-parametric
Active data selection
Information theory
Pattern recognition
Markov Random Field (MRF) models
Principal component analysis and eigenmodels
Course schedule (tentative)
Lecture Schedule
Reading material
Readings will include journal and conference papers in computer vision as well
as sections from the following textbook:
David J.C. MacKay, "Information Theory, Inference, and Learning
Algorithms".
Some suggested background reading:
David A. Forsyth, Jean Ponce, Computer Vision: a Modern Approach,
Prentice Hall, 2002.
T.M. Cover and J.A. Thomas, Elements of Information Theory,
Wiley & Sons, New York, 1991.
Stan Z. Li, Markov
Random Field Modeling in Image Analysis , Springer-Verlag, 2001.
Grading: (subject to change)
2 Assignments: 20%
Midterm literature review: 20%
Final project: Document: 30% + presentation: 10% = 40%
Final test: 20%
Useful computer vision links:
The Computer Vision Home Page
Annotated Computer Vision Bibliography
CVonline Compendium of Computer Vision
Academic Integrity:
McGill University values academic integrity. Therefore all students must
understand the meaning and consequences of cheating, plagiarism and other
academic offences under the Code of Student Conduct and Disciplinary
Procedures (see www.mcgill.ca/integrity for more
information).