|
|
ICCV
2009 Tutorial on MAP
Inference in Discrete Models |
| ||
|
Speakers
New!
Tutorial slides are now online! Please
download either all slides in pdf format
or
in ppt format
.
In total there are 6 parts. Note, you are welcome to use these slides for
giving talks. If you plan to do, please let the respective author
know. |
Guest
Speaker
| |||
|
Purpose
of this course Many
problems in Computer Vision are formulated in form of a random filed of
discrete variables. Examples range from low-level vision such as image
segmentation, optical flow and stereo reconstruction, to high-level vision
such as object recognition. The goal is typically to infer the most
probable values of the random variables, known as Maximum a Posteriori
(MAP) estimation. This has been widely studied in several areas of
Computer Science (e.g. Computer Vision, Machine Learning, Theory), and the
resulting algorithms have greatly helped in obtaining accurate and
reliable solutions to many problems. These algorithms are extremely
efficient and can find the globally (or strong locally) optimal solutions
for an important class of models in polynomial time. Hence, they have led
to a significant increase in the use of random field models in computer
vision and information engineering in general. This tutorial
is aimed at researchers who wish to use and understand these algorithms
for solving new problems in computer vision and information engineering.
No prior knowledge of probabilistic models or discrete optimization will
be assumed. The tutorial will answer the following questions:
(a) How to
formalize and solve some known vision problems using MAP inference of a
random field? (b) What are the
different genres of MAP inference algorithms? (c) How do they
work? (d) Which algorithm
is suitable for which problem? (e) What are the
recent developments and open questions in this field?
Relationship
to tutorial at ECCV 2008 Pawan Kumar and
Pushmeet Kohli had given a half-day tutorial at ECCV 2008 on MAP
estimation in Computer Vision (see link). The full-day
tutorial at ICCV would cover the following topics that were not part of
the tutorial at ECCV:
(a)
Discussion of empirical studies of the performance of different inference
methods for various classes of random fields. (b)
New material on recent advances: optimization of higher-order functions;
introducing various convex relaxation techniques; new move-making methods.
(c)
More example applications to motivate the need of different classes of
functions and algorithms. Researchers
and students who had attended the previous version of our tutorial at ECCV
would also find this tutorial useful and
informative. Final
Syllabus 9.30-10.00
Introduction (Andrew Blake) –
Part1 10.00-11.00
Discrete Models in Computer Vision (Carsten Rother) – Part2 15min Coffee
break 11.15-12.30
Message Passing: DP, TRW, LP relaxation (Pawan Kumar) – Part3 12.30-13.00
Quadratic pseudo-boolean optimization (Pushmeet Kohli) – Part4 1 hour Lunch
break
15:00-15.30
Speed and Efficiency (Pushmeet Kohli) –
Part4 15min Coffee
break 15:45-16.15
Comparison of Methods (Carsten Rother) –
Part5 16:15-16.45
Dual-decomposition (Carsten Rother) –
Part5 16-45-17.30
Recent Advances in Convex Relaxations (Pawan Kumar) – Part6 About
the Speakers | ||||
|
|
Pushmeet
Kohli is a
researcher in the Machine Learning and Perception group at Microsoft
Research Cambridge, and a post-doctoral associate of Trinity Hall,
University of Cambridge. He completed his PhD studies at Oxford Brookes
University under the supervision of Prof. Philip Torr. His PhD thesis,
titled "Minimizing Dynamic and Higher Order Energy Functions using Graph
Cuts", was the winner of the British Machine Vision Association's Sullivan
Doctoral Thesis Award, and was a runner-up for the British Computer
Society's Distinguished Dissertation Award. Before joining Microsoft
Research Cambridge, Pushmeet was a visiting researcher at Microsoft
Research Bangalore. He previously worked in the Foundation of Software
Engineering Group at MSR Redmond. Pushmeet has worked on a number of
problems in Computer Vision, Machine Learning and Discrete Optimization.
His papers have appeared in SIGGRAPH, PAMI, IJCV, ICCV, CVPR, ICML and
ECCV. | |||
|
|
Pawan
Mudigonda is a
post-doctoral researcher at Stanford University. Pawan obtained a
Bachelors of Technology degree in Computer Science and Engineering from
the International Institute of Information Technology, Hyderabad, India.
He completed his PhD studies in 2008 at Oxford Brookes University under
the supervision of Prof. Philip Torr and Prof. Andrew Zisserman. His work
focuses on combinatorial and convex optimization based solutions for
problems in Computer Vision and Machine Learning, and has appeared in
several reputed conferences and journals such as ICCV, CVPR, NIPS, ICML
and IJCV. Together with his collaborators, he won best paper awards at
ICVGIP 2004, Rank Opto-Electronics Symposium 2007, and NIPS
2007. | |||
|
|
Carsten
Rother received his
Diploma degree with distinction in 1999 at the University of
Karlsruhe/Germany. He did his PhD at the Royal Institute of Technology
Stockholm/Sweden, supervised by Stefan Carlsson and Jan-Olof Eklundh. His
thesis was nominated for the Best Nordic Thesis Award 2003-2004, as one
out of two candidates form Sweden. Since 2003 he is a researcher at
Microsoft Research Cambridge/UK. He supervises several PhD students and
gives frequently invited talks. His research interests are in the field of
“Markov Random Fields for Computer Vision”, “Discrete Optimization”, and
“Vision for Graphics”. He has
published more than 20 high impact papers (at least 10 citations) at
international conferences and journals. He won the best paper honourable
mention award at CVPR ’05. He serves on the program committee of major
conferences (e.g. SIGGRAPH, ICCV, ECCV, CVPR, NIPS), and has been area
chair for BMVC ‘08, and ‘09. He has organized workshops on “interactive
computer vision” at ICCV ’07, and on “Color and Reflectance” at ICCV
’09. | |||