BIL 722 Advanced Topics in Computer Vision

Mondays 13:00-16:00

This course is intended for students who want to acquire in-depth knowledge of computer vision research. Throughout the course, various trending research directions will be explored through the discussion of several conference and journal papers. Students are also expected to carry out research projects related to one of the open problems in vision.

Syllabus
ABOUT COURSE PROJECT

Paper List

Date / Paper / Presenter

Feb 24, 2014 Mid-Level Features
Locality-constrained Linear Coding for Image Classification, Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, Thomas Huang, and Yihong Gong, CVPR 2010
Tugrul
Unsupervised Discovery of Mid-Level Discriminative Patches, Saurabh Singh, Abhinav Gupta, Alexei A. Efros, ECCV 2012.
DEMO
Mert
Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection, Joseph J. Lim, C. Lawrence Zitnick and Piotr Dollar, CVPR 2013.
DEMO
Aysun

DEMO: Cagdas
Mar 3, 2014 Segmentation
Segmentation Propagation in ImageNet, Daniel Kuettel, Matthieu Guillaumin, Vittorio Ferrari, ECCV 2012
DEMO
Tugrul
Jointly Aligning and Segmenting Multiple Web Photo Streams for the Inference of Collective Photo Storylines, Gunhee Kim, Eric P. Xing, CVPR 2013 Ezgi
Unsupervised Joint Object Discovery and Segmentation in Internet Images, Michael Rubinstein, Armand Joulin, Johannes Kopf, Ce Liu. CVPR 2013
DEMO
Ilker
DEMO: Cemil
Mar 10, 2014 Importance and Visual Saliency
The Interestingness of Images, M. Gygli, H. Grabner, H. Riemenschneider, F. Nater, L. V. Gool, ICCV 2013
Cemil
Understanding and Predicting Importance in Images, A. Berg, T. Berg, H. Daume, J. Dodge, A. Goyal, X. Han, A. Mensch, M. Mitchell, A. Sood, K. Stratos and K. Yamaguchi, CVPR 2012 Pinar
Category-Independent Object-level Saliency Detection, Yangqing Jia and Mei Han, ICCV 2013 Aysun
Mar 17, 2014 Holistic Scene Understanding
Nonparametric Scene Parsing with Adaptive Feature Relevance and Semantic Context, Gautam Singh and Jana Kosecka, CVPR 2013
Emine
Annotation Propagation in Large Image Databases via Dense Image Correspondence, Michael Rubinstein, Ce Liu and William T. Freeman, ECCV 2012 Mert
Geometric Context from Video, S. Hussain Raza, Matthias Grundmann and Irfan Essa, CVPR 2013 Ezgi
Mar 24, 2014 Natural Language Description of Visual Content
A Sentence Is Worth a Thousand Pixels, Sanja Fidler, Abhishek Sharma, Raquel Urtasun, CVPR 2013
Pinar
YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-shot Recognition, Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell and Kate Saenko, ICCV 2013 Ozcan
Sentence-based image description with scalable, explicit models, Micah Hodosh and Julia Hockenmaier, V&L Net Workshop on Vision and Language at CVPR 2013 Mert
Mar 31, 2014 Video Analysis and Tracking
Fast object segmentation in unconstrained video, A. Papazoglou and V. Ferrari, ICCV 2013
DEMO
Aysun
Demo: Aysun
Coherent Filtering: Detecting Coherent Motions from Crowd Clutters, Bolei Zhou, Xiaoou Tang and Xiaogang Wang, ECCV 2012
DEMO
Tugrul
Demo: Pinar
Robust Object Tracking with Online Multi-lifespan Dictionary Learning, Junliang Xing, Jin Gao, Bing Li, Weiming Hu, Shuicheng Yan, ICCV 2013. Cagdas
April 7, 2014 Action and Interaction Recognition
Better exploiting motion for better action recognition. Mihir Jain, Hervé Jegou and Patrick Bouthemy, CVPR 2013
DEMO
Ozcan
Demo: Ozcan
Finding Group Interactions in Social Clutter, Ruonan Li, Parker Porfilio, Todd Zickler, CVPR 2013.
Emine
Learning Human Interaction by Interactive Phrases, Yu Kong, Yunde Jia, Yun Fu, ECCV 2012. Ilker
April 14, 2014 Recognition and Object Localization
Overview on Recognition and Localization
From Large Scale Image Categorization to Entry-Level Categories, Vicente Ordonez, Jia Deng, Yejin Choi, Alexander C. Berg, Tamara L. Berg, ICCV 2013
Pinar
Localizing 3D Cuboids in Single-view Images, J. Xiao, B. C. Russell, and A. Torralba, NIPS 2012
DEMO
Cagdas
April 21, 2014 Project Progress Presentations
April 28, 2014 Video and Crowd Analysis, Egocentric Motion
Video Representation Using Temporal Superpixels, Jason Chang, Donglai Wei, John W. Fisher III, CVPR 2013
DEMO
Ezgi
Data-driven Crowd Analysis in Videos, Mikel Rodriguez, Josef Sivic, Ivan Laptev, Jean-Yves Audibert, ICCV 2011. Cemil
Story-Driven Summarization for Egocentric Video, Z. Lu and K. Grauman, CVPR 2013.
Emine
May 5, 2014 Miscellaneous
NEIL: Extracting Visual Knowledge from Web Data, X. Chen, A. Shrivastava and A. Gupta, ICCV 2013.
DEMO
Cemil
Demo: Emine
Searching for objects driven by context, B. Alexe, N. Heess, Y.W. Teh and V. Ferrari, NIPS 2012 Cagdas
The Way They Move: Tracking Multiple Targets with Similar Appearance, Caglayan Dicle, Octavia Camps, Mario Sznaier, ICCV 2013
DEMO
Ilker
Demo: Ilker
May 12, 2014 Final Project Presentations