922 U3710: AMMAI - ADVANCED TOPICS IN MULTIMEDIA ANALYSIS AND INDEXING
(高等多媒體資訊分析與檢索)

Spring 2011 (14:20 ~ 17:20, Thursday, CSIE RM#542)

Brief Introduction

This course focuses on recent development of machine learning techniques that are promising for solving practical problems in video indexing and audio-visual content analysis. The goal is for students to get familiar with the state of the art, learn how to formulate and solve practical video indexing/analysis problems, and acquire hands-on experience through actual experiments. The course will include some topics in depth such as:

Course Goals :

Prerequisites: Background in image processing (or signal processing related courses), probability, and linear algebra. Experience with machine learning or statistical pattern recognition will be useful but not required.

Course Format: Lectures by the instructor and paper critiques by students. Each one is expected to assign one topic (or paper).

Lecturer: Winston Hsu (office: R512, CSIE Building)

TA: Liang-Chi Hsieh

Time: 14:20 ~ 17:20, Thursday; No lectures on March 31 (NII Shanon Meeting), May 25 (ICASSP 2011)

Location: RM#542, CSIE Building

Mailing List: All the course announcements will be sent though the mailing list, please do subscribe for the class.
https://cmlmail.csie.ntu.edu.tw/mailman/listinfo/ammai and browse the discussion archives.

Assessment:

Textbook: NO. We will cover some active research areas not included in any mature textbooks. Nevertheless, we will provide rich papers and reference books.

 

Students and Reading Blogs

 

 

Project Groups

Course Outline

Lecture 01 - Introduction (02/24/11, Thursday)pdf

Lecture 02 - Interesting Points and Local Descriptors (03/03/11, Thursday)

Lecture 03 - Advanced Topics for Large-Scale Image Retrieval (03/10/11, Thursday)

Lecture 04 - Hashing and Semantic-Preserving Hashing (03/17/11, Thursday)

Lecture 05/06 - Latent Semantic Analysis (I) (03/24/11, Thursday)

Lecture 07 - Manifold Methods (04/07/11, Thursday)

Lecture 08 - Latent Semantic Analysis (II) (04/14/11, Thursday)

Lecture 09 - Learning to Rank (I) - RankSVM + AdaRank (04/21/11, Thursday)

Lecture 10 - Learning to Rank (II) - ListNet + Reranking (04/28/11, Thursday)

Lecture 11 - Mining People Attributes and Activities (05/05/11, Thursday)

Lecture 12 - High Performance Analytics (I): Current Solutions (05/12/11, Thursday)

Lecture 13 - High Performance Analytics (II): Algorithms for Image/Video Analysis (05/19/11, Thursday)

Lecture 14 - Object Localization (06/02/11, Thursday)

Lecture 15 - Sparse Coding (06/09/11, Thursday)

Lecture 16 - Project Presentation (06/16/11, Thursday)

 

Tips for Student Presenters

Generally, we had included the *must* papers and optional ones in the reading lists. The goal for the presentation is to help the audiences and presenters understand the breadth and depths in these problems. The presentation time for each topic is around 50 ~ 60 min. We can adjust the duration if necessary.

Presenters can emphasize more on the "must" papers in depth, which are highly cited correspondingly. However, we expect presenters to mention the breadth for the problems as well. Please discuss at side with other related works and their comparisons, which can be found in the optional papers. Students are encouraged to use other materials that are useful for the explanations. Meanwhile, an introduction with sample codes and real examples is the best way for the audiences to comprehend what the details are. I would encourage preparing in advance if applicable.

The guideline for presentation might be a help for students as well.

Please chat with the lecturer one week before the presentation.

Course Material

Books: