Data Mining and Machine Learning


Course Outline (tentative)


Homework

Homework is assigned once every week. Please write your homework/reports in English.
The score for late homework will be decreased exponentially.
Please hand in a printed copy of your homework; do not e-mail it to the TA.

Every week at around 12:50pm we randomly select one student to present his/her homework. Moreover, you are required to turn in your homework before the 20-minute break.

Rules: We do not require you to come every week. If you are absent and are selected, you will be required to give a presentation the following week. If you fail to show up then, 15 points will be deducted from your mid-term exam score. On the other hand, every week we first ask for a volunteer, who will get 10 bonus points on the mid-term. However, you can volunteer only once in this course. When no one volunteers, anyone can be picked, regardless of whether he/she has presented homework before.


Exams

You can bring your textbook, slides, and class notes, but nothing else. For example, you can bring neither a computer nor another person.


Final Project

We will have one final project. Each group has three or four members. Project presentations: May 7 and June 11. Each group gives a 20-minute presentation. Please give me your final report (at most 10 pages) by June 9.

Yes, your presentation will be in English.

Project topic: spam filtering

We all hate spam, but there is no good way yet to deal with it. There are quite a few approaches to controlling spam. For example, some use blacklists, and some servers delay incoming messages to see if the mail is resent. Using data classification is another possible approach. The basic idea is simple. We have a training set of spam and non-spam mails. After training a model, we can predict whether a new mail is spam or not.
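
The train-then-predict idea can be sketched in a few lines. The example below is only an illustration, not the method required for the project: it assumes a naive Bayes classifier with add-one smoothing, a crude whitespace tokenizer, and a tiny made-up training set. The function names (tokenize, train, predict) and the labels "spam" and "ham" are hypothetical choices for this sketch.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def train(examples):
    """Count words per class from (text, label) pairs; label is 'spam' or 'ham'."""
    word_counts = {"spam": Counter(), "ham": Counter()}
    doc_counts = Counter()
    for text, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocab = set(word_counts["spam"]) | set(word_counts["ham"])
    return word_counts, doc_counts, vocab


def predict(model, text):
    """Return the label with the higher naive Bayes log-score."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in ("spam", "ham"):
        # Log prior: fraction of training mails carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # ignore words never seen in training
            # Add-one (Laplace) smoothing avoids log(0) for unseen pairs.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical toy training set, invented for this sketch.
training_set = [
    ("win a free prize now", "spam"),
    ("cheap pills free offer", "spam"),
    ("homework due next week", "ham"),
    ("project meeting notes attached", "ham"),
]

model = train(training_set)
print(predict(model, "free prize offer"))
print(predict(model, "homework meeting next week"))
```

A real mail collection would need proper parsing of headers and bodies, and any other classifier could take the place of naive Bayes here.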

However, there are quite a few problems in using the training/testing procedure.

Of course we don't expect to fully solve these problems in this project. However, we hope to understand the following.

Grading

30% homework, 30% project, 40% exams. (tentative)

Related Information


Last modified: Tue Jun 12 14:43:53 CST 2007