FAQ (modified from last year)

Last modified:
Q: I am interested in learning data mining and maching learning. Is this course the place to go?

This course aims at attending data mining competitions (e.g., KDD cup). So this is not a place for you to learn basic materials of machine learning and data mining.


Q: Working on a data mining competition requires certain knowledge, but I might not have enough?

There are so many machine learning techniques and no body can be familiar with all. Therefore, some very basic understanding of machine learning might be enough. You must be able to find and learn new techniques by yourselves while working on this course.


What is the capacity of this class? Do we work individually or form teams?

To make sure we provide sufficient supports to every student in the class, we plan to take no more than 18 students in this class. If there are more than 18 students express the interests to join, we would select based on their prerequisite knowledge and motivation. Students form teams (3 persons each team) in this class.


Q: May I audit this course?

In general the answer is no, because you will not learn a lot without getting your hands dirty in this class. We don't want to waste your time and we hope every member in the class indeed spends significant amount of efforts on the competition.


Q: How about the course load?

Please anticipate spending at least 10 hours per week on this course. Simply put this: the more efforts you put in, the better results you will get. When your fellow classmates spend (or have to spend) lots of time and efforts on this, you will not be competitive if you don't.


Q: Where can I find details of this course?

We have a homepage (as you are reading it). However, the course wiki will be the main place to give details. You will see our progress on the competitions there. Every student will get a wiki account.


Q: Is there any homework?

Continuously (on the competition task) EVERY WEEK, including presentations.


Q: Do we have to prepare slides for each presentations?

Basically no unless you think slides can better indicate your ideas and results. However, your presentation must clearly show your progress and problems. Sometimes we even would like to directly see your code and experimental environments.


Q: If I have prepared slides, do I need to upload them to wiki?

Yes, we require those who have prepared slides to upload files to wiki before the class starts. Then all members can easily compare different results. Also we prefer pdf rather than word files.


Q: Today each team is allowed to do an x-minute presentation, but I used x+10. What will happen?

You will be in big trouble. Note that your presentation must be to the point. A long presentation may not be better. See how a former GM CEO lost his job because of making long presentations here.

The time allowed for each presentation may vary. But generally if there are six presentations, each will have 20 minutes, including questions/answers.


Q: Because of team work, I can rely on some smart teammates?

No, you should work as hard as others. We will find a way to evaluate each individual student's performance.


Q: I tried many new ways in the past week, but all gave worse results. What should I present?

Failed approaches indeed show something. You should frankly present what you have tried. Competition results are related but not strongly related to your final scores. We encourage creative thinking and out-of-the-box ideas. Novel ideas will be rewarded even if it is not proven by you to be useful.


What kinds of computational resources do I need for this course? Will you provide any?

In general the deptartment's machines (e.g., 217) should be enough. We will also provide some machines we purchased for this course.


Q: What's the way to calculate scores?

Mainly on your class performance. We will evaluate your performances at least three times in the semester. It's a bit similar to the industry systems (if you work at a company, you are evaluated every quarter).


Q: Is is possible that I fail to pass this course?

Of course. You pass only if you work hard enough. (Similarly, in industry, underperformers will be fired).


Q: What happens if I report fake results?

The least we would do is to let you fail this class. You can even face expelling from the university in the serious cases.


Please contact Chih-Jen Lin for any question.