:::

[2019-05-16] Prof. Chien-Ju Ho, Washington University, "Human-in-the-Loop Bandit Learning"(English Speech)

非專題討論演講公告
Poster:Post date:2019-05-10
Title: Human-in-the-Loop Bandit Learning 
Date: 2019-05-16 3:40pm-5:00pm
Location: R102, CSIE
Speaker: Prof. Chien-Ju Ho, Washington University
Hosted by: Prof. Jane Hsu
 
 

Abstract:

 
Bandit learning is a sequential decision-making framework when only partial feedback is observable. In standard stochastic bandit settings, the learner chooses an action at each time step and observes a reward independently drawn from some distribution associated with the taken action. The goal of the learner is to maximize the sum of the rewards obtained through the taken actions over time. In the past few decades, there has been extensive literature for bandit problems. However, bandit learning has been increasingly used to help make decisions in human-in-the-loop systems, such as online advertisements and user-generated content, where the common assumption of independent reward may not hold anymore. In this talk, I discuss my recent work investigating the design of bandit algorithms with humans in the loop, in the context of crowdsourcing markets and user-generated content platforms.
 
 
Biography:
 
Chien-Ju is an assistant professor in Computer Science & Engineering at Washington University in St. Louis. Previously, he was a postdoctoral associate at Cornell University. He earned his PhD in Computer Science from the University of California, Los Angeles in 2015 and spent three years visiting the EconCS group at Harvard from 2012 to 2015. He is the recipient of the Google Outstanding Graduate Research Award at UCLA in 2015. His work was nominated for Best Paper Award at WWW 2015. His research centers on the design and analysis of human-in-the-loop systems, using techniques drawn from the fields of machine learning, algorithmic economics, optimization, and online behavioral social science. He is interested in developing realistic human behavior models and studying how the models influence the design of machine learning algorithms and incentive mechanisms. 

 
Last modification time:2019-05-10 AM 8:34

cron web_use_log