Title: NLP for Microblog Summarization
Date: 2017-08-17 10:30am-11:30am
Location: R210, CSIE
Speaker: Prof. Kam-Fai Wong, The Chinese University of Hong Kong
Hosted by: Prof. Hsin-Hsi Chen



Microblogging is popular worldwide. However, the unstructured presentation and brevity of microblog messages make information processing difficult. We hypothesize that microblog language is a dialect and can therefore be analyzed using NLP techniques. We verify this hypothesis by investigating the application of context analysis to microblog summarization.

A microblog repost tree provides strong clues about how the event it describes develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework that effectively differentiates two kinds of messages on repost trees, called leaders and followers, which are derived from content-level structure information, i.e., the contents of messages and the reposting relations. To this end, a Conditional Random Fields (CRF) model is used to detect leaders across repost tree paths. We then present a variant of a random-walk-based summarization model to rank and select salient messages based on the result of leader detection. To reduce the error propagation cascaded from leader detection, we improve the framework by enhancing the random walk with adjustment steps for sampling from leader probabilities given all the reposting messages. For evaluation, we construct two annotated corpora, one for leader detection and the other for repost tree summarization. Experimental results confirm the effectiveness of our method. (This work was presented as a paper at EMNLP 2015.)
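As a rough illustration of ranking messages with a random walk that is biased by leader probabilities, the following minimal Python sketch may help. It is not the model from the paper: the function names (cosine, rank_messages), the bag-of-words cosine similarity, the damping value, and the use of leader probabilities as a restart distribution are all assumptions made for this example. In the talk's framework the leader probabilities would come from a CRF; here they are simply supplied as input.

```python
from collections import Counter
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def rank_messages(messages, leader_probs, damping=0.85, iterations=50):
    """Score messages with a random walk with restart (hypothetical helper).

    messages: list of message strings from one repost tree.
    leader_probs: one assumed leader probability per message.
    Returns a list of scores that sums to 1; higher means more salient.
    """
    n = len(messages)
    bags = [Counter(m.lower().split()) for m in messages]

    # Edge weights: content similarity between distinct messages.
    sim = [[cosine(bags[i], bags[j]) if i != j else 0.0 for j in range(n)]
           for i in range(n)]
    row_sums = [sum(row) for row in sim]

    # Restart distribution: the walk jumps back to likely leaders more often.
    total = sum(leader_probs)
    restart = [p / total for p in leader_probs] if total else [1.0 / n] * n

    scores = [1.0 / n] * n
    for _ in range(iterations):
        updated = []
        for j in range(n):
            walk = 0.0
            for i in range(n):
                if row_sums[i]:
                    walk += scores[i] * sim[i][j] / row_sums[i]
                else:
                    # A message similar to nothing passes its mass to the restart.
                    walk += scores[i] * restart[j]
            updated.append((1 - damping) * restart[j] + damping * walk)
        scores = updated
    return scores


if __name__ == "__main__":
    msgs = [
        "earthquake hits city center",
        "earthquake hits city center so sad",
        "lol",
        "city center rescue teams arrive",
    ]
    probs = [0.9, 0.1, 0.1, 0.8]  # made-up leader probabilities
    for score, msg in sorted(zip(rank_messages(msgs, probs), msgs), reverse=True):
        print(f"{score:.3f}  {msg}")
```

A summary could then be formed by taking the top-scoring messages. The adjustment steps mentioned in the abstract, which sample from leader probabilities to limit error propagation, are not reproduced in this sketch.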



Kam-Fai Wong obtained his PhD from Edinburgh University, Scotland, in 1987. After his PhD, he was a researcher at Heriot-Watt University (Scotland), UniSys (Scotland) and ECRC (Germany). At present he is Associate Dean (External Affairs) of the Faculty of Engineering, Professor in the Department of Systems Engineering and Engineering Management, Director of the Centre for Innovation and Technology (CINTEC), and Associate Director of the Centre for Entrepreneurship at The Chinese University of Hong Kong (CUHK), as well as Associate Director of the Key-Laboratory of High Confidence Software Technologies (PKU), Ministry of Education, China. He is also Honorary Professor, Harbin Institute of Technology (Shenzhen Graduate School); Adjunct Professor, School of Computer Technology, Peking University; and Adjunct Professor, Northeastern University, Shenyang, China.


His research interests focus on Natural Language Processing, especially Chinese and social media processing. He has published over 250 peer-reviewed papers in these areas. He has published the book “Introduction to Chinese Natural Language Processing” (2009), which is the first of its kind written in English. His new book, titled “Social Media Content Analysis: NLP and Beyond”, will be published in 2017Q4. He is the founding Editor-in-Chief of ACM Transactions on Asian Language Information Processing (TALIP) and the General Chair of BigComp2016, NLPCC2015, IJCNLP2011, AIRS2008 and ICCPOL2006. He was awarded the Medal of Honour (MH) by the HKSAR Government in 2011 for his valuable contribution to the development of information and communications technology.

Last modified: 2017-07-14 8:53 PM

