[2018-12-21] Dr. Qiang Huo, Microsoft Research Asia (MSRA), "Microsoft’s New Generation OCR Technology"

Post date: 2018-11-20

Microsoft’s New Generation OCR Technology
Date: 2018-12-21 3:40pm-5:00pm
Location: R103, CSIE
Speaker: Dr. Qiang Huo, Microsoft Research Asia (MSRA)
Hosted by: Prof. Winston Hsu


Optical Character Recognition (OCR) is the electronic conversion of printed or handwritten text in an image into machine-encoded text. It is an important enabling technology that empowers people and organizations to do more and achieve more. At Microsoft Research Asia (MSRA), we have been developing Microsoft’s new generation OCR engine, which can detect both printed and handwritten text in an image captured by a camera or mobile phone, and recognize the detected text for follow-up actions. Compared with other industry-leading OCR technologies, Microsoft’s new generation OCR engine has significantly improved recognition accuracy in various application scenarios (such as scanned documents and natural scene images). In this talk, I will give background on how OCR works, a glimpse of what we have achieved so far, and the road map ahead.
Dr. Qiang Huo is a Principal Research Manager of the Speech Group at Microsoft Research Asia (MSRA), Beijing, China. Prior to joining MSRA in August 2007, he had been a faculty member at the Department of Computer Science, The University of Hong Kong, for about ten years. Many of his students have become leaders in both academia and industry. From 1995 to 1997, Dr. Huo worked on speech recognition for the world’s first spoken language translation system at the Advanced Telecommunications Research Institute (ATR) in Kyoto, Japan. Over the past 30 years, he has conducted research and made fundamental contributions in the areas of speech recognition, handwriting recognition, OCR, gesture recognition, biometric-based user authentication, and hardware design for speech and image processing. Many core technologies developed by his teams have been deployed widely in industry, including Microsoft’s products and services such as Windows, Office, Cognitive Services, and Bing.
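Dr. Qiang Huo is currently a Principal Researcher and head of the Speech Group at Microsoft Research Asia. He joined Microsoft in 2007, having previously taught at The University of Hong Kong for nearly ten years; many of the students he trained have since become industry leaders. From 1995 to 1997, Dr. Huo conducted speech recognition research at the Advanced Telecommunications Research Institute (ATR) in Kyoto, Japan, and took part in the world's first spoken language machine translation project. For nearly 30 years, Dr. Huo has pursued research and made important contributions in speech recognition, handwriting recognition, optical character recognition, gesture recognition, biometric-based user identification, and hardware design for speech and image processing. The technologies he developed have been widely applied in Windows, Office, Microsoft Cognitive Services, and Bing search.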

Last modification time: 2018-11-20 9:59 AM
