:::

[2018-12-21] Dr. Qiang Huo, Microsoft Research Asia (MSRA), "Microsoft’s New Generation OCR Technology"

專題討論演講公告
張貼人:白師瑜公告日期:2018-11-20
577_0f558a4d.jpg

Title:
Microsoft’s New Generation OCR Technology
Date: 2018-12-21 3:40pm-5:00pm
Location: R103, CSIE
Speaker: Dr. Qiang Huo, Microsoft Research Asia (MSRA)
Hosted by: Prof. Winston Hsu
 
 

Abstract:

 
Optical Character Recognition (OCR) is the electronic conversion of printed or handwritten text on a picture into machine-encoded text. It is an important enabling technology that empowers people and organizations to do more and achieve more. In Microsoft Research Asia (MSRA), we have been developing Microsoft’s new generation OCR engine, which can detect both printed and handwritten text in an image captured by a camera or mobile phone, and recognize the detected text for follow-up actions. Compared with the other industry-leading OCR technology, Microsoft’s new generation OCR engine has significantly improved recognition accuracy in various application scenarios (such as scanned document and natural scene images). In this talk, I will give you background on how this fundamental technology of OCR works, a glimpse of what we have achieved so far and the road map ahead.
 
 
光學字元辨識(OCR)是指檢測和識別圖片上的文本資訊並將其轉換成電腦文字的過程。這是一項重要的賦能技術,可以幫助用戶更加高效地完成工作。近年來,微軟亞洲研究院致力於開發新一代的OCR引擎,該引擎可以從相機、手機等拍攝的照片中檢測出列印體和手寫體文本,並對其內容進行識別,從而驅動後續的工作。跟業內其它領先的OCR技術相比,微軟新一代OCR引擎在各種應用場景(如掃描文檔圖片、自然場景圖像)下的識別準確率都有大幅的提升。在此次演講中,我將為大家講解OCR的技術背景,並就我們已取得的成果和未來的路線圖為大家進行簡要介紹。
 
 
Biography:
 
Dr. Qiang Huo is a Principal Research Manager of Speech Group in Microsoft Research Asia (MSRA), Beijing, China. Prior to joining MSRA in August 2007, he had been a faculty member at the Department of Computer Science, The University of Hong Kong for about ten years. Many of his students have become leaders in both academia and industry. From 1995 to 1997, Dr. Huo worked on speech recognition for the world’s first spoken language translation system at Advanced Telecommunications Research Institute (ATR) in Kyoto, Japan. In the past 30 years, he has been doing research and making fundamental contributions in the areas of speech recognition, handwriting recognition, OCR, gesture recognition, biometric-based user authentication, hardware design for speech and image processing. Many core technologies developed by his teams have been deployed widely in industry, including Microsoft’s products and services such as Windows, Office, Cognitive Services, and Bing.
 
 
霍強博士現任微軟亞洲研究院語音組首席研究員和負責人,2007年加入微軟,此前在香港大學任教近十年,他培養的許多學生現已成為業界領袖。霍強博士于1995 至 1997年期間在日本京都“國際先端通信技術研究所”(ATR)從事語音辨識研究並參與了世界上首個口語機器自動翻譯項目。過去近30年,霍強博士一直堅持研究,為語音辨識、手寫辨識、光學字元辨識、手勢識別、基於生物特徵的使用者識別、語音和影像處理的硬體設計等領域做出重要貢獻,研發的技術已被廣泛應用於Windows,Office,微軟認知服務和必應(Bing)搜索。


 
   
最後修改時間:2018-11-20 AM 9:59

cron web_use_log