Speaker Gerhard Weikum
Host Milan Vojnovic
Affiliation Max Planck Institute for Informatics
Date recorded 23 May 2013
News, social media, web sites, and enterprise sources produce huge amounts of valuable contents in the form of text and speech. To tap this wealth of unstructured big data and obtain insights, a decisive step is to identify the entities that are referred to and relationships between entities. This allows linking unstructured contents with structured data. However, this step faces the fundamental problem that names and phrases are often highly ambiguous; mapping them to entities and relations is a challenging task. The talk will discuss the state of the art and open problems on disambiguating named entities and relational phrases. It will also put this line of research in perspective to the bigger picture of big data analytics.
©2013 Microsoft Corporation. All rights reserved.