A. Borusan

18.10.2012, 11:00 a.m., HPI, Prof.-Dr.-Helmert-Str. 2-3, 14482 Potsdam, Room H.2-57: "Connecting the Dots: Data Integration for the Web of Data" (Gerard de Melo, ICSI)

The long-abandoned vision of establishing a database of the world's
knowledge has seen a remarkable resurgence with the rapid growth of the Web of Data.
However, instead of a single shared database we typically face a plethora of
disparate data sets that are only loosely coupled and hence difficult to use
in practice. In this talk, I will discuss models and algorithms that allow us to
combine data from different sources into more tightly integrated databases.
A major focus here is establishing and cleaning identity links at the entity
level. Another aspect is producing coherent taxonomies that connect
information across different sources. Finally, I will present
applications of this work, including MENTA, a large-scale knowledge base,
and a site that provides integrated semantic
and linguistic information as Linked Data.