Please use this identifier to cite or link to this item: http://hdl.handle.net/2307/40522
DC FieldValueLanguage
dc.contributor.advisorMerialdo, Paolo-
dc.contributor.authorCannaviccio, Matteo-
dc.date.accessioned2021-12-17T10:50:22Z-
dc.date.available2021-12-17T10:50:22Z-
dc.date.issued2018-04-23-
dc.identifier.urihttp://hdl.handle.net/2307/40522-
dc.description.abstractIn last ten years, massive amounts of world knowledge have been accumulated into large knowledge graphs (KGs). These knowledge repositories store millions of facts about the world, such as information about people, places and organizations, and have become a powerful asset for semantic applications such as search, analytics, recommendations, and data integration. Several approaches have been proposed to create KGs from Wikipedia, as in the cases of YAGO and DBpedia, or collaboratively as for Freebase and Wikidata. Despite their seemingly huge size, these knowledge graphs are greatly incomplete and approaches to populate them automatically are needed to increase their coverage. This thesis describes principled methods to model knowledge graph relations with natural language. These models allow the extraction of facts from text or to annotate web tables with KG relations, with the aim of populating state-of-the-art KGs. The first contribution is a pattern-based extraction system which can extract automatically highquality facts from the text of Wikipedia articles. Indeed, the approaches used to derive KGs from Wikipedia are focused only on its structured components like the info-boxes. Although valuable, they represent only a fraction of the actual information expressed in the articles. We experiment our system on five different languages, showing that it can extract a large number of facts that are out of reach of common infobox-based extractions. The second contribution is an approach that uses language models, derived from aWeb-scale corpus, to rank KG relations that hold over pairs of entities juxtaposed in tables or structured lists. Our experimental evaluation shows the effectiveness of the approach in predicting KG relations even when entities are missing from the graph and thus represents a significant advancement of the state-of-the-art.en_US
dc.language.isoenen_US
dc.publisherUniversità degli studi Roma Treen_US
dc.subjectKNOWLEDGE GRAPHen_US
dc.subjectWEB TABLESen_US
dc.subjectRELATION EXTRACTIONen_US
dc.titleAugmenting knowledge graphs with natural language evidenceen_US
dc.typeDoctoral Thesisen_US
dc.subject.miurSettori Disciplinari MIUR::Ingegneria industriale e dell'informazione::SISTEMI DI ELABORAZIONE DELLE INFORMAZIONIen_US
dc.subject.isicruiCategorie ISI-CRUI::Ingegneria industriale e dell'informazione::Information Technology & Communications Systemsen_US
dc.subject.anagraferoma3Ingegneria industriale e dell'informazioneen_US
dc.rights.accessrightsinfo:eu-repo/semantics/openAccess-
dc.description.romatrecurrentDipartimento di Ingegneria*
item.grantfulltextrestricted-
item.languageiso639-1other-
item.fulltextWith Fulltext-
Appears in Collections:X_Dipartimento di Ingegneria
T - Tesi di dottorato
Files in This Item:
File Description SizeFormat
tesi.pdf4.72 MBAdobe PDFView/Open
Show simple item record Recommend this item

Page view(s)

146
checked on Nov 21, 2024

Download(s)

248
checked on Nov 21, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.