Augmenting knowledge graphs with natural language evidence

Cannaviccio, Matteo

Please use this identifier to cite or link to this item: http://hdl.handle.net/2307/40522

DC Field	Value	Language
dc.contributor.advisor	Merialdo, Paolo	-
dc.contributor.author	Cannaviccio, Matteo	-
dc.date.accessioned	2021-12-17T10:50:22Z	-
dc.date.available	2021-12-17T10:50:22Z	-
dc.date.issued	2018-04-23	-
dc.identifier.uri	http://hdl.handle.net/2307/40522	-
dc.description.abstract	In last ten years, massive amounts of world knowledge have been accumulated into large knowledge graphs (KGs). These knowledge repositories store millions of facts about the world, such as information about people, places and organizations, and have become a powerful asset for semantic applications such as search, analytics, recommendations, and data integration. Several approaches have been proposed to create KGs from Wikipedia, as in the cases of YAGO and DBpedia, or collaboratively as for Freebase and Wikidata. Despite their seemingly huge size, these knowledge graphs are greatly incomplete and approaches to populate them automatically are needed to increase their coverage. This thesis describes principled methods to model knowledge graph relations with natural language. These models allow the extraction of facts from text or to annotate web tables with KG relations, with the aim of populating state-of-the-art KGs. The first contribution is a pattern-based extraction system which can extract automatically highquality facts from the text of Wikipedia articles. Indeed, the approaches used to derive KGs from Wikipedia are focused only on its structured components like the info-boxes. Although valuable, they represent only a fraction of the actual information expressed in the articles. We experiment our system on five different languages, showing that it can extract a large number of facts that are out of reach of common infobox-based extractions. The second contribution is an approach that uses language models, derived from aWeb-scale corpus, to rank KG relations that hold over pairs of entities juxtaposed in tables or structured lists. Our experimental evaluation shows the effectiveness of the approach in predicting KG relations even when entities are missing from the graph and thus represents a significant advancement of the state-of-the-art.	en_US
dc.language.iso	en	en_US
dc.publisher	Università degli studi Roma Tre	en_US
dc.subject	KNOWLEDGE GRAPH	en_US
dc.subject	WEB TABLES	en_US
dc.subject	RELATION EXTRACTION	en_US
dc.title	Augmenting knowledge graphs with natural language evidence	en_US
dc.type	Doctoral Thesis	en_US
dc.subject.miur	Settori Disciplinari MIUR::Ingegneria industriale e dell'informazione::SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI	en_US
dc.subject.isicrui	Categorie ISI-CRUI::Ingegneria industriale e dell'informazione::Information Technology & Communications Systems	en_US
dc.subject.anagraferoma3	Ingegneria industriale e dell'informazione	en_US
dc.rights.accessrights	info:eu-repo/semantics/openAccess	-
dc.description.romatrecurrent	Dipartimento di Ingegneria	*
item.languageiso639-1	other	-
item.fulltext	With Fulltext	-
item.grantfulltext	restricted	-
Appears in Collections:	X_Dipartimento di Ingegneria T - Tesi di dottorato

Files in This Item:

File	Description	Size	Format
tesi.pdf		4.72 MB	Adobe PDF	View/Open

Show simple item record Recommend this item

Page view(s)

376

checked on Jun 24, 2026

Download(s)

426

checked on Jun 24, 2026

Google Scholar^TM

Check

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM