Constructing and cleaning identity graphs in the LOD Cloud

Raad, Joe; Beek, Wouter; van Harmelen, Frank; Wielemaker, Jan; Pernelle, Nathalie; Saïs, Fatiha

doi:10.1162/dint_a_00057

J. Raad (Joe), W. Beek (Wouter), F.A.H. van Harmelen (Frank), J. Wielemaker (Jan), N. Pernelle (Nathalie) and F. Saïs (Fatiha)

2020-07-01

Constructing and cleaning identity graphs in the LOD Cloud

Data Intelligence , Volume 2 - Issue 3 p. 323- 352

In the absence of a central naming authority on the Semantic Web, it is common for different data sets to refer to the same thing by different names. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Studies that date back as far as 2009, observed that the owl:sameAs property is sometimes used incorrectly. In our previous work, we presented an identity graph containing over 500 million explicit and 35 billion implied owl:sameAs statements, and presented a scalable approach for automatically calculating an error degree for each identity statement. In this paper, we generate subgraphs of the overall identity graph that correspond to certain error degrees. We show that even though the Semantic Web contains many erroneous owl:sameAs statements, it is still possible to use Semantic Web data while at the same time minimising the adverse effects of misusing owl:sameAs.

Additional Metadata
Keywords	Linked Open Data, Identity, Quality, Reasoning
Persistent URL	doi.org/10.1162/dint_a_00057
Journal	Data Intelligence
Organisation	Human-Centered Data Analytics
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Raad, J., Beek, W., van Harmelen, F., Wielemaker, J., Pernelle, N., & Saïs, F. (2020). Constructing and cleaning identity graphs in the LOD Cloud. Data Intelligence, 2(3), 323–352. doi:10.1162/dint_a_00057

View at Publisher

Free Full Text ( Final Version , 705kb )

Constructing and cleaning identity graphs in the LOD Cloud

Publication

Publication

Address

CWI researchers

Questions or comments?

Constructing and cleaning identity graphs in the LOD Cloud

Publication

Publication

Workflow

Workflow

Add Content