BOLD: Knowledge Graph Exploration and Analysis Platform

Publication date

2024-03-18

Authors

Dmitriev, EgorISNI 0000000518137344
Chekol, Melisachew WudageISNI 0000000433166787
Schäfer, Mirko TobiasORCID 0000-0003-0212-7016ISNI 0000000356270811

Editors

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

cc_by_nc_nd

Abstract

The linked open data (LOD) cloud maintains several interlinked knowledge graphs. These graphs span various domains such as government, media, life sciences, etc. The graphs are often manually curated or automatically extracted (e.g. YAGO—Yet Another Great Ontology) using information extraction techniques. They are used in various applications such as data governance, fraud detection, fact checking, etc. Although the graphs in LOD are widely used, they do not contain metadata about their representativeness (distribution of key features). Since most of the graphs are automatically curated, bias can manifest due to sensitive features and their causal influences, or through under (over)representation of certain entities (e.g. people) and relations (e.g. president-of, works-for). The aim of this work is to develop a system to automatically generate bias profiles (metadata about the representativeness of data) for knowledge graphs. As a result, the metadata can be used as a guide for users to choose bias free (balanced) datasets for their studies. Moreover, it enables researchers to quickly gauge the relevance of a graph for a problem at hand (e.g. classification task).

Keywords

Information Systems, Software, Computer Science Applications

Citation

Dmitriev, E, Chekol, M W & Schaefer, M T 2024, BOLD : Knowledge Graph Exploration and Analysis Platform. in Advances in Database Technology - EDBT. 3 edn, Advances in Database Technology - EDBT, no. 3, vol. 27, OpenProceedings.org, pp. 814-817, 27th International Conference on Extending Database Technology, EDBT 2024, Paestum, Italy, 25/03/24. https://doi.org/10.48786/edbt.2024.77, conference