Discovering order dependencies through order compatibility

Publication date

2019-01-01

Authors

Consonni, Cristian
Sottovia, Paolo
Montresor, Alberto
Velegrakis, YannisORCID 0000-0001-6332-0296ISNI 0000000125737584

Editors

Kaoudi, Zoi
Galhardas, Helena
Fundulaki, Irini
Reinwald, Berthold
Herschel, Melanie
Binnig, Carsten

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

Abstract

A relevant task in the exploration and understanding of large datasets is the discovery of hidden relationships in the data. In particular, functional dependencies have received considerable attention in the past. However, there are other kinds of relationships that are significant both for understanding the data and for performing query optimization. Order dependencies belong to this category. An order dependency states that if a table is ordered on a list of attributes, then it is also ordered on another list of attributes. The discovery of order dependencies has been only recently studied. In this paper, we propose a novel approach for discovering order dependencies in a given dataset. Our approach leverages the observation that discovering order dependencies can be guided by the discovery of a more specific form of dependencies called order compatibility dependencies. We show that our algorithm outperforms existing approaches on real datasets. Furthermore, our algorithm can be parallelized leading to further improvements when it is executed on multiple threads. We present several experiments that illustrate the effectiveness and efficiency of our proposal and discuss our findings.

Keywords

Information Systems, Software, Computer Science Applications

Citation

Consonni, C, Sottovia, P, Montresor, A & Velegrakis, Y 2019, Discovering order dependencies through order compatibility. in Z Kaoudi, H Galhardas, I Fundulaki, B Reinwald, M Herschel & C Binnig (eds), Advances in Database Technology - EDBT 2019 : 22nd International Conference on Extending Database Technology, Proceedings. vol. 2019-March, OpenProceedings.org, pp. 409-420, 22nd International Conference on Extending Database Technology, EDBT 2019, Lisbon, Portugal, 26/03/19. https://doi.org/10.5441/002/edbt.2019.36, conference