Exploring the data wilderness through examples
Publication date
2019-06-25
Editors
Advisors
Supervisors
Document Type
Part of book
Metadata
Show full item recordCollections
License
taverne
Abstract
Exploration is one of the primordial ways to accrue knowledge about the world and its nature. As we accumulate, mostly automatically, data at unprecedented volumes and speed, our datasets have become complex and hard to understand. In this context exploratory search provides a handy tool for progressively gather the necessary knowledge by starting from a tentative query that hopefully leads to answers at least partially relevant and that can provide cues about the next queries to issue. Recently, we have witnessed a rediscovery of the so-called example-based methods, in which the user or the analyst circumvent query languages by using examples as input. This shift in semantics has led to a number of methods receiving as query a set of example members of the answer set. The search system then infers the entire answer set based on the given examples and any additional information provided by the underlying database. In this tutorial, we present an excursus over the main example-based methods for exploratory analysis, show techniques tailored to dierent data types, and provide a unifying view of the problem. We show how dierent data types require dier-ent techniques, and present algorithms that are specically designed for relational, textual, and graph data.
Keywords
Taverne, Software, Information Systems
Citation
Mottin, D, Lissandrini, M, Velegrakis, Y & Palpanas, T 2019, Exploring the data wilderness through examples. in SIGMOD 2019 - Proceedings of the 2019 International Conference on Management of Data. Association for Computing Machinery, pp. 2031-2035, 2019 International Conference on Management of Data, SIGMOD 2019, Amsterdam, Netherlands, 30/06/19. https://doi.org/10.1145/3299869.3314031, conference