Semi-Automatic Data Annotation guided by Feature Space Projection
Publication date
2020
Editors
Advisors
Supervisors
Document Type
Article
Metadata
Show full item recordCollections
License
taverne
Abstract
Data annotation using visual inspection (supervision) of each training sample can be laborious. Interactive solutions alleviate this by helping experts propagate labels from a few supervised samples to unlabeled ones based solely on the visual analysis of their feature space projection (with no further sample supervision). We present a semi-automatic data annotation approach based on suitable feature space projection and semi-supervised label estimation. We validate our method on the popular MNIST dataset and on images of human intestinal parasites with and without fecal impurities, a large and diverse dataset that makes classification very hard. We evaluate two approaches for semi-supervised learning from the latent and projection spaces, to choose the one that best reduces user annotation effort and also increases classification accuracy on unseen data. Our results demonstrate the added-value of visual analytics tools that combine complementary abilities of humans and machines for more effective machine learning.
Keywords
semi-supervised learning, unsupervised feature learning, interactive data annotation, autoencoder-neutral networks, data visualization, Taverne
Citation
Benato, B, Gomes, J, Telea, A & Falcao, A 2020, 'Semi-Automatic Data Annotation guided by Feature Space Projection', Pattern Recognition, vol. 109, 107612. https://doi.org/10.1016/j.patcog.2020.107612