MediaEval 2014: A Multimodal Approach to Drop Detection in Electronic Dance Music
Files
Publication date
2014
Editors
Larson, Martha
Ionescu, Bogdan
Anguera, Xavier
Advisors
Supervisors
DOI
Document Type
Part of book
Metadata
Show full item recordCollections
License
Abstract
We predict drops in electronic dance music (EDM), employing different multimodal approaches. We combine three sources of data: noisy labels collected through crowdsourcing, timed comments from SoundCloud and audio content analysis. We predict the correct labels from the noisy labels using the majority vote and Dawid-Skene methods. We also employ timed comments from SoundCloud users to count the occurrence of specific terms near the potential drop event, and, finally, we conduct an acoustic analysis of the audio excerpts. The best results are obtained, when both annotations, metadata and audio, are combined, though the differences between them are not significant.
Keywords
Citation
Aljanaki, A, Soleymani, M, Wiering, F & Veltkamp, R 2014, MediaEval 2014: A Multimodal Approach to Drop Detection in Electronic Dance Music. in M Larson, B Ionescu & X Anguera (eds), MediaEval 2014 Multimedia Benchmark Workshop : Working Notes Proceedings of the MediaEval 2014 Workshop Barcelona, Catalunya, Spain, October 16-17, 2014.. CEUR workshop proceedings , vol. 1263. < http://ceur-ws.org/Vol-1263/ >