MediaEval 2014: A Multimodal Approach to Drop Detection in Electronic Dance Music

Publication date

2014

Authors

Aljanaki, A.ISNI 0000000419508357
Soleymani, Mohammad
Wiering, FransORCID 0000-0002-2984-8932ISNI 0000000053360131
Veltkamp, Remco C.ISNI 0000000109665680

Editors

Larson, Martha
Ionescu, Bogdan
Anguera, Xavier

Advisors

Supervisors

DOI

Document Type

Part of book
Open Access logo

License

Abstract

We predict drops in electronic dance music (EDM), employing different multimodal approaches. We combine three sources of data: noisy labels collected through crowdsourcing, timed comments from SoundCloud and audio content analysis. We predict the correct labels from the noisy labels using the majority vote and Dawid-Skene methods. We also employ timed comments from SoundCloud users to count the occurrence of specific terms near the potential drop event, and, finally, we conduct an acoustic analysis of the audio excerpts. The best results are obtained, when both annotations, metadata and audio, are combined, though the differences between them are not significant.

Keywords

Citation

Aljanaki, A, Soleymani, M, Wiering, F & Veltkamp, R 2014, MediaEval 2014: A Multimodal Approach to Drop Detection in Electronic Dance Music. in M Larson, B Ionescu & X Anguera (eds), MediaEval 2014 Multimedia Benchmark Workshop : Working Notes Proceedings of the MediaEval 2014 Workshop Barcelona, Catalunya, Spain, October 16-17, 2014.. CEUR workshop proceedings , vol. 1263. < http://ceur-ws.org/Vol-1263/ >