Multi-level Fusion of Fisher Vector Encoded BERT and Wav2vec 2.0 Embeddings for Native Language Identification
Publication date
2022-11-10
Editors
Prasanna, S. R. Mahadeva
Karpov, Alexey
Samudravijaya, K.
Agrawal, Shyam S.
Advisors
Supervisors
Document Type
Part of book
Metadata
Show full item recordCollections
License
taverne
Abstract
Native Language Identification is a prominent paralinguistic study with applications ranging from biometric analysis to speaker adaptation. Former studies on this task have benefited from alternative acoustic feature representations and pre-trained neural networks. In this work, we explore the Native Language Identification performance of contextual acoustic (wav2vec 2.0) and linguistic (BERT) embeddings as state-of-the-art feature representations and combine them with acoustic features at different levels. We encode acoustic and linguistic features using Fisher Vectors, applying Fisher Vector encoding on BERT word embeddings and wav2vec 2.0 for the first time for a paralinguistic task. We compare this approach with conventional functional summarization. In line with our former study using only acoustic modality, the results indicate the superiority of Fisher Vectors encoding over the traditional techniques. Moreover, we show the efficacy of combining alternative representations now in both acoustic and linguistic modalities. Results indicate a notable contribution of the transformer-based contextual auditory and linguistic feature representations to bimodal Native Language Identification systems.
Keywords
BERT, Computational Paralinguistics, Fisher Vector, Native Language Identification, Wav2vec 2.0, Taverne, Theoretical Computer Science, General Computer Science
Citation
Krebbers, D, Kaya, H & Karpov, A 2022, Multi-level Fusion of Fisher Vector Encoded BERT and Wav2vec 2.0 Embeddings for Native Language Identification. in S R M Prasanna, A Karpov, K Samudravijaya & S S Agrawal (eds), Speech and Computer : 24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings. 1 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13721 LNAI, Springer, pp. 391-403. https://doi.org/10.1007/978-3-031-20980-2_34