Skip navigation
Please use this identifier to cite or link to this item: http://repository.iitr.ac.in/handle/123456789/21737
Title: A Segment Level Approach to Speech Emotion Recognition Using Transfer Learning
Authors: Sahoo S.
Kumar P.
Raman, Balasubramanian
Pratim Roy, Partha
Palaiahnakote S.
Sanniti di Baja G.
Wang L.
Yan W.Q.
Published in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
5th Asian Conference on Pattern Recognition, ACPR 2019
Abstract: Speech emotion recognition (SER) is a non-trivial task considering that the very definition of emotion is ambiguous. In this paper, we propose a speech emotion recognition system that predicts emotions for multiple segments of a single audio clip unlike the conventional emotion recognition models that predict the emotion of an entire audio clip directly. The proposed system consists of a pre-trained deep convolutional neural network (CNN) followed by a single layered neural network which predicts the emotion classes of the audio segments. The predictions for the individual segments are finally combined to predict the emotion of a particular clip. We define several new types of accuracies while evaluating the performance of the proposed model. The proposed model attains an accuracy of 68.7% surpassing the current state-of-the-art models in classifying the data into one of the four emotional classes (angry, happy, sad and neutral) when trained and evaluated on IEMOCAP audio-only dataset. © 2020, Springer Nature Switzerland AG.
Citation: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020), 12047 LNCS: 435-448
URI: https://doi.org/10.1007/978-3-030-41299-9_34
http://repository.iitr.ac.in/handle/123456789/21737
Issue Date: 2020
Publisher: Springer
Keywords: Affective computing
Computational paralinguistics
Deep learning
Emotion recognition
Mel spectrograms
ISBN: 9.78E+12
ISSN: 3029743
Author Scopus IDs: 57215662056
57200340329
23135470700
56880478500
Author Affiliations: Sahoo, S., Department of Electrical Engineering, Indian Institute of Technology Madras, Chennai, 600036, India
Kumar, P., Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
Raman, B., Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
Roy, P.P., Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India
Corresponding Author: Sahoo, S.; Department of Electrical Engineering, India; email: sourav.sahoo@smail.iitm.ac.in
Appears in Collections:Conference Publications [CS]

Files in This Item:
There are no files associated with this item.
Show full item record


Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.