Utiliza este identificador para citar o vincular este elemento:
|Títulos:||Continuous tracking of the emotion temperature||Autores/as:||Alonso-Hernández, Jesús B.
Cabrera Cruz, Josué Jacob
Travieso-González, Carlos M.
Sánchez Medina, Agustín J.
|Clasificación UNESCO:||33 Ciencias tecnológicas
530602 Innovación tecnológica
|Palabras clave:||Emotional speech recognition
|Fecha de publicación:||2017||Revistas:||Neurocomputing||Resumen:||The speech emotion recognition has a huge potential in human computer interaction applications in fields such as psychology, psychiatry and affective computing technology. The great majority of research works on speech emotion recognition have been made based on record repositories consisting of short sentences recorded under laboratory conditions. In this work, we researched the use of the Emotional Temperature strategy for continuous tracking in long-term samples of speech in which there are emotional changes during the speech. Emotional Temperature uses a few prosodic and paralinguistic features set obtained from a temporal segmentation of the speech signal. The simplicity and limitation of the set, previously validated under laboratory conditions, make it appropriate to be used under real conditions, where the spontaneous speech is continuous and the emotions are expressed in certain moments of the dialogue, given emotional turns. This strategy is robust, offers low computational cost, ability to detect emotional changes and improves the performance of a segmentation based on linguistic aspects. The German Corpus EMO-DB (Berlin Database of Emotional Speech), the English Corpus LDC (Emotional Prosody Speech and Transcripts database), the Polish Emotional Speech Database and RECOLA (Remote Collaborative and Affective Interactions) database are used to validate the system of continuous tracking from emotional speech. Two experimentation conditions are analyzed, dependence and independence on language and gender, using acted and spontaneous speech respectively. In acted conditions, the approach obtained accuracies of 67-97% while under spontaneous conditions, compared to annotation performed by human judges, accuracies of 41-50% were obtained. In comparison with previous studies in continuous emotion recognition, the approach improves the existing results with an accuracy of 9% higher on average. Therefore, this approach has a good performance with low complexity to develop real-time applications or continuous tracking emotional speech applications.||URI:||http://hdl.handle.net/10553/35406||ISSN:||0925-2312||DOI:||10.1016/j.neucom.2016.06.093|
|Aparece en la colección:||Artículos|
Muestra el registro completo del elemento
Vista de página(s) 54
actualizado el 26-nov-2018
Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor, con todos los derechos reservados, a menos que se indique lo contrario.