SLAM@INTERSPEECH 2014: Penang, Malaysia
- 2nd International Workshop on Speech, Language and Audio in Multimedia, SLAM 2014, Penang, Malaysia, September 11-12, 2014. ISCA 2014
Keynote Papers
- Shrikanth S. Narayanan:
Behavioral informatics from multimodal human interaction cues. 1
- Min-Yen Kan:
Opportunities for multimedia analysis in scholarly digital libraries. 2
Multimodality, Event Detection
- Benjamin Elizalde, Mirco Ravanelli, Karl Ni, Damian Borth, Gerald Friedland:
Audio-concept features and hidden Markov models for multimedia event detection. 3-8
- Carl Quillen, Kara Greenfield, William M. Campbell:
Talking head detection by likelihood-ratio test. 9-13
- Fernando Fernández Martínez, Alejandro Hernández-García, Ascensión Gallardo-Antolín, Fernando Díaz-de-María:
Combining audio-visual features for viewers' perception classification of Youtube car commercials. 14-18
NLP in Speech and Video Processing
- Anca-Roxana Simon, Camille Guinaudeau, Pascale Sébillot, Guillaume Gravier:
Investigating domain-independent NLP techniques for precise target selection in video hyperlinking. 19-23
- Géraldine Damnati, Benoît Favre, Frédéric Béchet, Delphine Charlet:
Person name recognition and linking from overlay text in TV broadcast shows. 24-28
- Irina Illina, Dominique Fohr, Georges Linarès:
Proper name retrieval from diachronic documents for automatic speech transcription using lexical and temporal context. 29-33
Speaker-Related Processing in Multimedia
- Guillaume Bernard, Olivier Galibert, Juliette Kahn:
The second official REPERE evaluation. 34-38
- Jaime Lorenzo-Trueba, Julián D. Echeverry-Correa, Roberto Barra-Chicote, Rubén San-Segundo-Hernández, Javier Ferreiros, Ascensión Gallardo-Antolín, Junichi Yamagishi, Simon King, Juan Manuel Montero-Martínez:
Development of a genre-dependent TTS system with cross-speaker speaking-style transplantation. 39-42
- Mateusz Budnik, Johann Poignant, Laurent Besacier, Georges Quénot:
Active selection with label propagation for minimizing human effort in speaker annotation of TV shows. 43-47
- Nilesh Madhu, Sung-Kyo Jung:
Speaker recognition performance under ideal-knowledge noise suppression: an investigation. 48-52
Multimedia-Related Issues
- Yen-Min Jasmina Khaw, Tien-Ping Tan:
Preparation of MaDiTS corpus for Malay dialect translation and speech synthesis system. 53-57
- Cevahir Parlak, Banu Diri, Fikret Gürgen:
A cross-corpus experiment in speech emotion recognition. 58-61
- Shammur Absar Chowdhury, Giuseppe Riccardi, Firoj Alam:
Unsupervised recognition and clustering of speech overlaps in spoken conversations. 62-66
- Taejin Park, Seungkwon Beack, Taejin Lee:
Noise robust feature for automatic speech recognition based on mel-spectrogram gradient histogram. 67-71