default search action
Björn W. Schuller
Person information
- affiliation: Imperial College London, GLAM, UK
- affiliation: University of Augsburg, Department of Computer Science, Germany
- affiliation (former): University of Passau, Faculty of Computer Science and Mathematics, Germany
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j271]Adria Mallol-Ragolta, Björn W. Schuller:
Coupling Sentiment and Arousal Analysis Towards an Affective Dialogue Manager. IEEE Access 12: 20654-20662 (2024) - [j270]Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn W. Schuller, Carlos Busso:
emoDARTS: Joint Optimization of CNN and Sequential Neural Network Architectures for Superior Speech Emotion Recognition. IEEE Access 12: 110492-110503 (2024) - [j269]Andreas Triantafyllopoulos, Anastasia Semertzidou, Meishu Song, Florian B. Pokorny, Björn W. Schuller:
Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection. Biomed. Signal Process. Control. 88(Part A): 105642 (2024) - [j268]Zhao Ren, Yi Chang, Thanh Tam Nguyen, Yang Tan, Kun Qian, Björn W. Schuller:
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era. IEEE Comput. Intell. Mag. 19(3): 42-57 (2024) - [j267]Helena Bilandzic, Anja Kalch, Susanne Kinnebrock, Benedikt Buchner, Ingo Kollar, Björn W. Schuller:
Discursive Resilience. Datenschutz und Datensicherheit (dud) 48(6): 341-345 (2024) - [j266]Chao Li, Ning Bian, Ziping Zhao, Haishuai Wang, Björn W. Schuller:
Multi-view domain-adaptive representation learning for EEG-based emotion recognition. Inf. Fusion 104: 102156 (2024) - [j265]Jiangjian Xie, Yuwei Shi, Dongming Ni, Manuel Milling, Shuo Liu, Junguo Zhang, Kun Qian, Björn W. Schuller:
Automatic Bird Sound Source Separation Based on Passive Acoustic Devices in Wild Environment. IEEE Internet Things J. 11(9): 16604-16617 (2024) - [j264]Manuel Milling, Shuo Liu, Andreas Triantafyllopoulos, Ilhan Aslan, Björn W. Schuller:
Audio Enhancement for Computer Audition - An Iterative Training Paradigm Using Sample Importance. J. Comput. Sci. Technol. 39(4): 895-911 (2024) - [j263]Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker, Jobie Budd, Richard Payne, Emma Karoune, David Hurley, Alexander Titcomb, Sabrina Egglestone, Ana Tendero Cañadas, Lorraine Butler, Radka Jersakova, Jonathon Mellor, Selina Patel, Tracey Thornley, Peter Diggle, Sylvia Richardson, Josef Packham, Björn W. Schuller, Davide Pigoli, Steven G. Gilmour, Stephen J. Roberts, Christopher C. Holmes:
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers. Nat. Mac. Intell. 6(2): 229-242 (2024) - [j262]Mani Kumar Tellamekala, Shahin Amiriparian, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel F. Valstar:
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 805-822 (2024) - [j261]Georgios Rizos, Jenna Lawson, Simon Mitchell, Pranay Shah, Xin Wen, Cristina Banks-Leite, Robert M. Ewers, Björn W. Schuller:
Propagating variational model uncertainty for bioacoustic call label smoothing. Patterns 5(3): 100932 (2024) - [j260]Georgios Rizos, Jenna L. Lawson, Björn W. Schuller:
Meet the authors: Georgios Rizos, Jenna L. Lawson, and Björn W. Schuller. Patterns 5(3): 100952 (2024) - [j259]Mani Kumar Tellamekala, Ömer Sümer, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel F. Valstar:
Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities? IEEE Trans. Affect. Comput. 15(2): 535-548 (2024) - [j258]Rui Liu, Haolin Zuo, Zheng Lian, Björn W. Schuller, Haizhou Li:
Contrastive Learning Based Modality-Invariant Feature Acquisition for Robust Multimodal Emotion Recognition With Missing Modalities. IEEE Trans. Affect. Comput. 15(4): 1856-1873 (2024) - [j257]Mostafa M. Amin, Rui Mao, Erik Cambria, Björn W. Schuller:
A Wide Evaluation of ChatGPT on Affective Computing Tasks. IEEE Trans. Affect. Comput. 15(4): 2204-2212 (2024) - [j256]Cong Pang, Jingjie Fan, Qifan Shen, Yue Xie, Chengwei Huang, Björn W. Schuller:
Multichannel Speech Enhancement Based on Neural Beamforming and a Context-Focused Post-Filtering Network. IEEE Trans. Cogn. Dev. Syst. 16(3): 973-983 (2024) - [j255]Ruiyu Liang, Yue Xie, Jiaming Cheng, Cong Pang, Björn W. Schuller:
A Non-Invasive Speech Quality Evaluation Algorithm for Hearing Aids With Multi-Head Self-Attention and Audiogram-Based Features. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2166-2176 (2024) - [j254]Jiaming Cheng, Ruiyu Liang, Lin Zhou, Li Zhao, Chengwei Huang, Björn W. Schuller:
Residual Fusion Probabilistic Knowledge Distillation for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2680-2691 (2024) - [j253]Wanyong Qiu, Chen Quan, Lixian Zhu, Yongzi Yu, Zhihua Wang, Yu Ma, Mengkai Sun, Yi Chang, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Heart Sound Abnormality Detection From Multi-Institutional Collaboration: Introducing a Federated Learning Framework. IEEE Trans. Biomed. Eng. 71(10): 2802-2813 (2024) - [j252]Zixing Zhang, Liyizhe Peng, Tao Pang, Jing Han, Huan Zhao, Björn W. Schuller:
Refashioning Emotion Recognition Modeling: The Advent of Generalized Large Models. IEEE Trans. Comput. Soc. Syst. 11(5): 6690-6704 (2024) - [j251]Mingyue Niu, Ya Li, Jianhua Tao, Xiuzhuang Zhou, Björn W. Schuller:
DepressionMLP: A Multi-Layer Perceptron Architecture for Automatic Depression Level Prediction via Facial Keypoints and Action Units. IEEE Trans. Circuits Syst. Video Technol. 34(9): 8924-8938 (2024) - [j250]Lixian Zhu, Wanyong Qiu, Yu Ma, Fuze Tian, Mengkai Sun, Zhihua Wang, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
LEPCNet: A Lightweight End-to-End PCG Classification Neural Network Model for Wearable Devices. IEEE Trans. Instrum. Meas. 73: 1-11 (2024) - [j249]Wanyong Qiu, Yifan Feng, Yuying Li, Yi Chang, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Fed-MStacking: Heterogeneous Federated Learning With Stacking Misaligned Labels for Abnormal Heart Sound Detection. IEEE J. Biomed. Health Informatics 28(9): 5055-5066 (2024) - [j248]Chao Li, Feng Wang, Ziping Zhao, Haishuai Wang, Björn W. Schuller:
Attention-Based Temporal Graph Representation Learning for EEG-Based Emotion Recognition. IEEE J. Biomed. Health Informatics 28(10): 5755-5767 (2024) - [j247]Vincent Karas, Dagmar M. Schuller, Björn W. Schuller:
Audiovisual Affect Recognition for Autonomous Vehicles: Applications and Future Agendas. IEEE Trans. Intell. Transp. Syst. 25(6): 4918-4932 (2024) - [c657]Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn W. Schuller:
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning. ACL (Findings) 2024: 7144-7159 - [c656]Nikolai Körber, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn W. Schuller:
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation. ECCV (35) 2024: 202-220 - [c655]Jiahao Ji, Lixian Zhu, Haojie Zhang, Kun Qian, Kele Xu, Zikai Song, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Weight Light, Hear Right: Heart Sound Classification with a Low-Complexity Model. EUSIPCO 2024: 326-330 - [c654]Philipp Wagner, Andreas Triantafyllopoulos, Alexander Gebhard, Björn W. Schuller:
Audio-Based Step-Count Estimation for Running - Windowing and Neural Network Baselines. EUSIPCO 2024: 331-335 - [c653]Meishu Song, Ilhan Aslan, Emilia Parada-Cabaleiro, Zijiang Yang, Elisabeth André, Yoshiharu Yamamoto, Björn W. Schuller:
Lecture Video Highlights Detection from Speech. EUSIPCO 2024: 361-365 - [c652]Gauri Deshpande, Björn W. Schuller:
Analysis of Respiratory Health Indicators in Speech-Breathing-Patterns. EUSIPCO 2024: 371-375 - [c651]Harish Battula, Gauri Deshpande, Sachin Patel, Björn W. Schuller:
Heart Rate from Read-Speech Influenced by Physical Exercise. EUSIPCO 2024: 376-380 - [c650]Meishu Song, Xin Jing, Emilia Parada-Cabaleiro, Zijiang Yang, Yoshiharu Yamamoto, Björn W. Schuller:
Temporal Oriented ResNet for Gaming Dimensional Emotion Prediction. EUSIPCO 2024: 596-600 - [c649]Andreas Triantafyllopoulos, Alexander Gebhard, Manuel Milling, Simon David Noel Rampp, Björn W. Schuller:
An Automatic Analysis of Ultrasound Vocalisations for the Prediction of Interaction Context in Captive Egyptian Fruit Bats. EUSIPCO 2024: 1277-1281 - [c648]Adria Mallol-Ragolta, Anika A. Spiesberger, Andreas Triantafyllopoulos, Björn W. Schuller:
Personalised Anomaly Detectors and Prototypical Representations for Relapse Detection from Wearable-Based Digital Phenotyping. ICASSP Workshops 2024: 103-104 - [c647]Manuel Milling, Andreas Triantafyllopoulos, Iosif Tsangko, Simon David Noel Rampp, Björn Wolfgang Schuller:
Bringing the Discussion of Minima Sharpness to the Audio Domain: A Filter-Normalised Evaluation for Acoustic Scene Classification. ICASSP 2024: 391-395 - [c646]Zixing Zhang, Tao Pang, Jing Han, Björn W. Schuller:
Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation. ICASSP 2024: 861-865 - [c645]Alexander Gebhard, Andreas Triantafyllopoulos, Teresa Bez, Lukas Christ, Alexander Kathan, Björn W. Schuller:
Exploring Meta Information for Audio-Based Zero-Shot Bird Classification. ICASSP 2024: 1211-1215 - [c644]Chengyu Yuan, Hao Xiong, Guoqing Shangguan, Hualei Shen, Dong Liu, Haojie Zhang, Zhonghua Liu, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto, Shlomo Berkovsky:
Deep Fusion of Shifted MLP and CNN for Medical Image Segmentation. ICASSP 2024: 1676-1680 - [c643]Alican Akman, Björn W. Schuller:
AttHear: Explaining Audio Transformers Using Attention-Aware NMF. ICASSP 2024: 7015-7019 - [c642]Chia-Hsin Lin, Charles Jones, Björn W. Schuller, Harry Coppock, Alican Akman:
Synthia's Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio. ICASSP 2024: 7450-7454 - [c641]Zhongren Dong, Zixing Zhang, Weixiang Xu, Jing Han, Jianjun Ou, Björn W. Schuller:
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech. ICASSP 2024: 11246-11250 - [c640]Liyizhe Peng, Zixing Zhang, Tao Pang, Jing Han, Huan Zhao, Hao Chen, Björn W. Schuller:
Customising General Large Language Models for Specialised Emotion Recognition Tasks. ICASSP 2024: 11326-11330 - [c639]Yong Wang, Cheng Lu, Hailun Lian, Yan Zhao, Björn W. Schuller, Yuan Zong, Wenming Zheng:
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition. ICASSP 2024: 11646-11650 - [c638]Cheng Lu, Yuan Zong, Hailun Lian, Yan Zhao, Björn W. Schuller, Wenming Zheng:
Improving Speaker-Independent Speech Emotion Recognition using Dynamic Joint Distribution Adaptation. ICASSP 2024: 11696-11700 - [c637]Yan Zhao, Jincen Wang, Cheng Lu, Sunan Li, Björn W. Schuller, Yuan Zong, Wenming Zheng:
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition. ICASSP 2024: 11846-11850 - [c636]Xiangheng He, Junjie Chen, Björn W. Schuller:
Task Selection and Assignment for Multi-Modal Multi-Task Dialogue Act Classification with Non-Stationary Multi-Armed Bandits. ICASSP 2024: 12091-12095 - [c635]Gauri Deshpande, Björn W. Schuller:
The Effect of Lung Volume on Glottal Parameters: An Empirical Study. ICPR (20) 2024: 304-315 - [c634]Ziping Zhao, Shizhao Liu, Mingyue Niu, Haishuai Wang, Björn W. Schuller:
Dense Coordinate Channel Attention Network for Depression Level Estimation from Speech. ICPR (13) 2024: 402-413 - [c633]Haoyu Chen, Björn W. Schuller, Ehsan Adeli, Guoying Zhao:
The 2nd Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA) 2024: Dataset and Results. MiGA@IJCAI 2024 - [c632]Zheng Lian, Bin Liu, Rui Liu, Kele Xu, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MRAC'24 Track 2: 2nd International Workshop on Multimodal and Responsible Affective Computing. MRAC@MM 2024: 39-40 - [c631]Zheng Lian, Haiyang Sun, Licai Sun, Zhuofan Wen, Siyuan Zhang, Shun Chen, Hao Gu, Jinming Zhao, Ziyang Ma, Xie Chen, Jiangyan Yi, Rui Liu, Kele Xu, Bin Liu, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition. MRAC@MM 2024: 41-48 - [c630]Shahin Amiriparian, Lukas Christ, Alexander Kathan, Maurice Gerczuk, Niklas Müller, Steffen Klug, Lukas Stappen, Andreas König, Erik Cambria, Björn W. Schuller, Simone Eulitz:
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition. MuSe@ACM Multimedia 2024: 1-9 - [c629]Lukas Christ, Shahin Amiriparian, Andreas König, Simone Eulitz, Erik Cambria, Björn W. Schuller:
MuSe '24: The 5th Multimodal Sentiment Analysis Challenge and Workshop: Social Perception & Humor. MuSe@ACM Multimedia 2024: 10-11 - [c628]Selim Solmaz, Pamela Innerwinkler, Michal Wójcik, Kailin Tong, Elena Politi, George Dimitrakopoulos, Patrick Purucker, Alfred Höß, Björn W. Schuller, Reiner John:
Robust Robotic Search and Rescue in Harsh Environments: An Example and Open Challenges. ROSE 2024: 1-8 - [e27]Haoyu Chen, Björn W. Schuller, Ehsan Adeli, Guoying Zhao:
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju, Korea, August 04, 2024. CEUR Workshop Proceedings 3848, CEUR-WS.org 2024 [contents] - [e26]Jianhua Tao, Shreya Ghosh, Zheng Lian, Zhixi Cai, Björn W. Schuller, Abhinav Dhall, Guoying Zhao, Dimitrios Kollias, Erik Cambria, Roland Goecke, Tom Gedeon:
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, MRAC 2024, Melbourne VIC, Australia, 28 October 2024- 1 November 2024. ACM 2024, ISBN 979-8-4007-1203-6 [contents] - [e25]Shahin Amiriparian, Lukas Christ, Simone Eulitz, Andreas König, Erik Cambria, Björn W. Schuller:
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, MuSe2024, Melbourne, VIC, Australia, 28 October 2024- 1 November 2024. ACM 2024, ISBN 979-8-4007-1199-2 [contents] - [i220]Cheng Lu, Yuan Zong, Hailun Lian, Yan Zhao, Björn W. Schuller, Wenming Zheng:
Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation. CoRR abs/2401.09752 (2024) - [i219]Yong Wang, Cheng Lu, Hailun Lian, Yan Zhao, Björn W. Schuller, Yuan Zong, Wenming Zheng:
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition. CoRR abs/2401.10536 (2024) - [i218]Yan Zhao, Jincen Wang, Cheng Lu, Sunan Li, Björn W. Schuller, Yuan Zong, Wenming Zheng:
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition. CoRR abs/2401.12925 (2024) - [i217]Yi Chang, Zhao Ren, Zixing Zhang, Xin Jing, Kun Qian, Xi Shao, Bin Hu, Tanja Schultz, Björn W. Schuller:
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition. CoRR abs/2402.01227 (2024) - [i216]Mostafa M. Amin, Björn W. Schuller:
On Prompt Sensitivity of ChatGPT in Affective Computing. CoRR abs/2403.14006 (2024) - [i215]Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn W. Schuller, Carlos Busso:
emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition. CoRR abs/2403.14083 (2024) - [i214]Shahin Amiriparian, Maurice Gerczuk, Justina Lutz, Wolfgang Strube, Irina Papazova, Alkomiet Hasan, Alexander Kathan, Björn W. Schuller:
Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine. CoRR abs/2404.12132 (2024) - [i213]Zheng Lian, Haiyang Sun, Licai Sun, Zhuofan Wen, Siyuan Zhang, Shun Chen, Hao Gu, Jinming Zhao, Ziyang Ma, Xie Chen, Jiangyan Yi, Rui Liu, Kele Xu, Bin Liu, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition. CoRR abs/2404.17113 (2024) - [i212]Andreas Triantafyllopoulos, Björn W. Schuller:
Expressivity and Speech Synthesis. CoRR abs/2404.19363 (2024) - [i211]Zhongren Dong, Zixing Zhang, Weixiang Xu, Jing Han, Jianjun Ou, Björn W. Schuller:
HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech. CoRR abs/2405.03952 (2024) - [i210]Zixing Zhang, Tao Pang, Jing Han, Björn W. Schuller:
Intelligent Cardiac Auscultation for Murmur Detection via Parallel-Attentive Models with Uncertainty Estimation. CoRR abs/2405.03953 (2024) - [i209]Rong Gao, Xin Liu, Bohao Xing, Zitong Yu, Björn W. Schuller, Heikki Kälviäinen:
Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding. CoRR abs/2405.13206 (2024) - [i208]Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn W. Schuller:
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning. CoRR abs/2406.02251 (2024) - [i207]Andreas Triantafyllopoulos, Alexander Gebhard, Manuel Milling, Simon David Noel Rampp, Björn W. Schuller:
An automatic analysis of ultrasound vocalisations for the prediction of interaction context in captive Egyptian fruit bats. CoRR abs/2406.06332 (2024) - [i206]Philipp Wagner, Andreas Triantafyllopoulos, Alexander Gebhard, Björn W. Schuller:
Audio-based Step-count Estimation for Running - Windowing and Neural Network Baselines. CoRR abs/2406.06339 (2024) - [i205]Andreas Triantafyllopoulos, Anton Batliner, Wolfgang Mayr, Markus Fendler, Florian B. Pokorny, Maurice Gerczuk, Shahin Amiriparian, Thomas M. Berghaus, Björn W. Schuller:
Sustained Vowels for Pre- vs Post-Treatment COPD Classification. CoRR abs/2406.06355 (2024) - [i204]Andreas Triantafyllopoulos, Anton Batliner, Simon David Noel Rampp, Manuel Milling, Björn W. Schuller:
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition. CoRR abs/2406.06401 (2024) - [i203]Andreas Triantafyllopoulos, Björn W. Schuller:
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition. CoRR abs/2406.06665 (2024) - [i202]Xin Jing, Andreas Triantafyllopoulos, Björn W. Schuller:
ParaCLAP - Towards a general language-audio model for computational paralinguistic tasks. CoRR abs/2406.07203 (2024) - [i201]Shahin Amiriparian, Lukas Christ, Alexander Kathan, Maurice Gerczuk, Niklas Müller, Steffen Klug, Lukas Stappen, Andreas König, Erik Cambria, Björn W. Schuller, Simone Eulitz:
The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition. CoRR abs/2406.07753 (2024) - [i200]Xin Jing, Luyang Zhang, Jiangjian Xie, Alexander Gebhard, Alice Baird, Björn W. Schuller:
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition. CoRR abs/2406.08517 (2024) - [i199]Shahin Amiriparian, Filip Packan, Maurice Gerczuk, Björn W. Schuller:
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets. CoRR abs/2406.10275 (2024) - [i198]Yi Chang, Zhao Ren, Zhonghao Zhao, Thanh Tam Nguyen, Kun Qian, Tanja Schultz, Björn W. Schuller:
Speech Emotion Recognition under Resource Constraints with Data Distillation. CoRR abs/2406.15119 (2024) - [i197]Lukas Christ, Shahin Amiriparian, Friederike Hawighorst, Ann-Kathrin Schill, Angelo Boutalikakis, Lorenz Graf-Vlachy, Andreas König, Björn W. Schuller:
This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach. CoRR abs/2406.17667 (2024) - [i196]Oliver Schrüfer, Manuel Milling, Felix Burkhardt, Florian Eyben, Björn W. Schuller:
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition. CoRR abs/2407.01143 (2024) - [i195]Rui Liu, Haolin Zuo, Zheng Lian, Xiaofen Xing, Björn W. Schuller, Haizhou Li:
Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset. CoRR abs/2407.02751 (2024) - [i194]Maurice Gerczuk, Shahin Amiriparian, Justina Lutz, Wolfgang Strube, Irina Papazova, Alkomiet Hasan, Björn W. Schuller:
Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment. CoRR abs/2407.11012 (2024) - [i193]Andreas Triantafyllopoulos, Iosif Tsangko, Alexander Gebhard, Annamaria Mesaros, Tuomas Virtanen, Björn W. Schuller:
Computer Audition: From Task-Specific Machine Learning to Foundation Models. CoRR abs/2407.15672 (2024) - [i192]Anika A. Spiesberger, Andreas Triantafyllopoulos, Iosif Tsangko, Björn W. Schuller:
Abusive Speech Detection in Indic Languages Using Acoustic Features. CoRR abs/2407.20808 (2024) - [i191]Manuel Milling, Shuo Liu, Andreas Triantafyllopoulos, Ilhan Aslan, Björn W. Schuller:
Audio Enhancement for Computer Audition - An Iterative Training Paradigm Using Sample Importance. CoRR abs/2408.06264 (2024) - [i190]Dionyssos Kounadis-Bastian, Oliver Schrüfer, Anna Derington, Hagen Wierstorf, Florian Eyben, Felix Burkhardt, Björn W. Schuller:
Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition. CoRR abs/2408.13920 (2024) - [i189]Mohammad Nadeem, Shahab Saquib Sohail, Erik Cambria, Björn W. Schuller, Amir Hussain:
Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation. CoRR abs/2409.00105 (2024) - [i188]Xin Jing, Kun Zhou, Andreas Triantafyllopoulos, Björn W. Schuller:
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models. CoRR abs/2409.06451 (2024) - [i187]Björn W. Schuller, Adria Mallol-Ragolta, Alejandro Peña Almansa, Iosif Tsangko, Mostafa M. Amin, Anastasia Semertzidou, Lukas Christ, Shahin Amiriparian:
Affective Computing Has Changed: The Foundation Model Disruption. CoRR abs/2409.08907 (2024) - [i186]Zhengxin Joseph Ye, Björn W. Schuller:
Trading through Earnings Seasons using Self-Supervised Contrastive Representation Learning. CoRR abs/2409.17392 (2024) - [i185]Nikolai Körber, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn W. Schuller:
PerCo (SD): Open Perceptual Compression. CoRR abs/2409.20255 (2024) - [i184]Alican Akman, Qiyang Sun, Björn W. Schuller:
Audio Explanation Synthesis with Generative Foundation Models. CoRR abs/2410.07530 (2024) - [i183]Qiyang Sun, Alican Akman, Xin Jing, Manuel Milling, Björn W. Schuller:
Audio-based Kinship Verification Using Age Domain Conversion. CoRR abs/2410.11120 (2024) - [i182]Simon David Noel Rampp, Manuel Milling, Andreas Triantafyllopoulos, Björn W. Schuller:
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning. CoRR abs/2411.00973 (2024) - 2023
- [j246]Srividya Tirunellai Rajamani, Kumar T. Rajamani, Ashwin Venkateshvaran, Andreas Triantafyllopoulos, Alexander Kathan, Björn W. Schuller:
Toward Detecting and Addressing Corner Cases in Deep Learning Based Medical Image Segmentation. IEEE Access 11: 95334-95345 (2023) - [j245]Jiangjian Xie, Yujie Zhong, Junguo Zhang, Changchun Zhang, Björn W. Schuller:
A weakly supervised spatial group attention network for fine-grained visual recognition. Appl. Intell. 53(20): 23301-23315 (2023) - [j244]Zhihua Wang, Kun Qian, Houguang Liu, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Exploring interpretable representations for heart sound abnormality detection. Biomed. Signal Process. Control. 82: 104569 (2023) - [j243]Sebastian P. Bayerl, Maurice Gerczuk, Anton Batliner, Christian Bergler, Shahin Amiriparian, Björn W. Schuller, Elmar Nöth, Korbinian Riedhammer:
Classification of stuttering - The ComParE challenge and beyond. Comput. Speech Lang. 81: 101519 (2023) - [j242]Jingtan Li, Mengkai Sun, Zhonghao Zhao, Xingcan Li, Gaigai Li, Chen Wu, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy. EURASIP J. Audio Speech Music. Process. 2023(1): 43 (2023) - [j241]Zhengxin Joseph Ye, Björn W. Schuller:
Human-aligned trading by imitative multi-loss reinforcement learning. Expert Syst. Appl. 234: 120939 (2023) - [j240]Mostafa M. Amin, Erik Cambria, Björn W. Schuller:
Will Affective Computing Emerge From Foundation Models and General Artificial Intelligence? A First Evaluation of ChatGPT. IEEE Intell. Syst. 38(2): 15-23 (2023) - [j239]Mostafa M. Amin, Erik Cambria, Björn W. Schuller:
Can ChatGPT's Responses Boost Traditional Natural Language Processing? IEEE Intell. Syst. 38(5): 5-11 (2023) - [j238]Björn W. Schuller, Shahin Amiriparian, Anton Batliner, Alexander Gebhard, Maurice Gerczuk, Vincent Karas, Alexander Kathan, Lennart Seizer, Johanna Löchner:
Computational charisma - A brick by brick blueprint for building charismatic artificial intelligence. Frontiers Comput. Sci. 5 (2023) - [j237]Andreas Triantafyllopoulos, Uwe D. Reichel, Shuo Liu, Stephan Huber, Florian Eyben, Björn W. Schuller:
Multistage linguistic conditioning of convolutional layers for speech emotion recognition. Frontiers Comput. Sci. 5 (2023) - [j236]Harry Coppock, Alican Akman, Christian Bergler, Maurice Gerczuk, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Jing Han, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Panagiotis Tzirakis, Anton Batliner, Cecilia Mascolo, Björn W. Schuller:
A summary of the ComParE COVID-19 challenges. Frontiers Digit. Health 5 (2023) - [j235]Kun Qian, Gyorgy Fazekas, Shengchen Li, Zijin Li, Björn W. Schuller:
Editorial: Human-centred computer audition: sound, music, and healthcare. Frontiers Digit. Health 5 (2023) - [j234]Andreas Triantafyllopoulos, Alexander Kathan, Alice Baird, Lukas Christ, Alexander Gebhard, Maurice Gerczuk, Vincent Karas, Tobias Hübner, Xin Jing, Shuo Liu, Adria Mallol-Ragolta, Manuel Milling, Sandra Ottl, Anastasia Semertzidou, Srividya Tirunellai Rajamani, Tianhao Yan, Zijiang Yang, Judith Dineley, Shahin Amiriparian, Katrin D. Bartl-Pokorny, Anton Batliner, Florian B. Pokorny, Björn W. Schuller:
HEAR4Health: a blueprint for making computer audition a staple of modern healthcare. Frontiers Digit. Health 5 (2023) - [j233]Anton Batliner, Michael Neumann, Felix Burkhardt, Alice Baird, Sarina Meyer, Ngoc Thang Vu, Björn W. Schuller:
Ethical Awareness in Paralinguistics: A Taxonomy of Applications. Int. J. Hum. Comput. Interact. 39(9): 1904-1921 (2023) - [j232]Johannes Wagner, Andreas Triantafyllopoulos, Hagen Wierstorf, Maximilian Schmitt, Felix Burkhardt, Florian Eyben, Björn W. Schuller:
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap. IEEE Trans. Pattern Anal. Mach. Intell. 45(9): 10745-10759 (2023) - [j231]Maurice Gerczuk, Andreas Triantafyllopoulos, Shahin Amiriparian, Alexander Kathan, Jonathan Bauer, Matthias Berking, Björn W. Schuller:
Zero-shot personalization of speech foundation models for depressed mood monitoring. Patterns 4(11): 100873 (2023) - [j230]Rodrigo Mira, Eduardo Coutinho, Emilia Parada-Cabaleiro, Björn W. Schuller:
Automated composition of Galician Xota - tuning RNN-based composers for specific musical styles using deep Q-learning. PeerJ Comput. Sci. 9: e1356 (2023) - [j229]Björn W. Schuller, Matti Pietikäinen:
Affective Computing [Scanning the Issue]. Proc. IEEE 111(10): 1139-1141 (2023) - [j228]Andreas Triantafyllopoulos, Björn W. Schuller, Gökçe Iymen, Tevfik Metin Sezgin, Xiangheng He, Zijiang Yang, Panagiotis Tzirakis, Shuo Liu, Silvan Mertes, Elisabeth André, Ruibo Fu, Jianhua Tao:
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era. Proc. IEEE 111(10): 1355-1381 (2023) - [j227]Jiaming Cheng, Ruiyu Liang, Li Zhao, Chengwei Huang, Björn W. Schuller:
Speech Denoising and Compensation for Hearing Aids Using an FTCRN-Based Metric GAN. IEEE Signal Process. Lett. 30: 374-378 (2023) - [j226]Shahin Amiriparian, Björn W. Schuller, Nabiha Asghar, Heiga Zen, Felix Burkhardt:
Guest Editorial: Special Issue on Affective Speech and Language Synthesis, Generation, and Conversion. IEEE Trans. Affect. Comput. 14(1): 3-5 (2023) - [j225]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023) - [j224]Lukas Stappen, Alice Baird, Lea Schumann, Björn W. Schuller:
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements. IEEE Trans. Affect. Comput. 14(2): 1334-1350 (2023) - [j223]Maurice Gerczuk, Shahin Amiriparian, Sandra Ottl, Björn W. Schuller:
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition. IEEE Trans. Affect. Comput. 14(2): 1472-1487 (2023) - [j222]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Junaid Qadir, Björn W. Schuller:
Survey of Deep Representation Learning for Speech Emotion Recognition. IEEE Trans. Affect. Comput. 14(2): 1634-1654 (2023) - [j221]Frank Xing, Björn W. Schuller, Iti Chaturvedi, Erik Cambria, Amir Hussain:
Guest Editorial Neurosymbolic AI for Sentiment Analysis. IEEE Trans. Affect. Comput. 14(3): 1711-1715 (2023) - [j220]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition. IEEE Trans. Affect. Comput. 14(3): 1912-1926 (2023) - [j219]Mingyue Niu, Ziping Zhao, Jianhua Tao, Ya Li, Björn W. Schuller:
Dual Attention and Element Recalibration Networks for Automatic Depression Level Prediction. IEEE Trans. Affect. Comput. 14(3): 1954-1965 (2023) - [j218]Decky Aspandi, Federico Sukno, Björn W. Schuller, Xavier Binefa:
Audio-Visual Gated-Sequenced Neural Networks for Affect Recognition. IEEE Trans. Affect. Comput. 14(3): 2193-2208 (2023) - [j217]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis With Mixed Emotions. IEEE Trans. Affect. Comput. 14(4): 3120-3134 (2023) - [j216]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Multitask Learning From Augmented Auxiliary Data for Improving Speech Emotion Recognition. IEEE Trans. Affect. Comput. 14(4): 3164-3176 (2023) - [j215]Kun Qian, Björn W. Schuller, Xiaohong Guan, Bin Hu:
Intelligent Music Intervention for Mental Disorders: Insights and Perspectives. IEEE Trans. Comput. Soc. Syst. 10(1): 2-9 (2023) - [j214]Guihua Tian, Kun Qian, Xinyi Li, Mengkai Sun, Hao Jiang, Wanyong Qiu, Xiaoming Xie, Zhonghao Zhao, Liangqing Huang, Siyan Luo, Tianxing Guo, Ran Cai, Zhihua Wang, Björn W. Schuller:
Can a Holistic View Facilitate the Development of Intelligent Traditional Chinese Medicine? A Survey. IEEE Trans. Comput. Soc. Syst. 10(2): 700-713 (2023) - [j213]Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic:
End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks. IEEE Trans. Cybern. 53(6): 3454-3466 (2023) - [j212]Zhao Ren, Björn W. Schuller, Björn M. Eskofier, Thanh Tam Nguyen, Wolfgang Nejdl:
Guest Editorial Trustworthy and Collaborative AI for Personalised Healthcare Through Edge-of-Things. IEEE J. Biomed. Health Informatics 27(11): 5213-5215 (2023) - [c627]Shahin Amiriparian, Alexander Meiners, Daniel Lukas Rothenpieler, Alexander Kathan, Maurice Gerczuk, Björn W. Schuller:
Universal Lesion Detection Utilising Cascading R-CNNs and a Novel Video Pretraining Method. EMBC 2023: 1-4 - [c626]Zhihao Bao, Kun Qian, Zhonghao Zhao, Mengkai Sun, Ruolan Huang, Dewen Xu, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Somatisation Disorder Detection via Speech: Introducing a Self-Supervised Learning Model. EMBC 2023: 1-4 - [c625]Gauri Deshpande, Björn W. Schuller, Pallavi Deshpande, Anuradha Rajiv Joshi:
Automatic Breathing Pattern Analysis from Reading-Speech Signals. EMBC 2023: 1-4 - [c624]Maurice Gerczuk, Shahin Amiriparian, Alexander Kathan, Jonathan Bauer, Matthias Berking, Björn W. Schuller:
Noise Robust Recognition of Depression Status and Treatment Response from Speech via Unsupervised Feature Aggregation. EMBC 2023: 1-4 - [c623]Yagna Gudipalli, Gauri Deshpande, Sachin Patel, Björn W. Schuller:
Deep Modelling Strategies for Human Confidence Classification using Audio-visual Data. EMBC 2023: 1-4 - [c622]Gang Luo, Shuting Sun, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
How does Music Affect Your Brain? A Pilot Study on EEG and Music Features for Automatic Analysis. EMBC 2023: 1-4 - [c621]Manuel Milling, Michelle Lienhart, Yuliia Oksymets, Alexander Gebhard, Manuel Brugger, Christoph Westerhausen, Björn W. Schuller:
NeuroCellCentreDB: Exploring a Novel Dataset for Neuron-like Cell Centre Detection with Deep Neural Networks. EMBC 2023: 1-4 - [c620]Srividya Tirunellai Rajamani, Kumar T. Rajamani, Björn W. Schuller:
A novel and simple approach to regularise attention frameworks and its efficacy in segmentation. EMBC 2023: 1-4 - [c619]Zikai Song, Lixian Zhu, Yiyan Wang, Mengkai Sun, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Cutting Weights of Deep Learning Models for Heart Sound Classification: Introducing a Knowledge Distillation Approach. EMBC 2023: 1-4 - [c618]Cuiping Zhu, Zhonghao Zhao, Yang Tan, Mengkai Sun, Kun Qian, Tao Jiang, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Less is More: A Novel Feature Extraction Method for Heart Sound Classification via Fractal Transformation. EMBC 2023: 1-4 - [c617]Fabrizio Nunnari, Annette Rios, Uwe D. Reichel, Chirag Bhuvaneshwara, Panagiotis Paraskevas Filntisis, Petros Maragos, Felix Burkhardt, Florian Eyben, Björn W. Schuller, Sarah Ebling:
Multimodal Recognition of Valence, Arousal and Dominance via Late-Fusion of Text, Audio and Facial Expressions. ESANN 2023 - [c616]Gauri Deshpande, Yagna Gudipalli, Sachin Patel, Björn W. Schuller:
Applying Speech Derived Breathing Patterns to Automatically Classify Human Confidence. EUSIPCO 2023: 1335-1339 - [c615]Runze Ge, Zhihua Wang, Zhonghao Zhao, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
An End-to-End Model for Speech-based Somatisation Disorder Detection. GCCE 2023: 603-605 - [c614]Luyu Chen, Lin Shen, Dan Yu, Zhihua Wang, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Multi-Track Music Generation with WGAN-GP and Attention Mechanisms. GCCE 2023: 606-607 - [c613]Minki Cho, Zhonghao Zhao, Zhihua Wang, Kun Qian, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Snore Sound Recognition via an Explainable Capsule Network. GCCE 2023: 1048-1049 - [c612]Yang Tan, Zhihua Wang, Kun Qian, Zhihao Bao, Zheyu Cao, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
AMNet: Introducing an Adaptive Mel-Spectrogram End-to-End Neural Network for Heart Sound Classification. HealthCom 2023: 90-94 - [c611]Jonah Anton, Harry Coppock, Pancham Shukla, Björn W. Schuller:
Audio Barlow Twins: Self-Supervised Audio Representation Learning. ICASSP 2023: 1-5 - [c610]Felix Burkhardt, Anna Derington, Matthias Kahlau, Klaus R. Scherer, Florian Eyben, Björn W. Schuller:
Masking Speech Contents by Random Splicing: is Emotional Expression Preserved? ICASSP 2023: 1-5 - [c609]Yi Chang, Zhao Ren, Thanh Tam Nguyen, Kun Qian, Björn W. Schuller:
Knowledge Transfer for on-Device Speech Emotion Recognition With Neural Structured Learning. ICASSP 2023: 1-5 - [c608]Najla D. Al Futaisi, Alejandrina Cristià, Björn W. Schuller:
Hearttoheart: The Arts of Infant Versus Adult-Directed Speech Classification. ICASSP 2023: 1-5 - [c607]Shuo Liu, Adria Mallol-Ragolta, Björn W. Schuller:
COVID-19 Detection from Speech in Noisy Conditions. ICASSP 2023: 1-5 - [c606]Zhao Ren, Thanh Tam Nguyen, Yi Chang, Björn W. Schuller:
Fast Yet Effective Speech Emotion Recognition with Self-Distillation. ICASSP 2023: 1-5 - [c605]Georgios Rizos, Rafael A. Calvo, Björn W. Schuller:
Positive-Pair Redundancy Reduction Regularisation for Speech-Based Asthma Diagnosis Prediction. ICASSP 2023: 1-5 - [c604]Meishu Song, Andreas Triantafyllopoulos, Zijiang Yang, Hiroki Takeuchi, Toru Nakamura, Akifumi Kishi, Tetsuro Ishizawa, Kazuhiro Yoshiuchi, Xin Jing, Vincent Karas, Zhonghao Zhao, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning Analysis. ICASSP 2023: 1-5 - [c603]Panagiotis Tzirakis, Alice Baird, Jeffrey A. Brooks, Christopher Gagne, Lauren Kim, Michael Opara, Christopher B. Gregory, Jacob Metrick, Garrett Boseck, Vineet Tiruvadi, Björn W. Schuller, Dacher Keltner, Alan Cowen:
Large-Scale Nonverbal Vocalization Detection Using Transformers. ICASSP 2023: 1-5 - [c602]Xinzhou Xu, Jun Deng, Zixing Zhang, Zhen Yang, Björn W. Schuller:
Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes. ICASSP 2023: 1-5 - [c601]Yongzi Yu, Wanyong Qiu, Chen Quan, Kun Qian, Zhihua Wang, Yu Ma, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Federated Intelligent Terminals Facilitate Stuttering Monitoring. ICASSP 2023: 1-5 - [c600]Ziping Zhao, Huan Wang, Haishuai Wang, Björn W. Schuller:
Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition. ICASSP 2023: 1-5 - [c599]Tomoya Koike, Zhihua Wang, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
An Investigation on Data Augmentation and Multiple Instance Learning for Diagnosis of COVID-19 from Speech and Cough Sound. ICCE-Taiwan 2023: 783-784 - [c598]Meishu Song, Zijiang Yang, Andreas Triantafyllopoulos, Toru Nakamura, Yongxin Zhang, Zhao Ren, Hiroki Takeuchi, Akifumi Kishi, Tetsuro Ishizawa, Kazuhiro Yoshiuchi, Haojie Zhang, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Crossmodal Transformer on Multi-Physical Signals for Personalised Daily Mental Health Prediction. ICDM (Workshops) 2023: 1299-1305 - [c597]Dewen Xu, Zhihua Wang, Tsuyoshi Kitajima, Toru Nakamura, Hiroko Shimura, Hiroki Takeuchi, Yang Tan, Runze Ge, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
An End-to-End Model for Mental Disorders Detection by Spontaneous Physical Activity Data. ICDM (Workshops) 2023: 1306-1312 - [c596]Mina A. Nessiem, Mostafa M. Amin, Björn W. Schuller:
SMILENets: Audio Representation Learning via Neural Knowledge Distillation of Traditional Audio-Feature Extractors. ICFSP 2023: 32-37 - [c595]Yu Ma, Yuting Huang, Kaixiang Yuan, Guangzhe Xuan, Yongzi Yu, Hengrui Zhong, Rui Li, Jian Shen, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
Explainable Stuttering Recognition Using Axial Attention. ICIC (3) 2023: 209-220 - [c594]Mohammad Ibrahim Malik, Siddique Latif, Raja Jurdak, Björn W. Schuller:
A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model. INTERSPEECH 2023: 646-650 - [c593]Monica González Machorro, Pascal Hecker, Uwe D. Reichel, Helly N. Hammer, Robert Hoepner, Lisa Pedrotti, Alisha Zmutt, Hesam Sagha, Johan van Beek, Florian Eyben, Dagmar M. Schuller, Björn W. Schuller, Bert Arnrich:
Towards Supporting an Early Diagnosis of Multiple Sclerosis using Vocal Features. INTERSPEECH 2023: 1518-1522 - [c592]Felix Burkhardt, Florian Eyben, Björn W. Schuller:
Nkululeko: Machine Learning Experiments on Speaker Characteristics Without Programming. INTERSPEECH 2023: 2010-2011 - [c591]Adria Mallol-Ragolta, Nils Urbach, Shuo Liu, Anton Batliner, Björn W. Schuller:
The MASCFLICHT Corpus: Face Mask Type and Coverage Area Recognition from Speech. INTERSPEECH 2023: 2358-2362 - [c590]Ziping Zhao, Tian Gao, Haishuai Wang, Björn W. Schuller:
SWRR: Feature Map Classifier Based on Sliding Window Attention and High-Response Feature Reuse for Multimodal Emotion Recognition. INTERSPEECH 2023: 2433-2437 - [c589]Anika A. Spiesberger, Andreas Triantafyllopoulos, Iosif Tsangko, Björn W. Schuller:
Abusive Speech Detection in Indic Languages Using Acoustic Features. INTERSPEECH 2023: 2683-2687 - [c588]Shahin Amiriparian, Lukas Christ, Regina Kushtanova, Maurice Gerczuk, Alexandra Teynor, Björn W. Schuller:
Speech-Based Classification of Defensive Communication: A Novel Dataset and Results. INTERSPEECH 2023: 2703-2707 - [c587]Alexander Kathan, Andreas Triantafyllopoulos, Shahin Amiriparian, Sabrina Milkus, Alexander Gebhard, Jonas Hohmann, Pauline Muderlak, Jürgen Schottdorf, Björn W. Schuller, Richard Musil:
The effect of clinical intervention on the speech of individuals with PTSD: features and recognition performances. INTERSPEECH 2023: 4139-4143 - [c586]Andreas Triantafyllopoulos, Alexander Gebhard, Alexander Kathan, Maurice Gerczuk, Shahin Amiriparian, Björn W. Schuller:
Analysis and automatic prediction of exertion from speech: Contrasting objective and subjective measures collected while running. INTERSPEECH 2023: 4144-4148 - [c585]Lukas Stappen, Jeremy Dillmann, Serena Striegel, Hans-Jörg Vögel, Nicolas Flores-Herr, Björn W. Schuller:
Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems. ITSC 2023: 5790-5797 - [c584]Arne Bruns, Anika A. Spiesberger, Andreas Triantafyllopoulos, Patric Müller, Björn W. Schuller:
"Do touch!" - 3D Scanning and Printing Technologies for the Haptic Representation of Cultural Assets: A Study with Blind Target Users. SUMAC @ ACM Multimedia 2023: 21-28 - [c583]Zheng Lian, Haiyang Sun, Licai Sun, Kang Chen, Mingyu Xu, Kexin Wang, Ke Xu, Yu He, Ying Li, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning. ACM Multimedia 2023: 9610-9614 - [c582]Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Alexander Barnhill, Maurice Gerczuk, Andreas Triantafyllopoulos, Alice E. Baird, Panagiotis Tzirakis, Chris Gagne, Alan S. Cowen, Nikola Lackovic, Marie-José Caraty, Claude Montacié:
The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests. ACM Multimedia 2023: 9635-9639 - [c581]Zheng Lian, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MRAC'23: 1st International Workshop on Multimodal and Responsible Affective Computing. ACM Multimedia 2023: 9713-9714 - [c580]Shahin Amiriparian, Lukas Christ, Andreas König, Alan Cowen, Eva-Maria Meßner, Erik Cambria, Björn W. Schuller:
MuSe 2023 Challenge: Multimodal Prediction of Mimicked Emotions, Cross-Cultural Humour, and Personalised Recognition of Affects. ACM Multimedia 2023: 9723-9725 - [c579]Zhao Ren, Kun Qian, Tanja Schultz, Björn W. Schuller:
An Overview of the ICASSP Special Session on AI Security and Privacy in Speech and Audio Processing. MMAsia (Workshops) 2023: 8:1-8:6 - [c578]Alexander Kathan, Shahin Amiriparian, Alexander Gebhard, Andreas Triantafyllopoulos, Maurice Gerczuk, Björn W. Schuller:
Personalised Speech-Based Heart Rate Categorisation Using Weighted-Instance Learning. MMSports@MM 2023: 9-13 - [c577]Lukas Christ, Shahin Amiriparian, Alice Baird, Alexander Kathan, Niklas Müller, Steffen Klug, Chris Gagne, Panagiotis Tzirakis, Lukas Stappen, Eva-Maria Meßner, Andreas König, Alan Cowen, Erik Cambria, Björn W. Schuller:
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation. MuSe@ACM Multimedia 2023: 1-10 - [c576]Gauri Deshpande, Björn W. Schuller, Pallavi Deshpande, Anuradha Rajiv Joshi, S. K. Oza, Sachin Patel:
Analysing Breathing Patterns in Reading and Spontaneous Speech. SPECOM (2) 2023: 3-17 - [e24]Guoying Zhao, Björn W. Schuller, Ehsan Adeli, Tingshao Zhu, Haoyu Chen:
Proceedings of IJCAI-2023 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2023) co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), Macau, China, August 21-22, 2023. CEUR Workshop Proceedings 3522, CEUR-WS.org 2023 [contents] - [e23]Shahin Amiriparian, Lukas Christ, Andreas König, Alan Cowen, Eva-Maria Meßner, Erik Cambria, Björn W. Schuller:
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, Humour and Personalisation, MuSe 2023, Ottawa, ON, Canada, 2 November 2023. ACM 2023 [contents] - [d2]Manuel Milling, Michelle Lienhart, Yuliia Oksymets, Alexander Gebhard, Manuel Brugger, Christoph Westerhausen, Björn W. Schuller:
NeuroCellCentreDB. Zenodo, 2023 - [i181]Björn W. Schuller, Shahin Amiriparian, Anton Batliner, Alexander Gebhard, Maurice Gerczuk, Vincent Karas, Alexander Kathan, Lennart Seizer, Johanna Löchner:
Computational Charisma - A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence. CoRR abs/2301.00142 (2023) - [i180]Zhao Ren, Yi Chang, Thanh Tam Nguyen, Yang Tan, Kun Qian, Björn W. Schuller:
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era. CoRR abs/2301.09362 (2023) - [i179]Andreas Triantafyllopoulos, Alexander Kathan, Alice Baird, Lukas Christ, Alexander Gebhard, Maurice Gerczuk, Vincent Karas, Tobias Hübner, Xin Jing, Shuo Liu, Adria Mallol-Ragolta, Manuel Milling, Sandra Ottl, Anastasia Semertzidou, Srividya Tirunellai Rajamani, Tianhao Yan, Zijiang Yang, Judith Dineley, Shahin Amiriparian, Katrin D. Bartl-Pokorny, Anton Batliner, Florian B. Pokorny, Björn W. Schuller:
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare. CoRR abs/2301.10477 (2023) - [i178]Hagen Wierstorf, Johannes Wagner, Florian Eyben, Felix Burkhardt, Björn W. Schuller:
audb - Sharing and Versioning of Audio and Annotation Data in Python. CoRR abs/2303.00645 (2023) - [i177]Mostafa M. Amin, Erik Cambria, Björn W. Schuller:
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT. CoRR abs/2303.03186 (2023) - [i176]Ziping Zhao, Huan Wang, Haishuai Wang, Björn W. Schuller:
hierarchical network with decoupled knowledge distillation for speech emotion recognition. CoRR abs/2303.05134 (2023) - [i175]Zheng Lian, Haiyang Sun, Licai Sun, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning. CoRR abs/2304.08981 (2023) - [i174]Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Alexander Barnhill, Maurice Gerczuk, Andreas Triantafyllopoulos, Alice Baird, Panagiotis Tzirakis, Chris Gagne, Alan S. Cowen, Nikola Lackovic, Marie-José Caraty, Claude Montacié:
The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests. CoRR abs/2304.14882 (2023) - [i173]Lukas Christ, Shahin Amiriparian, Alice Baird, Alexander Kathan, Niklas Müller, Steffen Klug, Chris Gagne, Panagiotis Tzirakis, Eva-Maria Meßner, Andreas König, Alan Cowen, Erik Cambria, Björn W. Schuller:
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation. CoRR abs/2305.03369 (2023) - [i172]Ibrahim Malik, Siddique Latif, Raja Jurdak, Björn W. Schuller:
A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model. CoRR abs/2305.11413 (2023) - [i171]Xin Jing, Yi Chang, Zijiang Yang, Jiangjian Xie, Andreas Triantafyllopoulos, Björn W. Schuller:
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech. CoRR abs/2305.13195 (2023) - [i170]Aljoscha Düsterhöft, Felix Burkhardt, Björn W. Schuller:
Happy or Evil Laughter? Analysing a Database of Natural Audio Samples. CoRR abs/2305.14023 (2023) - [i169]Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn W. Schuller:
Improving Speech Emotion Recognition Performance using Differentiable Architecture Search. CoRR abs/2305.14402 (2023) - [i168]Lukas Stappen, Jeremy Dillmann, Serena Striegel, Hans-Jörg Vögel, Nicolas Flores-Herr, Björn W. Schuller:
Integrating Generative Artificial Intelligence in Intelligent Vehicle Systems. CoRR abs/2305.17137 (2023) - [i167]Felix Burkhardt, Johannes Wagner, Hagen Wierstorf, Florian Eyben, Björn W. Schuller:
Speech-based Age and Gender Prediction with Transformers. CoRR abs/2306.16962 (2023) - [i166]Felix Burkhardt, Uwe D. Reichel, Florian Eyben, Björn W. Schuller:
Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions. CoRR abs/2307.02132 (2023) - [i165]Mostafa M. Amin, Erik Cambria, Björn W. Schuller:
Can ChatGPT's Responses Boost Traditional Natural Language Processing? CoRR abs/2307.04648 (2023) - [i164]Siddique Latif, Muhammad Usama, Mohammad Ibrahim Malik, Björn W. Schuller:
Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers. CoRR abs/2307.06090 (2023) - [i163]Zixing Zhang, Liyizhe Peng, Tao Pang, Jing Han, Huan Zhao, Björn W. Schuller:
Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models. CoRR abs/2308.11578 (2023) - [i162]Yuezhou Zhang, Amos A. Folarin, Judith Dineley, Pauline Conde, Valeria de Angel, Shaoxiong Sun, Yatharth Ranjan, Zulqarnain Rashid, Callum L. Stewart, Petroula Laiou, Heet Sankesara, Linglong Qian, Faith Matcham, Katie M. White, Carolin Oetzmann, Femke Lamers, Sara Siddi, Sara Simblett, Björn W. Schuller, Srinivasan Vairavan, Til Wykes, Josep Maria Haro, Brenda W. J. H. Penninx, Vaibhav A. Narayan, Matthew Hotopf, Richard J. B. Dobson, Nicholas Cummins, RADAR-CNS Consortium:
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model. CoRR abs/2308.11773 (2023) - [i161]Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller:
Sparks of Large Audio Models: A Survey and Outlook. CoRR abs/2308.12792 (2023) - [i160]Mostafa M. Amin, Rui Mao, Erik Cambria, Björn W. Schuller:
A Wide Evaluation of ChatGPT on Affective Computing Tasks. CoRR abs/2308.13911 (2023) - [i159]Alexander Gebhard, Andreas Triantafyllopoulos, Teresa Bez, Lukas Christ, Alexander Kathan, Björn W. Schuller:
Exploring Meta Information for Audio-based Zero-shot Bird Classification. CoRR abs/2309.08398 (2023) - [i158]Xiangheng He, Junjie Chen, Björn W. Schuller:
Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits. CoRR abs/2309.09832 (2023) - [i157]Chia-Hsin Lin, Charles Jones, Björn W. Schuller, Harry Coppock:
Synthia's Melody: A Benchmark Framework for Unsupervised Domain Adaptation in Audio. CoRR abs/2309.15024 (2023) - [i156]Manuel Milling, Andreas Triantafyllopoulos, Iosif Tsangko, Simon David Noel Rampp, Björn Wolfgang Schuller:
Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification. CoRR abs/2309.16369 (2023) - [i155]Liyizhe Peng, Zixing Zhang, Tao Pang, Jing Han, Huan Zhao, Hao Chen, Björn W. Schuller:
Customising General Large Language Models for Specialised Emotion Recognition Tasks. CoRR abs/2310.14225 (2023) - [i154]Anna Derington, Hagen Wierstorf, Ali Özkil, Florian Eyben, Felix Burkhardt, Björn W. Schuller:
Testing Speech Emotion Recognition Machine Learning Models. CoRR abs/2312.06270 (2023) - 2022
- [j211]Josef Schmid, Alfred Höß, Björn W. Schuller:
A Survey on Client Throughput Prediction Algorithms in Wired and Wireless Networks. ACM Comput. Surv. 54(9): 194:1-194:33 (2022) - [j210]Johanna Löchner, Björn W. Schuller:
Child and Youth Affective Computing - Challenge Accepted. IEEE Intell. Syst. 37(6): 69-76 (2022) - [j209]Iulia Lefter, Alice Baird, Lukas Stappen, Björn W. Schuller:
A Cross-Corpus Speech-Based Analysis of Escalating Negative Interactions. Frontiers Comput. Sci. 4: 749804 (2022) - [j208]Adria Mallol-Ragolta, Anastasia Semertzidou, Maria Pateraki, Björn W. Schuller:
Outer Product-Based Fusion of Smartwatch Sensor Data for Human Activity Recognition. Frontiers Comput. Sci. 4: 796866 (2022) - [j207]Manuel Milling, Alice Baird, Katrin D. Bartl-Pokorny, Shuo Liu, Alyssa M. Alcorn, Jie Shen, Teresa Tavassoli, Eloise Ainger, Elizabeth Pellicano, Maja Pantic, Nicholas Cummins, Björn W. Schuller:
Evaluating the Impact of Voice Activity Detection on Speech Emotion Recognition for Autistic Children. Frontiers Comput. Sci. 4 (2022) - [j206]Lukas Stappen, Alice Baird, Michelle Lienhart, Annalena Bätz, Björn W. Schuller:
An Estimation of Online Video User Engagement From Features of Time- and Value-Continuous, Dimensional Emotions. Frontiers Comput. Sci. 4: 773154 (2022) - [j205]Alican Akman, Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Lyn Jones, Björn W. Schuller:
Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 From Audio Challenges. Frontiers Digit. Health 4 (2022) - [j204]Pascal Hecker, Nico Steckhan, Florian Eyben, Björn W. Schuller, Bert Arnrich:
Voice Analysis for Neurological Disorder Recognition-A Systematic Review and Perspective on Emerging Trends. Frontiers Digit. Health 4 (2022) - [j203]Alexander Kathan, Mathias Harrer, Ludwig Küster, Andreas Triantafyllopoulos, Xiangheng He, Manuel Milling, Maurice Gerczuk, Tianhao Yan, Srividya Tirunellai Rajamani, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller:
Personalised depression forecasting using mobile sensor data and ecological momentary assessment. Frontiers Digit. Health 4 (2022) - [j202]Manuel Milling, Florian B. Pokorny, Katrin D. Bartl-Pokorny, Björn W. Schuller:
Is Speech the New Blood? Recent Progress in AI-Based Disease Detection From Audio in a Nutshell. Frontiers Digit. Health 4: 886615 (2022) - [j201]Yash Mehta, Clemens Stachl, Konstantin Markov, Joseph T. Yun, Björn W. Schuller:
Future-generation personality prediction from digital footprints. Future Gener. Comput. Syst. 136: 322-325 (2022) - [j200]Shahin Amiriparian, Tobias Hübner, Vincent Karas, Maurice Gerczuk, Sandra Ottl, Björn W. Schuller:
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data. Frontiers Artif. Intell. 5: 856232 (2022) - [j199]Emilia Parada-Cabaleiro, Anton Batliner, Alice Baird, Björn W. Schuller:
Correction to: The perception of emotional cues by children in artificial background noise. Int. J. Speech Technol. 25(1): 289 (2022) - [j198]Björn W. Schuller, Yonina C. Eldar, Maja Pantic, Shrikanth Narayanan, Tuomas Virtanen, Jianhua Tao:
Editorial: Intelligent Signal Analysis for Contagious Virus Diseases. IEEE J. Sel. Top. Signal Process. 16(2): 159-163 (2022) - [j197]Liang Zhang, Johann Li, Ping Li, Xiaoyuan Lu, Maoguo Gong, Peiyi Shen, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Kun Qian, Björn W. Schuller:
MEDAS: an open-source platform as a service to help break the walls between medicine and informatics. Neural Comput. Appl. 34(8): 6547-6567 (2022) - [j196]Sicheng Zhao, Xingxu Yao, Jufeng Yang, Guoli Jia, Guiguang Ding, Tat-Seng Chua, Björn W. Schuller, Kurt Keutzer:
Affective Image Content Analysis: Two Decades Review and New Perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6729-6751 (2022) - [j195]Shuo Liu, Adria Mallol-Ragolta, Emilia Parada-Cabaleiro, Kun Qian, Xin Jing, Alexander Kathan, Bin Hu, Björn W. Schuller:
Audio self-supervised learning: A survey. Patterns 3(12): 100616 (2022) - [j194]Gauri Deshpande, Anton Batliner, Björn W. Schuller:
AI-Based human audio processing for COVID-19: A comprehensive overview. Pattern Recognit. 122: 108289 (2022) - [j193]Mostafa M. Mohamed, Mina A. Nessiem, Anton Batliner, Christian Bergler, Simone Hantke, Maximilian Schmitt, Alice Baird, Adria Mallol-Ragolta, Vincent Karas, Shahin Amiriparian, Björn W. Schuller:
Face mask recognition from audio: The MASC database and an overview on the mask challenge. Pattern Recognit. 122: 108361 (2022) - [j192]Shuo Liu, Jing Han, Estela Laporta Puyal, Spyridon Kontaxis, Shaoxiong Sun, Patrick Locatelli, Judith Dineley, Florian B. Pokorny, Gloria Dalla Costa, Letizia Leocani, Ana Isabel Guerrero, Carlos Nos, Ana Zabalza, Per Soelberg Sørensen, Mathias Buron, Melinda Magyari, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Callum L. Stewart, Amos A. Folarin, Richard J. B. Dobson, Raquel Bailón, Srinivasan Vairavan, Nicholas Cummins, Vaibhav A. Narayan, Matthew Hotopf, Giancarlo Comi, Björn W. Schuller, RADAR-CNS Consortium:
Fitbeat: COVID-19 estimation based on wristband heart rate using a contrastive convolutional auto-encoder. Pattern Recognit. 123: 108403 (2022) - [j191]Yue Zhang, Felix Weninger, Björn W. Schuller, Rosalind W. Picard:
Holistic Affect Recognition Using PaNDA: Paralinguistic Non-Metric Dimensional Analysis. IEEE Trans. Affect. Comput. 13(2): 769-780 (2022) - [j190]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Julien Epps, Björn W. Schuller:
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition. IEEE Trans. Affect. Comput. 13(2): 992-1004 (2022) - [j189]Anton Batliner, Simone Hantke, Björn W. Schuller:
Ethics and Good Practice in Computational Paralinguistics. IEEE Trans. Affect. Comput. 13(3): 1236-1253 (2022) - [j188]Cheng Lu, Yuan Zong, Wenming Zheng, Yang Li, Chuangao Tang, Björn W. Schuller:
Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2217-2230 (2022) - [j187]Björn W. Schuller, Johanna Löchner, Kun Qian, Bin Hu:
COVID-19's Impact on Mental Health - The Hour of Computational Aid? IEEE Trans. Comput. Soc. Syst. 9(4): 967-973 (2022) - [j186]Bin Hu, Kun Qian, Qunxi Dong, Yuejia Luo, Yoshiharu Yamamoto, Björn W. Schuller:
Psychological Field Versus Physiological Field: From Qualitative Analysis to Quantitative Modeling of the Mental Status. IEEE Trans. Comput. Soc. Syst. 9(5): 1275-1281 (2022) - [j185]Xinzhou Xu, Jun Deng, Zixing Zhang, Xijian Fan, Li Zhao, Laurence Devillers, Björn W. Schuller:
Rethinking Auditory Affective Descriptors Through Zero-Shot Emotion Recognition in Speech. IEEE Trans. Comput. Soc. Syst. 9(5): 1530-1541 (2022) - [j184]Björn W. Schuller, Johanna Löchner, Kun Qian, Bin Hu:
Digital Mental Health - Breaking a Lance for Prevention. IEEE Trans. Comput. Soc. Syst. 9(6): 1584-1588 (2022) - [j183]Mingyue Niu, Ziping Zhao, Jianhua Tao, Ya Li, Björn W. Schuller:
Selective Element and Two Orders Vectorization Networks for Automatic Depression Severity Diagnosis via Facial Changes. IEEE Trans. Circuits Syst. Video Technol. 32(11): 8065-8077 (2022) - [j182]Shuo Liu, Adria Mallol-Ragolta, Tianhao Yan, Kun Qian, Emilia Parada-Cabaleiro, Bin Hu, Björn W. Schuller:
Capturing Time Dynamics From Speech Using Neural Networks for Surgical Mask Detection. IEEE J. Biomed. Health Informatics 26(8): 4291-4302 (2022) - [j181]Kun Qian, Tomoya Koike, Toru Nakamura, Björn W. Schuller, Yoshiharu Yamamoto:
Learning Multimodal Representations for Drowsiness Detection. IEEE Trans. Intell. Transp. Syst. 23(8): 11539-11548 (2022) - [j180]Xinzhou Xu, Jun Deng, Nicholas Cummins, Zixing Zhang, Li Zhao, Björn W. Schuller:
Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes. IEEE Trans. Multim. 24: 2752-2765 (2022) - [j179]Shiping Wen, Tingwen Huang, Björn W. Schuller, Ahmad Taher Azar:
Guest Editorial: Introduction to the Special Section on Efficient Network Design for Convergence of Deep Learning and Edge Computing. IEEE Trans. Netw. Sci. Eng. 9(1): 109-110 (2022) - [c575]Alice Baird, Panagiotis Tzirakis, Jeffrey A. Brooks, Christopher B. Gregory, Björn W. Schuller, Anton Batliner, Dacher Keltner, Alan Cowen:
The ACII 2022 Affective Vocal Bursts Workshop & Competition. ACIIW 2022: 1-5 - [c574]Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Jiajun Liu, Björn W. Schuller:
A Novel Policy for Pre-trained Deep Reinforcement Learning for Speech Emotion Recognition. ACSW 2022: 96-105 - [c573]Xin Jing, Shuo Liu, Emilia Parada-Cabaleiro, Andreas Triantafyllopoulos, Meishu Song, Zijiang Yang, Björn W. Schuller:
A Temporal-oriented Broadcast ResNet for COVID-19 Detection. BHI 2022: 1-5 - [c572]Adria Mallol-Ragolta, Shuo Liu, Björn W. Schuller:
COVID-19 Detection Exploiting Self-Supervised Learning Representations of Respiratory Sounds. BHI 2022: 1-4 - [c571]Zheng Gu, Xinzhou Xu, Shuo Liu, Björn W. Schuller:
Zero-Shot Audio Classification Using Synthesised Classifiers and Pre-Trained Models. CISP-BMEI 2022: 1-6 - [c570]Vincent Karas, Mani Kumar Tellamekala, Adria Mallol-Ragolta, Michel F. Valstar, Björn W. Schuller:
Time-Continuous Audiovisual Fusion with Recurrence vs Attention for In-The-Wild Affect Recognition. CVPR Workshops 2022: 2381-2390 - [c569]Chao Li, Yong Sheng, Haishuai Wang, Mingyue Niu, Peiguang Jing, Ziping Zhao, Björn W. Schuller:
EEG Emotion Recognition Based on Self-attention Dynamic Graph Neural Networks. EMBC 2022: 292-296 - [c568]Adria Mallol-Ragolta, Florian B. Pokorny, Katrin D. Bartl-Pokorny, Anastasia Semertzidou, Björn W. Schuller:
Triplet Loss-Based Models for COVID-19 Detection from Vocal Sounds. EMBC 2022: 998-1001 - [c567]Wanyong Qiu, Kun Qian, Zhihua Wang, Yi Chang, Zhihao Bao, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto:
A Federated Learning Paradigm for Heart Sound Classification. EMBC 2022: 1045-1048 - [c566]Anne Wullenweber, Alican Akman, Björn W. Schuller:
CoughLIME: Sonified Explanations for the Predictions of COVID-19 Cough Classifiers. EMBC 2022: 1342-1345 - [c565]Srividya Tirunellai Rajamani, Kumar Rajamani, Priya Rani, Rashmita Barick, Ramasubramanya M. S, Sridevi V. Aithal, Elagiri Ramalingam Rajkumar, Sahana D. Gowda, Björn W. Schuller:
Novel no-reference multi-dimensional perceptual similarity metric. EMBC 2022: 2045-2048 - [c564]Srividya Tirunellai Rajamani, Kumar Rajamani, Alexander Kathan, Björn W. Schuller:
Novel Insights on Induced Sparsity in Multi-Time Attention Networks. EMBC 2022: 2615-2618 - [c563]Andreas Triantafyllopoulos, Sandra Zänkert, Alice Baird, Julian Konzok, Brigitte M. Kudielka, Björn W. Schuller:
Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features. EMBC 2022: 2619-2622 - [c562]Andreas Triantafyllopoulos, Sandra Ottl, Alexander Gebhard, Esther Rituerto-González, Mirko Jaumann, Steffen Hüttner, Valerie Dieter, Patrick Schneeweiß, Inga Krauß, Maurice Gerczuk, Shahin Amiriparian, Björn W. Schuller:
Fatigue Prediction in Outdoor Running Conditions using Audio Data. EMBC 2022: 2623-2626 - [c561]Alexander Kathan, Andreas Triantafyllopoulos, Xiangheng He, Manuel Milling, Tianhao Yan, Srividya Tirunellai Rajamani, Ludwig Küster, Mathias Harrer, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller:
Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting. EMBC 2022: 2627-2630 - [c560]Lixian Zhu, Kun Qian, Zhihua Wang, Bin Hu, Yoshiharu Yamamoto, Björn W. Schuller:
Heart Sound Classification based on Residual Shrinkage Networks. EMBC 2022: 4469-4472 - [c559]Xiangheng He, Andreas Triantafyllopoulos, Alexander Kathan, Manuel Milling, Tianhao Yan, Srividya Tirunellai Rajamani, Ludwig Küster, Mathias Harrer, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller:
Depression Diagnosis and Forecast based on Mobile Phone Sensor Data. EMBC 2022: 4679-4682 - [c558]Zishen Li, Yi Chang, Björn W. Schuller:
CNN-Based Heart Sound Classification with an Imbalance-Compensating Weighted Loss Function. EMBC 2022: 4934-4937 - [c557]Kun Qian, Tanja Schultz, Björn W. Schuller:
An Overview of the FIRST ICASSP Special Session on Computer Audition for Healthcare. ICASSP 2022: 9002-9006 - [c556]Shuai Yu, Yiwei Ding, Kun Qian, Bin Hu, Wei Li, Björn W. Schuller:
A Glance-and-Gaze Network for Respiratory Sound Classification. ICASSP 2022: 9007-9011 - [c555]Tianhao Yan, Hao Meng, Shuo Liu, Emilia Parada-Cabaleiro, Zhao Ren, Björn W. Schuller:
Convoluational Transformer With Adaptive Position Embedding For Covid-19 Detection From Cough Sounds. ICASSP 2022: 9092-9096 - [c554]Pascal Hecker, Arpita Kappattanavar, Maximilian Schmitt, Sidratul Moontaha, Johannes Wagner, Florian Eyben, Björn W. Schuller, Bert Arnrich:
Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation. ICMLA 2022: 337-344 - [c553]Andreas Triantafyllopoulos, Johannes Wagner, Hagen Wierstorf, Maximilian Schmitt, Uwe D. Reichel, Florian Eyben, Felix Burkhardt, Björn W. Schuller:
Probing speech emotion recognition transformers for linguistic knowledge. INTERSPEECH 2022: 146-150 - [c552]Jiaming Cheng, Ruiyu Liang, Yue Xie, Li Zhao, Björn W. Schuller, Jie Jia, Yiyuan Peng:
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement. INTERSPEECH 2022: 926-930 - [c551]Rodrigo Schoburg Carrillo de Mira, Alexandros Haliassos, Stavros Petridis, Björn W. Schuller, Maja Pantic:
SVTS: Scalable Video-to-Speech Synthesis. INTERSPEECH 2022: 1836-1840 - [c550]Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller:
Multi-Type Outer Product-Based Fusion of Respiratory Sounds for Detecting COVID-19. INTERSPEECH 2022: 2163-2167 - [c549]Dominika Woszczyk, Anna Hlédiková, Alican Akman, Soteris Demetriou, Björn W. Schuller:
Data Augmentation for Dementia Detection in Spoken Language. INTERSPEECH 2022: 2858-2862 - [c548]Andreas Triantafyllopoulos, Markus Fendler, Anton Batliner, Maurice Gerczuk, Shahin Amiriparian, Thomas M. Berghaus, Björn W. Schuller:
Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease. INTERSPEECH 2022: 3623-3627 - [c547]Yi Chang, Zhao Ren, Thanh Tam Nguyen, Wolfgang Nejdl, Björn W. Schuller:
Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis. INTERSPEECH 2022: 4003-4007 - [c546]Zijiang Yang, Xin Jing, Andreas Triantafyllopoulos, Meishu Song, Ilhan Aslan, Björn W. Schuller:
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion. INTERSPEECH 2022: 4915-4919 - [c545]Rui Liu, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li:
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. INTERSPEECH 2022: 5493-5497 - [c544]Manuel Milling, Ilhan Aslan, Moritz Berghofer, Adria Mallol-Ragolta, Utkarsh Kunwar, Björn Wolfgang Schuller:
Online Personalisation of Deep Mobile Activity Recognisers. iWOAR 2022: 11:1-11:7 - [c543]Yang Tan, Zhihua Wang, Kun Qian, Bin Hu, Shiliang Zhao, Björn W. Schuller, Yoshiharu Yamamoto:
Heart Sound Classification based on Fractional Fourier Transformation Entropy. LifeTech 2022: 588-589 - [c542]Felix Burkhardt, Anabell Hacker, Uwe D. Reichel, Hagen Wierstorf, Florian Eyben, Björn W. Schuller:
A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning. LREC 2022: 1917-1924 - [c541]Felix Burkhardt, Johannes Wagner, Hagen Wierstorf, Florian Eyben, Björn W. Schuller:
Nkululeko: A Tool For Rapid Speaker Characteristics Detection. LREC 2022: 1925-1932 - [c540]Lukas Christ, Shahin Amiriparian, Alice Baird, Panagiotis Tzirakis, Alexander Kathan, Niklas Müller, Lukas Stappen, Eva-Maria Meßner, Andreas König, Alan Cowen, Erik Cambria, Björn W. Schuller:
The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress. MuSe @ ACM Multimedia 2022: 5-14 - [c539]Alexander Gebhard, Andreas Triantafyllopoulos, Shahin Amiriparian, Sandra Ottl, Valerie Dieter, Maurice Gerczuk, Mirko Jaumann, David Hildner, Patrick Schneeweiß, Inka Rösel, Inga Krauß, Björn W. Schuller:
Improving Exertion and Wellbeing Prediction in Outdoor Running Conditions using Audio-based Surface Recognition. MMSports@MM 2022: 19-27 - [c538]Alexander Kathan, Shahin Amiriparian, Lukas Christ, Andreas Triantafyllopoulos, Niklas Müller, Andreas König, Björn W. Schuller:
A Personalised Approach to Audiovisual Humour Recognition and its Individual-level Fairness. MuSe @ ACM Multimedia 2022: 29-36 - [c537]Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian P. Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Marianne Sinka, Stephen J. Roberts:
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes. ACM Multimedia 2022: 7120-7124 - [c536]Shahin Amiriparian, Lukas Christ, Andreas König, Eva-Maria Meßner, Alan Cowen, Erik Cambria, Björn W. Schuller:
MuSe 2022 Challenge: Multimodal Humour, Emotional Reactions, and Stress. ACM Multimedia 2022: 7389-7391 - [p13]Dagmar M. Schuller, Björn W. Schuller:
Ist Stimme das neue Blut? KI und Stimmbiomarker zu früheren Diagnose - für jedermann, überall und jederzeit. Künstliche Intelligenz im Gesundheitswesen 2022: 565-579 - [e22]Shahin Amiriparian, Lukas Christ, Andreas König, Alan Cowen, Eva-Maria Meßner, Erik Cambria, Björn W. Schuller:
MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, Lisboa, Portugal, 10 October 2022. ACM 2022, ISBN 978-1-4503-9484-0 [contents] - [d1]Andreas Triantafyllopoulos, Anastasia Semertzidou, Meishu Song, Florian B. Pokorny, Björn W. Schuller:
Introducing the COVID-19 YouTube (COVYT) speech dataset featuring the same speakers with and without infection. Zenodo, 2022 - [i153]Toby Godwin, Georgios Rizos, Alice Baird, Najla D. Al Futaisi, Vincent Brisse, Björn W. Schuller:
Evaluating Deep Music Generation Methods Using Data Augmentation. CoRR abs/2201.00052 (2022) - [i152]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. CoRR abs/2201.03967 (2022) - [i151]Mostafa M. Mohamed, Björn W. Schuller:
Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems. CoRR abs/2202.00993 (2022) - [i150]Harry Coppock, Alican Akman, Christian Bergler, Maurice Gerczuk, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Jing Han, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Panagiotis Tzirakis, Anton Batliner, Cecilia Mascolo, Björn W. Schuller:
A Summary of the ComParE COVID-19 Challenges. CoRR abs/2202.08981 (2022) - [i149]Lukas Stappen, Manuel Milling, Valentin Munst, Korakot Hoffmann, Björn W. Schuller:
Predicting Sex and Stroke Success - Computer-aided Player Grunt Analysis in Tennis Matches. CoRR abs/2202.09102 (2022) - [i148]Shuo Liu, Adria Mallol-Ragolta, Emilia Parada-Cabaleiro, Kun Qian, Xin Jing, Alexander Kathan, Bin Hu, Björn W. Schuller:
Audio Self-supervised Learning: A Survey. CoRR abs/2203.01205 (2022) - [i147]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR 2021: Holistic Evaluation of Audio Representations. CoRR abs/2203.03022 (2022) - [i146]Yi Chang, Sofiane Laridi, Zhao Ren, Gregory Palmer, Björn W. Schuller, Marco Fisichella:
Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition. CoRR abs/2203.04696 (2022) - [i145]Björn W. Schuller, Alican Akman, Yi Chang, Harry Coppock, Alexander Gebhard, Alexander Kathan, Esther Rituerto-González, Andreas Triantafyllopoulos, Florian B. Pokorny:
Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet. CoRR abs/2203.06064 (2022) - [i144]Johannes Wagner, Andreas Triantafyllopoulos, Hagen Wierstorf, Maximilian Schmitt, Felix Burkhardt, Florian Eyben, Björn W. Schuller:
Dawn of the transformer era in speech emotion recognition: closing the valence gap. CoRR abs/2203.07378 (2022) - [i143]Björn W. Schuller, Dagmar M. Schuller:
Audiovisual Affect Assessment and Autonomous Automobiles: Applications. CoRR abs/2203.07482 (2022) - [i142]Vincent Karas, Mani Kumar Tellamekala, Adria Mallol-Ragolta, Michel F. Valstar, Björn W. Schuller:
Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition. CoRR abs/2203.13285 (2022) - [i141]Zijiang Yang, Xin Jing, Andreas Triantafyllopoulos, Meishu Song, Ilhan Aslan, Björn W. Schuller:
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion. CoRR abs/2203.15873 (2022) - [i140]Yi Chang, Zhao Ren, Thanh Tam Nguyen, Wolfgang Nejdl, Björn W. Schuller:
Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis. CoRR abs/2203.16141 (2022) - [i139]Xin Jing, Shuo Liu, Emilia Parada-Cabaleiro, Andreas Triantafyllopoulos, Meishu Song, Zijiang Yang, Björn W. Schuller:
A Temporal-oriented Broadcast ResNet for COVID-19 Detection. CoRR abs/2203.17012 (2022) - [i138]Andreas Triantafyllopoulos, Johannes Wagner, Hagen Wierstorf, Maximilian Schmitt, Uwe D. Reichel, Florian Eyben, Felix Burkhardt, Björn W. Schuller:
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge. CoRR abs/2204.00400 (2022) - [i137]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition. CoRR abs/2204.08625 (2022) - [i136]Alice Baird, Panagiotis Tzirakis, Gauthier Gidel, Marco Jiralerspong, Eilif B. Muller, Kory W. Mathewson, Björn W. Schuller, Erik Cambria, Dacher Keltner, Alan Cowen:
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts. CoRR abs/2205.01780 (2022) - [i135]Rodrigo Mira, Alexandros Haliassos, Stavros Petridis, Björn W. Schuller, Maja Pantic:
SVTS: Scalable Video-to-Speech Synthesis. CoRR abs/2205.02058 (2022) - [i134]Alexander Kathan, Andreas Triantafyllopoulos, Xiangheng He, Manuel Milling, Tianhao Yan, Srividya Tirunellai Rajamani, Ludwig Küster, Mathias Harrer, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller:
Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting. CoRR abs/2205.03391 (2022) - [i133]Andreas Triantafyllopoulos, Sandra Zänkert, Alice Baird, Julian Konzok, Brigitte M. Kudielka, Björn W. Schuller:
Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features. CoRR abs/2205.04328 (2022) - [i132]Andreas Triantafyllopoulos, Sandra Ottl, Alexander Gebhard, Esther Rituerto-González, Mirko Jaumann, Steffen Hüttner, Valerie Dieter, Patrick Schneeweiß, Inga Krauß, Maurice Gerczuk, Shahin Amiriparian, Björn W. Schuller:
Fatigue Prediction in Outdoor Running Conditions using Audio Data. CoRR abs/2205.04343 (2022) - [i131]Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian P. Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Marianne Sinka, Stephen J. Roberts:
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes. CoRR abs/2205.06799 (2022) - [i130]Xiangheng He, Andreas Triantafyllopoulos, Alexander Kathan, Manuel Milling, Tianhao Yan, Srividya Tirunellai Rajamani, Ludwig Küster, Mathias Harrer, Elena Heber, Inga Grossmann, David D. Ebert, Björn W. Schuller:
Depression Diagnosis and Forecast based on Mobile Phone Sensor Data. CoRR abs/2205.07861 (2022) - [i129]Mani Kumar Tellamekala, Shahin Amiriparian, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel F. Valstar:
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition. CoRR abs/2206.05833 (2022) - [i128]Andreas Triantafyllopoulos, Meishu Song, Zijiang Yang, Xin Jing, Björn W. Schuller:
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction. CoRR abs/2206.06680 (2022) - [i127]Rui Liu, Berrak Sisman, Björn W. Schuller, Guanglai Gao, Haizhou Li:
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning. CoRR abs/2206.07229 (2022) - [i126]Xin Jing, Meishu Song, Andreas Triantafyllopoulos, Zijiang Yang, Björn W. Schuller:
Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression. CoRR abs/2206.09142 (2022) - [i125]Andreas Triantafyllopoulos, Anastasia Semertzidou, Meishu Song, Florian B. Pokorny, Björn W. Schuller:
COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection. CoRR abs/2206.11045 (2022) - [i124]Meishu Song, Zijiang Yang, Andreas Triantafyllopoulos, Xin Jing, Vincent Karas, Jiangjian Xie, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller:
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression. CoRR abs/2206.11049 (2022) - [i123]Anna Hlédiková, Dominika Woszczyk, Alican Akman, Soteris Demetriou, Björn W. Schuller:
Data Augmentation for Dementia Detection in Spoken Language. CoRR abs/2206.12879 (2022) - [i122]Mani Kumar Tellamekala, Ömer Sümer, Björn W. Schuller, Elisabeth André, Timo Giesbrecht, Michel F. Valstar:
Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities? CoRR abs/2207.01113 (2022) - [i121]Alice Baird, Panagiotis Tzirakis, Jeffrey A. Brooks, Christopher B. Gregory, Björn W. Schuller, Anton Batliner, Dacher Keltner, Alan Cowen:
The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression. CoRR abs/2207.03572 (2022) - [i120]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition. CoRR abs/2207.05298 (2022) - [i119]Lukas Christ, Shahin Amiriparian, Alice Baird, Panagiotis Tzirakis, Alexander Kathan, Niklas Müller, Lukas Stappen, Eva-Maria Meßner, Andreas König, Alan Cowen, Erik Cambria, Björn W. Schuller:
The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress. CoRR abs/2207.05691 (2022) - [i118]Alice Baird, Panagiotis Tzirakis, Gauthier Gidel, Marco Jiralerspong, Eilif B. Muller, Kory W. Mathewson, Björn W. Schuller, Erik Cambria, Dacher Keltner, Alan Cowen:
Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts. CoRR abs/2207.06958 (2022) - [i117]Andreas Triantafyllopoulos, Markus Fendler, Anton Batliner, Maurice Gerczuk, Shahin Amiriparian, Thomas M. Berghaus, Björn W. Schuller:
Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease. CoRR abs/2207.12784 (2022) - [i116]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis with Mixed Emotions. CoRR abs/2208.05890 (2022) - [i115]Vincent Karas, Andreas Triantafyllopoulos, Meishu Song, Björn W. Schuller:
Self-Supervised Attention Networks and Uncertainty Loss Weighting for Multi-Task Emotion Recognition on Vocal Bursts. CoRR abs/2209.07384 (2022) - [i114]Lukas Christ, Shahin Amiriparian, Alexander Kathan, Niklas Müller, Andreas König, Björn W. Schuller:
Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results. CoRR abs/2209.14272 (2022) - [i113]Jonah Anton, Harry Coppock, Pancham Shukla, Björn W. Schuller:
Audio Barlow Twins: Self-Supervised Audio Representation Learning. CoRR abs/2209.14345 (2022) - [i112]Andreas Triantafyllopoulos, Björn W. Schuller, Gökçe Iymen, Tevfik Metin Sezgin, Xiangheng He, Zijiang Yang, Panagiotis Tzirakis, Shuo Liu, Silvan Mertes, Elisabeth André, Ruibo Fu, Jianhua Tao:
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era. CoRR abs/2210.03538 (2022) - [i111]Georgios Rizos, Jenna Lawson, Simon Mitchell, Pranay Shah, Xin Wen, Cristina Banks-Leite, Robert M. Ewers, Björn W. Schuller:
Propagating Variational Model Uncertainty for Bioacoustic Call Label Smoothing. CoRR abs/2210.10526 (2022) - [i110]Zhao Ren, Thanh Tam Nguyen, Yi Chang, Björn W. Schuller:
Fast Yet Effective Speech Emotion Recognition with Self-distillation. CoRR abs/2210.14636 (2022) - [i109]Yi Chang, Zhao Ren, Thanh Tam Nguyen, Kun Qian, Björn W. Schuller:
Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning. CoRR abs/2210.14977 (2022) - [i108]Alice Baird, Panagiotis Tzirakis, Jeffrey A. Brooks, Christopher B. Gregory, Björn W. Schuller, Anton Batliner, Dacher Keltner, Alan Cowen:
Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression. CoRR abs/2210.15754 (2022) - [i107]Siddique Latif, Hafiz Shehbaz Ali, Muhammad Usama, Rajib Rana, Björn W. Schuller, Junaid Qadir:
AI-Based Emotion Recognition: Promise, Peril, and Prescriptions for Prosocial Path. CoRR abs/2211.07290 (2022) - [i106]Jobie Budd, Kieran Baker, Emma Karoune, Harry Coppock, Selina Patel, Ana Tendero Cañadas, Alexander Titcomb, Richard Payne, David Hurley, Sabrina Egglestone, Lorraine Butler, Jonathon Mellor, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Radka Jersakova, Rachel A. McKendry, Peter Diggle, Sylvia Richardson, Björn W. Schuller, Steven Gilmour, Davide Pigoli, Stephen J. Roberts, Josef Packham, Tracey Thornley, Chris C. Holmes:
A large-scale and PCR-referenced vocal audio dataset for COVID-19. CoRR abs/2212.07738 (2022) - [i105]Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker, Jobie Budd, Richard Payne, Emma Karoune, David Hurley, Alexander Titcomb, Sabrina Egglestone, Ana Tendero Cañadas, Lorraine Butler, Radka Jersakova, Jonathon Mellor, Selina Patel, Tracey Thornley, Peter Diggle, Sylvia Richardson, Josef Packham, Björn W. Schuller, Davide Pigoli, Steven Gilmour, Stephen J. Roberts, Chris C. Holmes:
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers. CoRR abs/2212.08570 (2022) - [i104]Davide Pigoli, Kieran Baker, Jobie Budd, Lorraine Butler, Harry Coppock, Sabrina Egglestone, Steven G. Gilmour, Chris C. Holmes, David Hurley, Radka Jersakova, Ivan Kiskin, Vasiliki Koutra, Jonathon Mellor, George Nicholson, Joe Packham, Selina Patel, Richard Payne, Stephen J. Roberts, Björn W. Schuller, Ana Tendero Cañadas, Tracey Thornley, Alexander Titcomb:
Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19. CoRR abs/2212.08571 (2022) - [i103]Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn W. Schuller:
Automatic Emotion Modelling in Written Stories. CoRR abs/2212.11382 (2022) - 2021
- [j178]Katrin D. Bartl-Pokorny, Malgorzata Pykala, Pinar Uluer, Duygun Erol Barkana, Alice Baird, Hatice Kose, Tatjana Zorcec, Ben Robins, Björn W. Schuller, Agnieszka Landowska:
Robot-Based Intervention for Children With Autism Spectrum Disorder: A Systematic Literature Review. IEEE Access 9: 165433-165450 (2021) - [j177]Jing Han, Zixing Zhang, Zhao Ren, Björn W. Schuller:
Exploring Perception Uncertainty for Emotion Recognition in Dyadic Conversation and Music Listening. Cogn. Comput. 13(2): 231-240 (2021) - [j176]Benjamin Sertolli, Zhao Ren, Björn W. Schuller, Nicholas Cummins:
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech. Comput. Speech Lang. 68: 101204 (2021) - [j175]Zhengxin Joseph Ye, Björn W. Schuller:
Capturing dynamics of post-earnings-announcement drift using a genetic algorithm-optimized XGBoost. Expert Syst. Appl. 177: 114892 (2021) - [j174]Zixing Zhang, Ding Liu, Jing Han, Kun Qian, Björn W. Schuller:
Learning audio sequence representations for acoustic event classification. Expert Syst. Appl. 178: 115007 (2021) - [j173]Lukas Stappen, Alice Baird, Erik Cambria, Björn W. Schuller:
Sentiment Analysis and Topic Recognition in Video Transcriptions. IEEE Intell. Syst. 36(2): 88-95 (2021) - [j172]Sushovan Chanda, Kedar Fitwe, Gauri Deshpande, Björn W. Schuller, Sachin Patel:
A Deep Audiovisual Approach for Human Confidence Classification. Frontiers Comput. Sci. 3: 674533 (2021) - [j171]Alice Baird, Andreas Triantafyllopoulos, Sandra Zänkert, Sandra Ottl, Lukas Christ, Lukas Stappen, Julian Konzok, Sarah Sturmbauer, Eva-Maria Meßner, Brigitte M. Kudielka, Nicolas Rohleder, Harald Baumeister, Björn W. Schuller:
An Evaluation of Speech-Based Recognition of Emotional and Physiological Markers of Stress. Frontiers Comput. Sci. 3: 750284 (2021) - [j170]Novi Quadrianto, Björn W. Schuller, Finnian Rachel Lattimore:
Editorial: Ethical Machine Learning and Artificial Intelligence. Frontiers Big Data 4: 742589 (2021) - [j169]Björn W. Schuller, Dagmar M. Schuller, Kun Qian, Juan Liu, Huaiyuan Zheng, Xiao Li:
COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis. Frontiers Digit. Health 3: 564906 (2021) - [j168]Yi Chang, Xin Jing, Zhao Ren, Björn W. Schuller:
CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds. Frontiers Digit. Health 3: 799067 (2021) - [j167]Jing Han, Zixing Zhang, Maja Pantic, Björn W. Schuller:
Internet of emotional people: Towards continual affective computing cross cultures via audiovisual signals. Future Gener. Comput. Syst. 114: 294-306 (2021) - [j166]Sicheng Zhao, Min Xu, Qingming Huang, Björn W. Schuller:
Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data. IEEE Multim. 28(2): 8-10 (2021) - [j165]Panagiotis Tzirakis, Jiaxin Chen, Stefanos Zafeiriou, Björn W. Schuller:
End-to-end multimodal affect recognition in real-world environments. Inf. Fusion 68: 46-53 (2021) - [j164]Kun Qian, Tomoya Koike, Kazuhiro Yoshiuchi, Björn W. Schuller, Yoshiharu Yamamoto:
Can Appliances Understand the Behavior of Elderly Via Machine Learning? A Feasibility Study. IEEE Internet Things J. 8(10): 8343-8355 (2021) - [j163]Kun Qian, Maximilian Schmitt, Huaiyuan Zheng, Tomoya Koike, Jing Han, Juan Liu, Wei Ji, Junjun Duan, Meishu Song, Zijiang Yang, Zhao Ren, Shuo Liu, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller:
Computer Audition for Fighting the SARS-CoV-2 Corona Crisis - Introducing the Multitask Speech Corpus for COVID-19. IEEE Internet Things J. 8(21): 16035-16046 (2021) - [j162]Shuo Liu, Gil Keren, Emilia Parada-Cabaleiro, Björn W. Schuller:
N-HANS: A neural network-based toolkit for in-the-wild audio enhancement. Multim. Tools Appl. 80(18): 28365-28389 (2021) - [j161]Ziping Zhao, Qifei Li, Zixing Zhang, Nicholas Cummins, Haishuai Wang, Jianhua Tao, Björn W. Schuller:
Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition. Neural Networks 141: 52-60 (2021) - [j160]Jean Kossaifi, Robert Walecki, Yannis Panagakis, Jie Shen, Maximilian Schmitt, Fabien Ringeval, Jing Han, Vedhas Pandit, Antoine Toisoul, Björn W. Schuller, Kam Star, Elnar Hajiyev, Maja Pantic:
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild. IEEE Trans. Pattern Anal. Mach. Intell. 43(3): 1022-1040 (2021) - [j159]Kun Qian, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller:
Artificial Intelligence Internet of Things for the Elderly: From Assisted Living to Health-Care Monitoring. IEEE Signal Process. Mag. 38(4): 78-88 (2021) - [j158]Björn W. Schuller, Rosalind W. Picard, Elisabeth André, Jonathan Gratch, Jianhua Tao:
Intelligent Signal Processing for Affective Computing [From the Guest Editors]. IEEE Signal Process. Mag. 38(6): 9-11 (2021) - [j157]Jing Han, Zixing Zhang, Cecilia Mascolo, Elisabeth André, Jianhua Tao, Ziping Zhao, Björn W. Schuller:
Deep Learning for Mobile Mental Health: Challenges and recent advances. IEEE Signal Process. Mag. 38(6): 96-105 (2021) - [j156]Jing Han, Zixing Zhang, Zhao Ren, Björn W. Schuller:
EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings. IEEE Trans. Affect. Comput. 12(3): 553-564 (2021) - [j155]Zengjie Zhang, Kun Qian, Björn W. Schuller, Dirk Wollherr:
An Online Robot Collision Detection and Identification Scheme by Supervised Learning and Bayesian Decision Theory. IEEE Trans Autom. Sci. Eng. 18(3): 1144-1156 (2021) - [j154]Jiaming Cheng, Ruiyu Liang, Zhenlin Liang, Li Zhao, Chengwei Huang, Björn W. Schuller:
A Deep Adaptation Network for Speech Enhancement: Combining a Relativistic Discriminator With Multi-Kernel Maximum Mean Discrepancy. IEEE ACM Trans. Audio Speech Lang. Process. 29: 41-53 (2021) - [j153]N. P. Narendra, Björn W. Schuller, Paavo Alku:
The Detection of Parkinson's Disease From Speech Using Voice Source Information. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1925-1936 (2021) - [j152]Kazi Nazmul Haque, Rajib Rana, Jiajun Liu, John H. L. Hansen, Nicholas Cummins, Carlos Busso, Björn W. Schuller:
Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2575-2590 (2021) - [j151]Kun Qian, Christoph Janott, Maximilian Schmitt, Zixing Zhang, Clemens Heiser, Werner Hemmert, Yoshiharu Yamamoto, Björn W. Schuller:
Can Machine Learning Assist Locating the Excitation of Snore Sound? A Review. IEEE J. Biomed. Health Informatics 25(4): 1233-1246 (2021) - [j150]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification. IEEE Trans. Multim. 23: 4131-4142 (2021) - [j149]Bob R. Schadenberg, Dennis Reidsma, Vanessa Evers, Daniel P. Davison, Jamy J. Li, Dirk K. J. Heylen, Carlos Neves, Paulo Alvito, Jie Shen, Maja Pantic, Björn W. Schuller, Nicholas Cummins, Vlad Olaru, Cristian Sminchisescu, Snezana Babovic Dimitrijevic, Suncica Petrovic, Aurelie Baranger, Alria Williams, Alyssa M. Alcorn, Elizabeth Pellicano:
Predictable Robots for Autistic Children - Variance in Robot Behaviour, Idiosyncrasies in Autistic Children's Characteristics, and Child-Robot Engagement. ACM Trans. Comput. Hum. Interact. 28(5): 36:1-36:42 (2021) - [j148]Ziping Zhao, Zhongtian Bao, Zixing Zhang, Nicholas Cummins, Shihuang Sun, Haishuai Wang, Jianhua Tao, Björn W. Schuller:
Self-attention transfer networks for speech emotion recognition. Virtual Real. Intell. Hardw. 3(1): 43-54 (2021) - [j147]Meishu Song, Adria Mallol-Ragolta, Emilia Parada-Cabaleiro, Zijiang Yang, Shuo Liu, Zhao Ren, Ziping Zhao, Björn W. Schuller:
Frustration recognition from speech during game interaction using wide residual networks. Virtual Real. Intell. Hardw. 3(1): 76-86 (2021) - [c535]Lukas Stappen, Lea Schumann, Anton Batliner, Björn W. Schuller:
Embracing and Exploiting Annotator Emotional Subjectivity: An Affective Rater Ensemble Model. ACII (Workshops and Demos) 2021: 1-8 - [c534]Korbinian Friedl, Georgios Rizos, Lukas Stappen, Madina Hasan, Lucia Specia, Thomas Hain, Björn W. Schuller:
Uncertainty Aware Review Hallucination for Science Article Classification. ACL/IJCNLP (Findings) 2021: 5004-5009 - [c533]Jason Thies, Lukas Stappen, Gerhard Hagerer, Björn W. Schuller, Georg Groh:
GraphTMT: Unsupervised Graph-based Topic Modeling from Video Transcripts. BigMM 2021: 1-8 - [c532]Mina A. Nessiem, Mostafa M. Mohamed, Harry Coppock, Alexander Gaskell, Björn W. Schuller:
Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks. CBMS 2021: 183-188 - [c531]Ru Zhang, Yuanchun Shi, Björn W. Schuller, Elisabeth André, Sharon L. Oviatt, Aaron Quigley, Nicolai Marquardt, Ilhan Aslan, Ran Ju:
User Experience for Multi-Device Ecosystems: Challenges and Opportunities. CHI Extended Abstracts 2021: 111:1-111:5 - [c530]Andreas Triantafyllopoulos, Manuel Milling, Konstantinos Drossos, Björn W. Schuller:
Fairness and Underspecification in Acoustic Scene Classification: The Case for Disaggregated Evaluations. DCASE 2021: 70-74 - [c529]Kun Qian, Tomoya Koike, Kota Tamada, Toru Takumi, Björn W. Schuller, Yoshiharu Yamamoto:
Sensing the Sounds of Silence: A Pilot Study on the Detection of Model Mice of Autism Spectrum Disorder from Ultrasonic Vocalisations. EMBC 2021: 68-71 - [c528]Gauri Deshpande, Björn W. Schuller:
COVID-19 Biomarkers in Speech: On Source and Filter Components. EMBC 2021: 800-803 - [c527]Shuo Liu, Adria Mallol-Ragolta, Björn W. Schuller:
COVID-19 Detection with a Novel Multi-Type Deep Fusion Method using Breathing and Coughing Information. EMBC 2021: 1840-1843 - [c526]Tomoya Koike, Kun Qian, Björn W. Schuller, Yoshiharu Yamamoto:
Transferring Cross-Corpus Knowledge: An Investigation on Data Augmentation for Heart Sound Classification. EMBC 2021: 1976-1979 - [c525]Adria Mallol-Ragolta, Shuo Liu, Björn W. Schuller:
The Filtering Effect of Face Masks in their Detection from Speech. EMBC 2021: 2079-2082 - [c524]Yi Chang, Zhao Ren, Björn W. Schuller:
Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis. EMBC 2021: 2335-2338 - [c523]Fabio Hellmann, Zhao Ren, Elisabeth André, Björn W. Schuller:
Deformable Dilated Faster R-CNN for Universal Lesion Detection in CT Images. EMBC 2021: 2896-2902 - [c522]Adria Mallol-Ragolta, Anastasia Semertzidou, Maria Pateraki, Björn W. Schuller:
harAGE: A Novel Multimodal Smartwatch-based Dataset for Human Activity Recognition. FG 2021: 1-7 - [c521]Meishu Song, Emilia Parada-Cabaleiro, Shuo Liu, Manuel Milling, Alice Baird, Zijiang Yang, Björn W. Schuller:
Supervised Contrastive Learning for Game-Play Frustration Detection from Speech. HCI (7) 2021: 617-629 - [c520]Chao Li, Boyang Chen, Ziping Zhao, Nicholas Cummins, Björn W. Schuller:
Hierarchical Attention-Based Temporal Convolutional Networks for Eeg-Based Emotion Recognition. ICASSP 2021: 1240-1244 - [c519]Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller:
Speech Emotion Recognition Using Semantic Information. ICASSP 2021: 6279-6283 - [c518]Srividya Tirunellai Rajamani, Kumar T. Rajamani, Adria Mallol-Ragolta, Shuo Liu, Björn W. Schuller:
A Novel Attention-Based Gated Recurrent Unit and its Efficacy in Speech Emotion Recognition. ICASSP 2021: 6294-6298 - [c517]Andreas Triantafyllopoulos, Björn W. Schuller:
The Role of Task and Acoustic Similarity in Audio Transfer Learning: Insights from the Speech Emotion Recognition Case. ICASSP 2021: 7268-7272 - [c516]Meishu Song, Kun Qian, Bin Chen, Keiju Okabayashi, Emilia Parada-Cabaleiro, Zijiang Yang, Shuo Liu, Kazumasa Togami, Ichiro Hidaka, Yueheng Wang, Björn W. Schuller, Yoshiharu Yamamoto:
Predicting Group Work Performance from Physical Handwriting Features in a Smart English Classroom. ICDSP 2021: 140-145 - [c515]Andreas Triantafyllopoulos, Shuo Liu, Björn W. Schuller:
Deep speaker conditioning for speech emotion recognition. ICME 2021: 1-6 - [c514]Björn W. Schuller, Tuomas Virtanen, Maria Riveiro, Georgios Rizos, Jing Han, Annamaria Mesaros, Konstantinos Drossos:
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence. ICMI 2021: 788-792 - [c513]Dongyan Huang, Björn W. Schuller, Jianhua Tao, Lei Xie, Jie Yang:
ASMMC21: The 6th International Workshop on Affective Social Multimedia Computing. ICMI 2021: 864-867 - [c512]Shahin Amiriparian, Björn W. Schuller:
AI Hears Your Health: Computer Audition for Health Monitoring. IHAW 2021: 227-233 - [c511]Zhengxin Joseph Ye, Björn W. Schuller:
Deep Learning Post-Earnings-Announcement Drift. IJCNN 2021: 1-7 - [c510]Björn W. Schuller, Anton Batliner, Christian Bergler, Cecilia Mascolo, Jing Han, Iulia Lefter, Heysem Kaya, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Maurice Gerczuk, Panagiotis Tzirakis, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Léon J. M. Rothkrantz, Joeri A. Zwerts, Jelle Treep, Casper S. Kaandorp:
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates. Interspeech 2021: 431-435 - [c509]Georgios Rizos, Jenna Lawson, Zhuoda Han, Duncan Butler, James Rosindell, Krystian Mikolajczyk, Cristina Banks-Leite, Björn W. Schuller:
Multi-Attentive Detection of the Spider Monkey Whinny in the (Actual) Wild. Interspeech 2021: 471-475 - [c508]Judith Dineley, Grace Lavelle, Daniel Leightley, Faith Matcham, Sara Siddi, Maria Teresa Peñarrubia-María, Katie M. White, Alina Ivan, Carolin Oetzmann, Sara Simblett, Erin Dawe-Lane, Stuart Bruce, Daniel Stahl, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Amos A. Folarin, Josep Maria Haro, Til Wykes, Richard J. B. Dobson, Vaibhav A. Narayan, Matthew Hotopf, Björn W. Schuller, Nicholas Cummins, RADAR-CNS Consortium:
Remote Smartphone-Based Speech Collection: Acceptance and Barriers in Individuals with Major Depressive Disorder. Interspeech 2021: 631-635 - [c507]Xiangheng He, Junjie Chen, Georgios Rizos, Björn W. Schuller:
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation. Interspeech 2021: 821-825 - [c506]Vincent Karas, Björn W. Schuller:
Recognising Covid-19 from Coughing Using Ensembles of SVMs and LSTMs with Handcrafted and Deep Audio Features. Interspeech 2021: 911-915 - [c505]Gauri Deshpande, Björn W. Schuller:
The DiCOVA 2021 Challenge - An Encoder-Decoder Approach for COVID-19 Recognition from Coughing Audio. Interspeech 2021: 931-935 - [c504]Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller:
Cough-Based COVID-19 Detection with Contextual Attention Convolutional Neural Networks and Gender Information. Interspeech 2021: 941-945 - [c503]Pascal Hecker, Florian B. Pokorny, Katrin D. Bartl-Pokorny, Uwe D. Reichel, Zhao Ren, Simone Hantke, Florian Eyben, Dagmar M. Schuller, Bert Arnrich, Björn W. Schuller:
Speaking Corona? Human and Machine Recognition of COVID-19 from Voice. Interspeech 2021: 1029-1033 - [c502]Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Björn W. Schuller, Maja Pantic:
LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision. Interspeech 2021: 3011-3015 - [c501]Alice Baird, Silvan Mertes, Manuel Milling, Lukas Stappen, Thomas Wiest, Elisabeth André, Björn W. Schuller:
A Prototypical Network Approach for Evaluating Generated Emotional Speech. Interspeech 2021: 3161-3165 - [c500]Tianhao Yan, Hao Meng, Emilia Parada-Cabaleiro, Shuo Liu, Meishu Song, Björn W. Schuller:
Coughing-Based Recognition of Covid-19 with Spatial Attentive ConvLSTM Recurrent Neural Networks. Interspeech 2021: 4154-4158 - [c499]Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Björn W. Schuller, Markus Schedl:
Automatic Recognition of Texture in Renaissance Music. ISMIR 2021: 509-516 - [c498]Joshua Y. Kim, Chunfeng Liu, Rafael A. Calvo, Kathryn McCabe, Silas C. R. Taylor, Björn W. Schuller, Kaihang Wu:
Comparison of Automatic Speech Recognition Systems. IWSDS 2021: 123-131 - [c497]Kun Qian, Björn W. Schuller, Yoshiharu Yamamoto:
Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview. LifeTech 2021: 181-182 - [c496]Lukas Stappen, Alice Baird, Lukas Christ, Lea Schumann, Benjamin Sertolli, Eva-Maria Meßner, Erik Cambria, Guoying Zhao, Björn W. Schuller:
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress. MuSe @ ACM Multimedia 2021: 5-14 - [c495]Alice Baird, Lukas Stappen, Lukas Christ, Lea Schumann, Eva-Maria Messner, Björn W. Schuller:
A Physiologically-Adapted Gold Standard for Arousal during Stress. MuSe @ ACM Multimedia 2021: 69-73 - [c494]Lukas Stappen, Lea Schumann, Benjamin Sertolli, Alice Baird, Benjamin Weigel, Erik Cambria, Björn W. Schuller:
MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox. MuSe @ ACM Multimedia 2021: 75-82 - [c493]Lukas Stappen, Eva-Maria Meßner, Erik Cambria, Guoying Zhao, Björn W. Schuller:
MuSe 2021 Challenge: Multimodal Emotion, Sentiment, Physiological-Emotion, and Stress Detection. ACM Multimedia 2021: 5706-5707 - [c492]Toby Godwin, Georgios Rizos, Alice Baird, Najla D. Al Futaisi, Vincent Brisse, Björn W. Schuller:
Evaluating Deep Music Generation Methods Using Data Augmentation. MMSP 2021: 1-6 - [c491]Srividya Tirunellai Rajamani, Kumar T. Rajamani, Björn W. Schuller:
Towards an Efficient Deep Learning Model for Emotion and Theme Recognition in Music. MMSP 2021: 1-5 - [c490]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR: Holistic Evaluation of Audio Representations. NeurIPS (Competition and Demos) 2021: 125-145 - [c489]Xinzhou Xu, Jun Deng, Zixing Zhang, Chen Wu, Björn W. Schuller:
Identifying surgical-mask speech using deep neural networks on low-level aggregation. SAC 2021: 580-585 - [c488]Alice Baird, Shahin Amiriparian, Manuel Milling, Björn W. Schuller:
Emotion Recognition in Public Speaking Scenarios Utilising An LSTM-RNN Approach with Attention. SLT 2021: 397-402 - [c487]Decky Aspandi, Federico Sukno, Björn W. Schuller, Xavier Binefa:
An Enhanced Adversarial Network with Combined Latent Features for Spatio-temporal Facial Affect Estimation in the Wild. VISIGRAPP (4: VISAPP) 2021: 172-181 - [e21]Joseph Turian, Björn W. Schuller, Dorien Herremans, Katrin Kirchhoff, L. Paola García-Perera, Philippe Esling:
HEAR: Holistic Evaluation of Audio Representations, Virtual Event, December 13-14, 2021. Proceedings of Machine Learning Research 166, PMLR 2021 [contents] - [e20]Björn W. Schuller, Lukas Stappen, Eva-Maria Meßner, Erik Cambria, Guoying Zhao:
MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, Virtual Event, China, 24 October 2021. ACM 2021, ISBN 978-1-4503-8678-4 [contents] - [i102]Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Björn W. Schuller, Jiajun Liu:
A novel policy for pre-trained Deep Reinforcement Learning for Speech Emotion Recognition. CoRR abs/2101.00738 (2021) - [i101]Ognjen Rudovic, Nicolas Tobis, Sebastian Kaltwang, Björn W. Schuller, Daniel Rueckert, Jeffrey F. Cohn, Rosalind W. Picard:
Personalized Federated Deep Learning for Pain Estimation From Face Images. CoRR abs/2101.04800 (2021) - [i100]Zhao Ren, Kun Qian, Fengquan Dong, Zhenyu Dai, Yoshiharu Yamamoto, Björn W. Schuller:
Deep Attention-based Representation Learning for Heart Sound Classification. CoRR abs/2101.04979 (2021) - [i99]Lukas Stappen, Alice Baird, Lea Schumann, Björn W. Schuller:
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements. CoRR abs/2101.06053 (2021) - [i98]Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Alice Baird, Lyn Jones, Björn W. Schuller:
End-2-End COVID-19 Detection from Breath & Cough Audio. CoRR abs/2102.08359 (2021) - [i97]Decky Aspandi, Federico Sukno, Björn W. Schuller, Xavier Binefa:
An Enhanced Adversarial Network with Combined Latent Features for Spatio-Temporal Facial Affect Estimation in the Wild. CoRR abs/2102.09150 (2021) - [i96]Björn W. Schuller, Anton Batliner, Christian Bergler, Cecilia Mascolo, Jing Han, Iulia Lefter, Heysem Kaya, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Maurice Gerczuk, Panagiotis Tzirakis, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Léon J. M. Rothkrantz, Joeri A. Zwerts, Jelle Treep, Casper S. Kaandorp:
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates. CoRR abs/2102.13468 (2021) - [i95]Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller:
Speech Emotion Recognition using Semantic Information. CoRR abs/2103.02993 (2021) - [i94]Maurice Gerczuk, Shahin Amiriparian, Sandra Ottl, Björn W. Schuller:
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition. CoRR abs/2103.08310 (2021) - [i93]Sicheng Zhao, Quanwei Huang, Youbao Tang, Xingxu Yao, Jufeng Yang, Guiguang Ding, Björn W. Schuller:
Computational Emotion Analysis From Images: Recent Advances and Future Directions. CoRR abs/2103.10798 (2021) - [i92]Lukas Stappen, Alice Baird, Lukas Christ, Lea Schumann, Benjamin Sertolli, Eva-Maria Messner, Erik Cambria, Guoying Zhao, Björn W. Schuller:
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress. CoRR abs/2104.07123 (2021) - [i91]Judith Dineley, Grace Lavelle, Daniel Leightley, Faith Matcham, Sara Siddi, Maria Teresa Peñarrubia-María, Katie M. White, Alina Ivan, Carolin Oetzmann, Sara Simblett, Erin Dawe-Lane, Stuart Bruce, Daniel Stahl, Josep Maria Haro, Til Wykes, Vaibhav A. Narayan, Matthew Hotopf, Björn W. Schuller, Nicholas Cummins:
Remote smartphone-based speech collection: acceptance and barriers in individuals with major depressive disorder. CoRR abs/2104.08600 (2021) - [i90]Shuo Liu, Jing Han, Estela Laporta Puyal, Spyridon Kontaxis, Shaoxiong Sun, Patrick Locatelli, Judith Dineley, Florian B. Pokorny, Gloria Dalla Costa, Letizia Leocani, Ana Isabel Guerrero, Carlos Nos, Ana Zabalza, Per Soelberg Sørensen, Mathias Buron, Melinda Magyari, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Callum L. Stewart, Amos A. Folarin, Richard J. B. Dobson, Raquel Bailón, Srinivasan Vairavan, Nicholas Cummins, Vaibhav A. Narayan, Matthew Hotopf, Giancarlo Comi, Björn W. Schuller:
Fitbeat: COVID-19 Estimation based on Wristband Heart Rate. CoRR abs/2104.09263 (2021) - [i89]Shahin Amiriparian, Artem Sokolov, Ilhan Aslan, Lukas Christ, Maurice Gerczuk, Tobias Hübner, Dmitry Lamanov, Manuel Milling, Sandra Ottl, Ilya Poduremennykh, Evgeniy Shuranov, Björn W. Schuller:
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era. CoRR abs/2104.10121 (2021) - [i88]Shahin Amiriparian, Tobias Hübner, Maurice Gerczuk, Sandra Ottl, Björn W. Schuller:
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing from Decentralised Data. CoRR abs/2104.11629 (2021) - [i87]Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic:
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks. CoRR abs/2104.13332 (2021) - [i86]Lukas Stappen, Gerhard Hagerer, Björn W. Schuller, Georg Groh:
Unsupervised Graph-based Topic Modeling from Video Transcriptions. CoRR abs/2105.01466 (2021) - [i85]Lukas Stappen, Alice Baird, Michelle Lienhart, Annalena Bätz, Björn W. Schuller:
An Estimation of Online Video User Engagement from Features of Continuous Emotions. CoRR abs/2105.01633 (2021) - [i84]Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Björn W. Schuller, Maja Pantic:
LiRA: Learning Visual Speech Representations from Audio through Self-supervision. CoRR abs/2106.09171 (2021) - [i83]Sicheng Zhao, Xingxu Yao, Jufeng Yang, Guoli Jia, Guiguang Ding, Tat-Seng Chua, Björn W. Schuller, Kurt Keutzer:
Affective Image Content Analysis: Two Decades Review and New Perspectives. CoRR abs/2106.16125 (2021) - [i82]Xiangheng He, Junjie Chen, Georgios Rizos, Björn W. Schuller:
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation. CoRR abs/2107.08361 (2021) - [i81]Lukas Stappen, Lea Schumann, Benjamin Sertolli, Alice Baird, Benjamin Weigel, Erik Cambria, Björn W. Schuller:
MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox. CoRR abs/2107.11757 (2021) - [i80]Alice Baird, Lukas Stappen, Lukas Christ, Lea Schumann, Eva-Maria Meßner, Björn W. Schuller:
A Physiologically-Adapted Gold Standard for Arousal during Stress. CoRR abs/2107.12964 (2021) - [i79]Alican Akman, Harry Coppock, Alexander Gaskell, Panagiotis Tzirakis, Lyn Jones, Björn W. Schuller:
Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 from Audio Challenges. CoRR abs/2107.14549 (2021) - [i78]Zhao Ren, Yi Chang, Björn W. Schuller:
The EIHW-GLAM Deep Attentive Multi-model Fusion System for Cough-based COVID-19 Recognition in the DiCOVA 2021 Challenge. CoRR abs/2108.03041 (2021) - [i77]Sandra Ottl, Shahin Amiriparian, Maurice Gerczuk, Björn W. Schuller:
A Machine Learning Framework for Automatic Prediction of Human Semen Motility. CoRR abs/2109.08049 (2021) - [i76]Andreas Triantafyllopoulos, Manuel Milling, Konstantinos Drossos, Björn W. Schuller:
Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations. CoRR abs/2110.01506 (2021) - [i75]Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller:
EIHW-MTG DiCOVA 2021 Challenge System Report. CoRR abs/2110.06543 (2021) - [i74]Andreas Triantafyllopoulos, Uwe D. Reichel, Shuo Liu, Stephan Huber, Florian Eyben, Björn W. Schuller:
Multistage linguistic conditioning of convolutional layers for speech emotion recognition. CoRR abs/2110.06650 (2021) - [i73]Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez, Björn W. Schuller:
EIHW-MTG: Second DiCOVA Challenge System Report. CoRR abs/2110.09239 (2021) - [i72]Panagiotis Tzirakis, Dénes Boros, Elnar Hajiyev, Björn W. Schuller:
Facial Emotion Recognition using Deep Residual Networks in Real-World Environments. CoRR abs/2111.02717 (2021) - [i71]Effie Lai-Chong Law, Asbjørn Følstad, Jonathan Grudin, Björn W. Schuller:
Conversational Agent as Trustworthy Autonomous System (Trust-CA) (Dagstuhl Seminar 21381). Dagstuhl Reports 11(8): 76-114 (2021) - 2020
- [j146]Kazi Nazmul Haque, Rajib Rana, Björn W. Schuller:
High-Fidelity Audio Generation and Representation Learning With Guided Adversarial Autoencoder. IEEE Access 8: 223509-223528 (2020) - [j145]Panpan Wu, Xuanchao Sun, Ziping Zhao, Haishuai Wang, Shirui Pan, Björn W. Schuller:
Classification of Lung Nodules Based on Deep Residual Networks and Migration Learning. Comput. Intell. Neurosci. 2020: 8975078:1-8975078:10 (2020) - [j144]Shahin Amiriparian, Maurice Gerczuk, Sandra Ottl, Lukas Stappen, Alice Baird, Lukas Koebe, Björn W. Schuller:
Towards cross-modal pre-training and learning tempo-spatial characteristics for audio recognition with convolutional and recurrent neural networks. EURASIP J. Audio Speech Music. Process. 2020(1): 19 (2020) - [j143]Alice Baird, Björn W. Schuller:
Considerations for a More Ethical Approach to Data in AI: On Data Representation and Infrastructure. Frontiers Big Data 3: 25 (2020) - [j142]Kun Qian, Xiao Li, Haifeng Li, Shengchen Li, Wei Li, Zuoliang Ning, Shuai Yu, Limin Hou, Gang Tang, Jing Lu, Feng Li, Shufei Duan, Chengcheng Du, Yao Cheng, Yujun Wang, Lin Gan, Yoshiharu Yamamoto, Björn W. Schuller:
Computer Audition for Healthcare: Opportunities and Challenges. Frontiers Digit. Health 2: 5 (2020) - [j141]Nicholas Cummins, Björn W. Schuller:
Five Crucial Challenges in Digital Health. Frontiers Digit. Health 2: 536203 (2020) - [j140]Emilia Parada-Cabaleiro, Anton Batliner, Alice Baird, Björn W. Schuller:
The perception of emotional cues by children in artificial background noise. Int. J. Speech Technol. 23(1): 169-182 (2020) - [j139]Vedhas Pandit, Maximilian Schmitt, Nicholas Cummins, Björn W. Schuller:
I see it in your eyes: Training the shallowest-possible CNN to recognise emotions and pain from muted web-assisted in-the-wild video-chats in real-time. Inf. Process. Manag. 57(6): 102347 (2020) - [j138]Ziping Zhao, Zhongtian Bao, Zixing Zhang, Jun Deng, Nicholas Cummins, Haishuai Wang, Jianhua Tao, Björn W. Schuller:
Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders. IEEE J. Sel. Top. Signal Process. 14(2): 423-434 (2020) - [j137]Gil Keren, Sivan Sabato, Björn W. Schuller:
Analysis of loss functions for fast single-class classification. Knowl. Inf. Syst. 62(1): 337-358 (2020) - [j136]Tobias Baur, Alexander Heimerl, Florian Lingenfelser, Johannes Wagner, Michel F. Valstar, Björn W. Schuller, Elisabeth André:
eXplainable Cooperative Machine Learning with NOVA. Künstliche Intell. 34(2): 143-164 (2020) - [j135]Emilia Parada-Cabaleiro, Giovanni Costantini, Anton Batliner, Maximilian Schmitt, Björn W. Schuller:
DEMoS: an Italian emotional speech corpus. Lang. Resour. Evaluation 54(2): 341-383 (2020) - [j134]Maria Littmann, Katharina Selig, Liel Cohen-Lavi, Yotam Frank, Peter Hönigschmid, Evans Kataka, Anja Mösch, Kun Qian, Avihai Ron, Sebastian Schmid, Adam Sorbie, Liran Szlak, Ayana Dagan-Wiener, Nir Ben-Tal, Masha Y. Niv, Daniel Razansky, Björn W. Schuller, Donna P. Ankerst, Tomer Hertz, Burkhard Rost:
Validity of machine learning in biology and medicine increased through collaborations across fields of expertise. Nat. Mach. Intell. 2(1): 18-24 (2020) - [j133]Jun Deng, Björn W. Schuller, Florian Eyben, Dagmar Schuller, Zixing Zhang, Holly Francois, Eunmi Oh:
Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration. Neural Comput. Appl. 32(4): 1095-1107 (2020) - [j132]Shahin Amiriparian, Nicholas Cummins, Maurice Gerczuk, Sergey Pugachevskiy, Sandra Ottl, Björn W. Schuller:
"Are You Playing a Shooter Again?!" Deep Representation Learning for Audio-Based Video Game Genre Recognition. IEEE Trans. Games 12(2): 145-154 (2020) - [j131]Yue Zhang, Andrea Michi, Johannes Wagner, Elisabeth André, Björn W. Schuller, Felix Weninger:
A Generic Human-Machine Annotation Framework Based on Dynamic Cooperative Learning. IEEE Trans. Cybern. 50(3): 1230-1239 (2020) - [j130]Zixing Zhang, Dimitris N. Metaxas, Hung-yi Lee, Björn W. Schuller:
Guest Editorial Special Issue on Adversarial Learning in Computational Intelligence. IEEE Trans. Emerg. Top. Comput. Intell. 4(4): 414-416 (2020) - [j129]Zixing Zhang, Jing Han, Kun Qian, Christoph Janott, Yanan Guo, Björn W. Schuller:
Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data. IEEE J. Biomed. Health Informatics 24(1): 300-310 (2020) - [j128]Fengquan Dong, Kun Qian, Zhao Ren, Alice Baird, Xinjian Li, Zhenyu Dai, Bo Dong, Florian Metze, Yoshiharu Yamamoto, Björn W. Schuller:
Machine Listening for Heart Status Monitoring: Introducing and Benchmarking HSS - The Heart Sounds Shenzhen Corpus. IEEE J. Biomed. Health Informatics 24(7): 2082-2092 (2020) - [c486]Tomoya Koike, Kun Qian, Qiuqiang Kong, Mark D. Plumbley, Björn W. Schuller, Yoshiharu Yamamoto:
Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification. EMBC 2020: 74-77 - [c485]Panagiotis Tzirakis, Athanasios Papaioannou, Alexandros Lattas, Michail Tarasiou, Björn W. Schuller, Stefanos Zafeiriou:
Synthesising 3D Facial Motion from "In-the-Wild" Speech. FG 2020: 265-272 - [c484]Decky Aspandi, Adria Mallol-Ragolta, Björn W. Schuller, Xavier Binefa:
Latent-Based Adversarial Neural Networks for Facial Affect Estimations. FG 2020: 606-610 - [c483]Adria Mallol-Ragolta, Shuo Liu, Nicholas Cummins, Björn W. Schuller:
A Curriculum Learning Approach for Pain Intensity Recognition from Facial Expressions. FG 2020: 829-833 - [c482]Alice Baird, Meishu Song, Björn W. Schuller:
Interaction with the Soundscape: Exploring Emotional Audio Generation for Improved Individual Wellbeing. HCI (37) 2020: 229-242 - [c481]Georgios Rizos, Alice Baird, Max Elliott, Björn W. Schuller:
Stargan for Emotional Speech Conversion: Validated by Data Augmentation of End-To-End Emotion Recognition. ICASSP 2020: 3502-3506 - [c480]Wenjing Han, Tao Jiang, Yan Li, Björn W. Schuller, Huabin Ruan:
Ordinal Learning for Emotion Recognition in Customer Service Calls. ICASSP 2020: 6494-6498 - [c479]Ziping Zhao, Zhongtian Bao, Zixing Zhang, Nicholas Cummins, Haishuai Wang, Björn W. Schuller:
Hierarchical Attention Transfer Networks for Depression Assessment from Speech. ICASSP 2020: 7159-7163 - [c478]Zhao Ren, Alice Baird, Jing Han, Zixing Zhang, Björn W. Schuller:
Generating and Protecting Against Adversarial Attacks for Deep Speech-Based Emotion Recognition Models. ICASSP 2020: 7184-7188 - [c477]Sandra Ottl, Shahin Amiriparian, Maurice Gerczuk, Vincent Karas, Björn W. Schuller:
Group-level Speech Emotion Recognition Utilising Deep Spectrum Features. ICMI 2020: 821-826 - [c476]Lukas Stappen, Georgios Rizos, Björn W. Schuller:
X-AWARE: ConteXt-AWARE Human-Environment Attention Fusion for Driver Gaze Prediction in the Wild. ICMI 2020: 858-867 - [c475]Chao Li, Qian Zhang, Ziping Zhao, Li Gu, Björn W. Schuller:
Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network. ICPR 2020: 8862-8869 - [c474]Shuo Liu, Jinlong Jiao, Ziping Zhao, Judith Dineley, Nicholas Cummins, Björn W. Schuller:
Hierarchical Component-attention Based Speaker Turn Embedding for Emotion Recognition. IJCNN 2020: 1-7 - [c473]Catarina Botelho, Lorenz Diener, Dennis Küster, Kevin Scheck, Shahin Amiriparian, Björn W. Schuller, Tanja Schultz, Alberto Abad, Isabel Trancoso:
Toward Silent Paralinguistics: Speech-to-EMG - Retrieving Articulatory Muscle Activity from Speech. INTERSPEECH 2020: 354-358 - [c472]Zhao Ren, Jing Han, Nicholas Cummins, Björn W. Schuller:
Enhancing Transferability of Black-Box Adversarial Attacks via Lifelong Learning for Speech Emotion Recognition Models. INTERSPEECH 2020: 496-500 - [c471]Adria Mallol-Ragolta, Nicholas Cummins, Björn W. Schuller:
An Investigation of Cross-Cultural Semi-Supervised Learning for Continuous Affect Recognition. INTERSPEECH 2020: 511-515 - [c470]Siddique Latif, Muhammad Asim, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Augmenting Generative Adversarial Networks for Speech Emotion Recognition. INTERSPEECH 2020: 521-525 - [c469]Panagiotis Tzirakis, Alexander Shiarella, Robert M. Ewers, Björn W. Schuller:
Computer Audition for Continuous Rainforest Occupancy Monitoring: The Case of Bornean Gibbons' Call Detection. INTERSPEECH 2020: 1211-1215 - [c468]Lukas Stappen, Georgios Rizos, Madina Hasan, Thomas Hain, Björn W. Schuller:
Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus. INTERSPEECH 2020: 1808-1812 - [c467]Björn W. Schuller, Anton Batliner, Christian Bergler, Eva-Maria Messner, Antonia F. de C. Hamilton, Shahin Amiriparian, Alice Baird, Georgios Rizos, Maximilian Schmitt, Lukas Stappen, Harald Baumeister, Alexis Deighton MacIntyre, Simone Hantke:
The INTERSPEECH 2020 Computational Paralinguistics Challenge: Elderly Emotion, Breathing & Masks. INTERSPEECH 2020: 2042-2046 - [c466]Tomoya Koike, Kun Qian, Björn W. Schuller, Yoshiharu Yamamoto:
Learning Higher Representations from Pre-Trained Deep Models with Data Augmentation for the COMPARE 2020 Challenge Mask Task. INTERSPEECH 2020: 2047-2051 - [c465]Alexis Deighton MacIntyre, Georgios Rizos, Anton Batliner, Alice Baird, Shahin Amiriparian, Antonia F. de C. Hamilton, Björn W. Schuller:
Deep Attentive End-to-End Continuous Breath Sensing from Speech. INTERSPEECH 2020: 2082-2086 - [c464]Nicholas Cummins, Yilin Pan, Zhao Ren, Julian Fritsch, Venkata Srikanth Nallanthighal, Heidi Christensen, Daniel Blackburn, Björn W. Schuller, Mathew Magimai-Doss, Helmer Strik, Aki Härmä:
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition. INTERSPEECH 2020: 2182-2186 - [c463]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-Corpus Setting for Speech Emotion Recognition. INTERSPEECH 2020: 2327-2331 - [c462]Zijiang Yang, Shuo Liu, Meishu Song, Emilia Parada-Cabaleiro, Björn W. Schuller:
Adventitious Respiratory Classification Using Attentive Residual Neural Networks. INTERSPEECH 2020: 2912-2916 - [c461]Shuo Liu, Andreas Triantafyllopoulos, Zhao Ren, Björn W. Schuller:
Towards Speech Robustness for Acoustic Scene Classification. INTERSPEECH 2020: 3087-3091 - [c460]Lorenz Diener, Shahin Amiriparian, Catarina Botelho, Kevin Scheck, Dennis Küster, Isabel Trancoso, Björn W. Schuller, Tanja Schultz:
Towards Silent Paralinguistics: Deriving Speaking Mode and Speaker ID from Electromyographic Signals. INTERSPEECH 2020: 3117-3121 - [c459]Merlin Albes, Zhao Ren, Björn W. Schuller, Nicholas Cummins:
Squeeze for Sneeze: Compact Neural Networks for Cold and Flu Recognition. INTERSPEECH 2020: 4546-4550 - [c458]Jing Han, Kun Qian, Meishu Song, Zijiang Yang, Zhao Ren, Shuo Liu, Juan Liu, Huaiyuan Zheng, Wei Ji, Tomoya Koike, Xiao Li, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller:
An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety. INTERSPEECH 2020: 4946-4950 - [c457]Alice Baird, Nicholas Cummins, Sebastian Schnieder, Jarek Krajewski, Björn W. Schuller:
An Evaluation of the Effect of Anxiety on Speech - Computational Prediction of Anxiety from Sustained Vowels. INTERSPEECH 2020: 4951-4955 - [c456]Ziping Zhao, Qifei Li, Nicholas Cummins, Bin Liu, Haishuai Wang, Jianhua Tao, Björn W. Schuller:
Hybrid Network Feature Extraction for Depression Assessment from Speech. INTERSPEECH 2020: 4956-4960 - [c455]Georgios Rizos, Björn W. Schuller:
Average Jane, Where Art Thou? - Recent Avenues in Efficient Machine Learning Under Subjectivity Uncertainty. IPMU (1) 2020: 42-55 - [c454]Maurice Gerczuk, Shahin Amiriparian, Sandra Ottl, Srividya Tirunellai Rajamani, Björn W. Schuller:
Emotion and Themes Recognition in Music with Convolutional and Recurrent Attention-Blocks. MediaEval 2020 - [c453]Srividya Tirunellai Rajamani, Kumar T. Rajamani, Björn W. Schuller:
Emotion and Theme Recognition in Music Using Attention-Based Methods. MediaEval 2020 - [c452]Shahin Amiriparian, Pawel Winokurow, Vincent Karas, Sandra Ottl, Maurice Gerczuk, Björn W. Schuller:
Unsupervised Representation Learning with Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech. MuSe @ ACM Multimedia 2020: 11-17 - [c451]Lukas Stappen, Alice Baird, Georgios Rizos, Panagiotis Tzirakis, Xinchen Du, Felix Hafner, Lea Schumann, Adria Mallol-Ragolta, Björn W. Schuller, Iulia Lefter, Erik Cambria, Ioannis Kompatsiaris:
MuSe 2020 Challenge and Workshop: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media: Emotional Car Reviews in-the-wild. MuSe @ ACM Multimedia 2020: 35-44 - [c450]Lukas Stappen, Björn W. Schuller, Iulia Lefter, Erik Cambria, Ioannis Kompatsiaris:
Summary of MuSe 2020: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media. ACM Multimedia 2020: 4769-4770 - [c449]Silvan Mertes, Alice Baird, Dominik Schiller, Björn W. Schuller, Elisabeth André:
An Evolutionary-based Generative Approach for Audio Data Augmentation. MMSP 2020: 1-6 - [c448]Gauri Deshpande, Sachin Patel, Sushovan Chanda, Priti Patil, Vasundhara Agrawal, Björn W. Schuller:
Laughter as a Controller in a Stress Buster Game. PervasiveHealth 2020: 316-324 - [e19]Björn W. Schuller, Iulia Lefter, Erik Cambria, Ioannis Kompatsiaris, Lukas Stappen:
MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, Seattle, WA, USA, October 16, 2020. ACM 2020, ISBN 978-1-4503-8157-4 [contents] - [i70]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Junaid Qadir, Björn W. Schuller:
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends. CoRR abs/2001.00378 (2020) - [i69]Decky Aspandi, Adria Mallol-Ragolta, Björn W. Schuller, Xavier Binefa:
Adversarial-based neural networks for affect estimations in the wild. CoRR abs/2002.00883 (2020) - [i68]Kazi Nazmul Haque, Rajib Rana, Björn W. Schuller:
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data. CoRR abs/2003.02836 (2020) - [i67]Björn W. Schuller, Dagmar M. Schuller, Kun Qian, Juan Liu, Huaiyuan Zheng, Xiao Li:
COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis. CoRR abs/2003.11117 (2020) - [i66]Lukas Stappen, Fabian Brunn, Björn W. Schuller:
Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL. CoRR abs/2004.13850 (2020) - [i65]Lukas Stappen, Alice Baird, Georgios Rizos, Panagiotis Tzirakis, Xinchen Du, Felix Hafner, Lea Schumann, Adria Mallol-Ragolta, Björn W. Schuller, Iulia Lefter, Erik Cambria, Ioannis Kompatsiaris:
MuSe 2020 - The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop. CoRR abs/2004.14858 (2020) - [i64]Jing Han, Kun Qian, Meishu Song, Zijiang Yang, Zhao Ren, Shuo Liu, Juan Liu, Huaiyuan Zheng, Wei Ji, Tomoya Koike, Xiao Li, Zixing Zhang, Yoshiharu Yamamoto, Björn W. Schuller:
An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety. CoRR abs/2005.00096 (2020) - [i63]Tomoya Koike, Kun Qian, Björn W. Schuller, Yoshiharu Yamamoto:
deepSELF: An Open Source Deep Self End-to-End Learning Framework. CoRR abs/2005.06993 (2020) - [i62]Mostafa M. Mohamed, Björn W. Schuller:
"I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition. CoRR abs/2005.07757 (2020) - [i61]Mostafa M. Mohamed, Björn W. Schuller:
ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition. CoRR abs/2005.07777 (2020) - [i60]Mostafa M. Mohamed, Mina A. Nessiem, Björn W. Schuller:
On Deep Speech Packet Loss Concealment: A Mini-Survey. CoRR abs/2005.07794 (2020) - [i59]Siddique Latif, Muhammad Asim, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Augmenting Generative Adversarial Networks for Speech Emotion Recognition. CoRR abs/2005.08447 (2020) - [i58]Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller:
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition. CoRR abs/2005.08453 (2020) - [i57]Gauri Deshpande, Björn W. Schuller:
An Overview on Audio, Signal, Speech, & Language Processing for COVID-19. CoRR abs/2005.08579 (2020) - [i56]Shahin Amiriparian, Pawel Winokurow, Vincent Karas, Sandra Ottl, Maurice Gerczuk, Björn W. Schuller:
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech. CoRR abs/2005.08722 (2020) - [i55]Thejan Rajapakshe, Siddique Latif, Rajib Rana, Sara Khalifa, Björn W. Schuller:
Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition. CoRR abs/2005.11172 (2020) - [i54]Kazi Nazmul Haque, Rajib Rana, Björn W. Schuller:
High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder. CoRR abs/2006.00877 (2020) - [i53]Lukas Stappen, Xinchen Du, Vincent Karas, Stefan Müller, Björn W. Schuller:
Go-CaRD - Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications. CoRR abs/2006.08521 (2020) - [i52]Liang Zhang, Johann Li, Ping Li, Xiaoyuan Lu, Peiyi Shen, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Kun Qian, Björn W. Schuller:
MeDaS: An open-source platform as service to help break the walls between medicine and informatics. CoRR abs/2007.06013 (2020) - [i51]Zhengxin Joseph Ye, Björn W. Schuller:
Capturing dynamics of post-earnings-announcement drift using genetic algorithm-optimised supervised learning. CoRR abs/2009.03094 (2020) - [i50]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification. CoRR abs/2011.09299 (2020) - [i49]Gauri Deshpande, Björn W. Schuller:
Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview. CoRR abs/2011.14445 (2020) - [i48]Kun Qian, Björn W. Schuller, Yoshiharu Yamamoto:
Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview. CoRR abs/2012.04650 (2020) - [i47]Katrin D. Bartl-Pokorny, Florian B. Pokorny, Anton Batliner, Shahin Amiriparian, Anastasia Semertzidou, Florian Eyben, Elena Kramer, Florian Schmidt, Rainer Schönweiler, Markus Wehler, Björn W. Schuller:
The voice of COVID-19: Acoustic correlates of infection. CoRR abs/2012.09478 (2020) - [i46]Björn W. Schuller, Harry Coppock, Alexander Gaskell:
Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks. CoRR abs/2012.14553 (2020)
2010 – 2019
- 2019
- [j127]Ziping Zhao, Zhongtian Bao, Yiqin Zhao, Zixing Zhang, Nicholas Cummins, Zhao Ren, Björn W. Schuller:
Exploring Deep Spectrum Representations via Attention-Based Recurrent and Convolutional Neural Networks for Speech Emotion Recognition. IEEE Access 7: 97515-97525 (2019) - [j126]Jing Han, Zixing Zhang, Björn W. Schuller:
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives [Review Article]. IEEE Comput. Intell. Mag. 14(2): 68-81 (2019) - [j125]Björn W. Schuller:
Microexpressions: A Chance for Computers to Beat Humans at Detecting Hidden Emotions? Computer 52(2): 4-5 (2019) - [j124]Björn W. Schuller, Felix Weninger, Yue Zhang, Fabien Ringeval, Anton Batliner, Stefan Steidl, Florian Eyben, Erik Marchi, Alessandro Vinciarelli, Klaus R. Scherer, Mohamed Chetouani, Marcello Mortillaro:
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge. Comput. Speech Lang. 53: 156-180 (2019) - [j123]Shahin Amiriparian, Jing Han, Maximilian Schmitt, Alice Baird, Adria Mallol-Ragolta, Manuel Milling, Maurice Gerczuk, Björn W. Schuller:
Synchronization in Interpersonal Speech. Frontiers Robotics AI 6: 116 (2019) - [j122]Simone Hantke, Tobias Olenyi, Christoph Hausner, Tobias Appel, Björn W. Schuller:
Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform. Int. J. Autom. Comput. 16(4): 427-436 (2019) - [j121]Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A. Nicolaou, Athanasios Papaioannou, Guoying Zhao, Björn W. Schuller, Irene Kotsia, Stefanos Zafeiriou:
Deep Affect Prediction in-the-Wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond. Int. J. Comput. Vis. 127(6-7): 907-929 (2019) - [j120]Björn W. Schuller:
Responding to uncertainty in emotion recognition. J. Inf. Commun. Ethics Soc. 17(3): 299-303 (2019) - [j119]Björn W. Schuller:
IEEE Transactions on Affective Computing-On Novelty and Valence. IEEE Trans. Affect. Comput. 10(1): 1-2 (2019) - [j118]Yue Xie, Ruiyu Liang, Zhenlin Liang, Chengwei Huang, Cairong Zou, Björn W. Schuller:
Speech Emotion Classification Using Attention-Based LSTM. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1675-1685 (2019) - [j117]Björn W. Schuller, Lucas Paletta, Peter Robinson, Nicolas Sabouret, Georgios N. Yannakakis:
Guest Editorial Intelligence in Serious Games. IEEE Trans. Games 11(4): 306-310 (2019) - [j116]Erik Marchi, Tadas Baltrusaitis, Andra Adams, Marwa Mahmoud, Ofer Golan, Shimrit Fridenson-Hayo, Shahar Tal, Shai Newman, Noga Meir-Goren, Antonio Camurri, Stefano Piana, Björn W. Schuller, Sven Bölte, T. Metin Sezgin, Nese Alyüz, Agnieszka Rynkiewicz, Aurelie Baranger, Alice Baird, Simon Baron-Cohen, Amandine Lassalle, Helen O'Reilly, Delia Pigat, Peter Robinson, Ian Davies:
The ASC-Inclusion Perceptual Serious Gaming Platform for Autistic Children. IEEE Trans. Games 11(4): 328-339 (2019) - [j115]Xinzhou Xu, Jun Deng, Eduardo Coutinho, Chen Wu, Li Zhao, Björn W. Schuller:
Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition. IEEE Trans. Multim. 21(3): 795-808 (2019) - [j114]Zixing Zhang, Jing Han, Eduardo Coutinho, Björn W. Schuller:
Dynamic Difficulty Awareness Training for Continuous Emotion Prediction. IEEE Trans. Multim. 21(5): 1289-1301 (2019) - [c447]Meishu Song, Zijiang Yang, Alice Baird, Emilia Parada-Cabaleiro, Zixing Zhang, Ziping Zhao, Björn W. Schuller:
Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database. ACII 2019: 517-523 - [c446]Vedhas Pandit, Maximilian Schmitt, Nicholas Cummins, Björn W. Schuller:
I Know How you Feel Now, and Here's why!: Demystifying Time-Continuous High Resolution Text-Based Affect Predictions in the Wild. CBMS 2019: 465-470 - [c445]Georgios Rizos, Konstantin Hemker, Björn W. Schuller:
Augment to Prevent: Short-Text Data Augmentation in Deep Learning for Hate-Speech Classification. CIKM 2019: 991-1000 - [c444]Ognjen (Oggi) Rudovic, Hae Won Park, John Busche, Björn W. Schuller, Cynthia Breazeal, Rosalind W. Picard:
Personalized Estimation of Engagement From Videos Using Active Learning With Deep Reinforcement Learning. CVPR Workshops 2019: 217-226 - [c443]Zhao Ren, Jing Han, Nicholas Cummins, Qiuqiang Kong, Mark D. Plumbley, Björn W. Schuller:
Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data. PDH 2019: 79-83 - [c442]Kun Qian, Hiroyuki Kuromiya, Zixing Zhang, Jinhyuk Kim, Toru Nakamura, Kazuhiro Yoshiuchi, Björn W. Schuller, Yoshiharu Yamamoto:
Teaching Machines to Know Your Depressive State: On Physical Activity in Health and Major Depressive Disorder. EMBC 2019: 3592-3595 - [c441]Christoph Janott, Christian Rohrmeier, Maximilian Schmitt, Werner Hemmert, Björn W. Schuller:
Snoring - An Acoustic Definition. EMBC 2019: 3653-3657 - [c440]Julian Schiele, Fabian Rabe, Maximilian Schmitt, Manuel Glaser, Franziska Häring, Jens O. Brunner, Bernhard Bauer, Björn W. Schuller, Claudia Traidl-Hoffmann, Athanasios Damialis:
Automated Classification of Airborne Pollen using Neural Networks. EMBC 2019: 4474-4478 - [c439]Maximilian Schmitt, Björn W. Schuller:
End-to-end Audio Classification with Small Datasets - Making It Work. EUSIPCO 2019: 1-5 - [c438]Adria Mallol-Ragolta, Maximilian Schmitt, Alice Baird, Nicholas Cummins, Björn W. Schuller:
Performance Analysis of Unimodal and Multimodal Models in Valence-Based Empathy Recognition. FG 2019: 1-5 - [c437]Panagiotis Tzirakis, Mihalis A. Nicolaou, Björn W. Schuller, Stefanos Zafeiriou:
Time-series Clustering with Jointly Learning Deep Representations, Clusters and Temporal Boundaries. FG 2019: 1-5 - [c436]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes. ICASSP 2019: 56-60 - [c435]Georgios Rizos, Björn W. Schuller:
Modelling Sample Informativeness for Deep Affective Computing. ICASSP 2019: 3482-3486 - [c434]Jing Han, Zixing Zhang, Zhao Ren, Björn W. Schuller:
Implicit Fusion by Joint Audiovisual Training for Emotion Recognition in Mono Modality. ICASSP 2019: 5861-5865 - [c433]Lukas Stappen, Nicholas Cummins, Eva-Maria Meßner, Harald Baumeister, Judith Dineley, Björn W. Schuller:
Context Modelling Using Hierarchical Attention Networks for Sentiment and Self-assessed Emotion Detection in Spoken Narratives. ICASSP 2019: 6680-6684 - [c432]Zixing Zhang, Bingwen Wu, Björn W. Schuller:
Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech. ICASSP 2019: 6705-6709 - [c431]Josef Schmid, Mathias Schneider, Alfred Höß, Björn W. Schuller:
A Deep Learning Approach for Location Independent Throughput Prediction. ICCVE 2019: 1-5 - [c430]Ognjen Rudovic, Meiru Zhang, Björn W. Schuller, Rosalind W. Picard:
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach. ICMI 2019: 6-15 - [c429]Najla Al Futaisi, Zixing Zhang, Alejandrina Cristià, Anne S. Warlaumont, Björn W. Schuller:
VCMNet: Weakly Supervised Learning for Automatic Infant Vocalisation Maturity Analysis. ICMI 2019: 205-209 - [c428]Gil Keren, Sivan Sabato, Björn W. Schuller:
A Walkthrough for the Principle of Logit Separation. IJCAI 2019: 6191-6195 - [c427]Shahin Amiriparian, Arsany Awad, Maurice Gerczuk, Lukas Stappen, Alice Baird, Sandra Ottl, Björn W. Schuller:
Audio-based Recognition of Bipolar Disorder Utilising Capsule Networks. IJCNN 2019: 1-7 - [c426]Chao Li, Qian Zhang, Ziping Zhao, Li Gu, Nicholas Cummins, Björn W. Schuller:
Analysing and Inferring of Intimacy Based on fNIRS Signals and Peripheral Physiological Signals. IJCNN 2019: 1-8 - [c425]Ziping Zhao, Zhongtian Bao, Zixing Zhang, Nicholas Cummins, Haishuai Wang, Björn W. Schuller:
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition. INTERSPEECH 2019: 206-210 - [c424]Adria Mallol-Ragolta, Ziping Zhao, Lukas Stappen, Nicholas Cummins, Björn W. Schuller:
A Hierarchical Attention Network-Based Approach for Depression Detection from Transcribed Clinical Interviews. INTERSPEECH 2019: 221-225 - [c423]Alice Baird, Shahin Amiriparian, Nicholas Cummins, Sarah Sturmbauer, Johanna Janson, Eva-Maria Meßner, Harald Baumeister, Nicolas Rohleder, Björn W. Schuller:
Using Speech to Predict Sequentially Measured Cortisol Levels During a Trier Social Stress Test. INTERSPEECH 2019: 534-538 - [c422]Alice Baird, Eduardo Coutinho, Julia Hirschberg, Björn W. Schuller:
Sincerity in Acted Speech: Presenting the Sincere Apology Corpus and Results. INTERSPEECH 2019: 539-543 - [c421]Xinzhou Xu, Jun Deng, Nicholas Cummins, Zixing Zhang, Li Zhao, Björn W. Schuller:
Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition. INTERSPEECH 2019: 949-953 - [c420]Andreas Triantafyllopoulos, Gil Keren, Johannes Wagner, Ingmar Steiner, Björn W. Schuller:
Towards Robust Speech Emotion Recognition Using Deep Residual Networks for Speech Enhancement. INTERSPEECH 2019: 1691-1695 - [c419]Ya'nan Guo, Ziping Zhao, Yide Ma, Björn W. Schuller:
Speech Augmentation via Speaker-Specific Noise in Unseen Environment. INTERSPEECH 2019: 1781-1785 - [c418]Björn W. Schuller, Anton Batliner, Christian Bergler, Florian B. Pokorny, Jarek Krajewski, Margaret Cychosz, Ralf Vollmann, Sonja-Dana Roelen, Sebastian Schnieder, Elika Bergelson, Alejandrina Cristià, Amanda Seidl, Anne S. Warlaumont, Lisa Yankowitz, Elmar Nöth, Shahin Amiriparian, Simone Hantke, Maximilian Schmitt:
The INTERSPEECH 2019 Computational Paralinguistics Challenge: Styrian Dialects, Continuous Sleepiness, Baby Sounds & Orca Activity. INTERSPEECH 2019: 2378-2382 - [c417]Maximilian Schmitt, Nicholas Cummins, Björn W. Schuller:
Continuous Emotion Recognition in Speech - Do We Need Recurrence? INTERSPEECH 2019: 2808-2812 - [c416]Christopher Oates, Andreas Triantafyllopoulos, Ingmar Steiner, Björn W. Schuller:
Robust Speech Emotion Recognition Under Different Encoding Conditions. INTERSPEECH 2019: 3935-3939 - [c415]Kun Qian, Hiroyuki Kuromiya, Zhao Ren, Maximilian Schmitt, Zixing Zhang, Toru Nakamura, Kazuhiro Yoshiuchi, Björn W. Schuller, Yoshiharu Yamamoto:
Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach. ISICDM 2019: 71-75 - [c414]Emilia Parada-Cabaleiro, Anton Batliner, Björn W. Schuller:
A Diplomatic Edition of Il Lauro Secco: Ground Truth for OMR of White Mensural Notation. ISMIR 2019: 557-564 - [c413]Kun Qian, Zhao Ren, Fengquan Dong, Wen-Hsing Lai, Björn W. Schuller, Yoshiharu Yamamoto:
Deep Wavelets for Heart Sound Classification. ISPACS 2019: 1-2 - [c412]Josef Schmid, Mathias Schneider, Alfred Höß, Björn W. Schuller:
A Comparison of AI-Based Throughput Prediction for Cellular Vehicle-To-Server Communication. IWCMC 2019: 471-476 - [c411]Shahin Amiriparian, Maurice Gerczuk, Eduardo Coutinho, Alice Baird, Sandra Ottl, Manuel Milling, Björn W. Schuller:
Emotion and Themes Recognition in Music Utilising Convolutional and Recurrent Neural Networks. MediaEval 2019 - [c410]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Nicholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Meßner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic:
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition. AVEC@MM 2019: 3-12 - [c409]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Nicholas Cummins, Roddy Cowie, Maja Pantic:
AVEC'19: Audio/Visual Emotion Challenge and Workshop. ACM Multimedia 2019: 2718-2719 - [c408]Alice Baird, Shahin Amiriparian, Miriam Berschneider, Maximilian Schmitt, Björn W. Schuller:
Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results. MMSP 2019: 1-5 - [c407]Alice Baird, Shahin Amiriparian, Björn W. Schuller:
Can Deep Generative Audio be Emotional? Towards an Approach for Personalised Emotional Audio Generation. MMSP 2019: 1-5 - [c406]Lukas Stappen, Vincent Karas, Nicholas Cummins, Fabien Ringeval, Klaus R. Scherer, Björn W. Schuller:
From Speech to Facial Activity: Towards Cross-modal Sequence-to-Sequence Attention Networks. MMSP 2019: 1-6 - [p12]Shahin Amiriparian, Maximilian Schmitt, Simone Hantke, Vedhas Pandit, Björn W. Schuller:
Humans Inside: Cooperative Big Multimedia Data Mining. Innovations in Big Data Mining and Embedded Knowledge 2019: 235-257 - [e18]Sharon L. Oviatt, Björn W. Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Krüger:
The Handbook of Multimodal-Multisensor Interfaces: Language Processing, Software, Commercialization, and Emerging Directions - Volume 3. Association for Computing Machinery 2019, ISBN 978-1-970001-75-4 [contents] - [e17]Wen Gao, Helen Mei-Ling Meng, Matthew A. Turk, Susan R. Fussell, Björn W. Schuller, Yale Song, Kai Yu:
International Conference on Multimodal Interaction, ICMI 2019, Suzhou, China, October 14-18, 2019. ACM 2019, ISBN 978-1-4503-6860-5 [contents] - [e16]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Nicholas Cummins, Roddy Cowie, Maja Pantic:
Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop, AVEC@MM 2019, Nice, France, October 21-25, 2019. ACM 2019, ISBN 978-1-4503-6913-8 [contents] - [i45]Jean Kossaifi, Robert Walecki, Yannis Panagakis, Jie Shen, Maximilian Schmitt, Fabien Ringeval, Jing Han, Vedhas Pandit, Björn W. Schuller, Kam Star, Elnar Hajiyev, Maja Pantic:
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild. CoRR abs/1901.02839 (2019) - [i44]Vedhas Pandit, Björn W. Schuller:
On Many-to-Many Mapping Between Concordance Correlation Coefficient and Mean Square Error. CoRR abs/1902.05180 (2019) - [i43]Alice Baird, Simone Hantke, Björn W. Schuller:
Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability. CoRR abs/1903.07171 (2019) - [i42]Thomas Wiest, Nicholas Cummins, Alice Baird, Simone Hantke, Judith Dineley, Björn W. Schuller:
Voice command generation using Progressive Wavegans. CoRR abs/1903.07395 (2019) - [i41]Zixing Zhang, Jing Han, Kun Qian, Christoph Janott, Yanan Guo, Björn W. Schuller:
Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data. CoRR abs/1903.12422 (2019) - [i40]Zixing Zhang, Bingwen Wu, Björn W. Schuller:
Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech. CoRR abs/1903.12424 (2019) - [i39]Panagiotis Tzirakis, Athanasios Papaioannou, Alexander Lattas, Michail Tarasiou, Björn W. Schuller, Stefanos Zafeiriou:
Synthesising 3D Facial Motion from "In-the-Wild" Speech. CoRR abs/1904.07002 (2019) - [i38]Joshua Y. Kim, Chunfeng Liu, Rafael A. Calvo, Kathryn McCabe, Silas C. R. Taylor, Björn W. Schuller, Kaihang Wu:
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech. CoRR abs/1904.12403 (2019) - [i37]Ognjen Rudovic, Meiru Zhang, Björn W. Schuller, Rosalind W. Picard:
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach. CoRR abs/1906.03098 (2019) - [i36]Shuo Liu, Gil Keren, Björn W. Schuller:
Single-Channel Speech Separation with Auxiliary Speaker Embeddings. CoRR abs/1906.09997 (2019) - [i35]Jing Han, Zixing Zhang, Zhao Ren, Björn W. Schuller:
EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings. CoRR abs/1907.10428 (2019) - [i34]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Nicholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Meßner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic:
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition. CoRR abs/1907.11510 (2019) - [i33]Alice Baird, Björn W. Schuller:
Presenting the Acoustic Sounds for Wellbeing Dataset and Baseline Classification Results. CoRR abs/1908.01671 (2019) - [i32]Anton Batliner, Stefan Steidl, Florian Eyben, Björn W. Schuller:
On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction. CoRR abs/1908.11593 (2019) - [i31]Ali Girayhan Özbay, Sylvain Laizet, Panagiotis Tzirakis, Georgios Rizos, Björn W. Schuller:
Poisson CNN: Convolutional Neural Networks for the Solution of the Poisson Equation with Varying Meshes and Dirichlet Boundary Conditions. CoRR abs/1910.08613 (2019) - [i30]Thejan Rajapakshe, Rajib Rana, Siddique Latif, Sara Khalifa, Björn W. Schuller:
Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition. CoRR abs/1910.11256 (2019) - [i29]Shuo Liu, Gil Keren, Björn W. Schuller:
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System. CoRR abs/1911.07062 (2019) - 2018
- [j113]Zixing Zhang, Jing Han, Jun Deng, Xinzhou Xu, Fabien Ringeval, Björn W. Schuller:
Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning. IEEE Access 6: 22196-22209 (2018) - [j112]Simone Hantke, Alexander Abstreiter, Nicholas Cummins, Björn W. Schuller:
Trustability-Based Dynamic Active Learning for Crowdsourced Labelling of Emotional Audio Data. IEEE Access 6: 42142-42155 (2018) - [j111]Gil Keren, Nicholas Cummins, Björn W. Schuller:
Calibrated Prediction Intervals for Neural Network Regressors. IEEE Access 6: 54033-54041 (2018) - [j110]Björn W. Schuller:
Speech emotion recognition: two decades in a nutshell, benchmarks, and ongoing trends. Commun. ACM 61(5): 90-99 (2018) - [j109]Christoph Janott, Maximilian Schmitt, Yue Zhang, Kun Qian, Vedhas Pandit, Zixing Zhang, Clemens Heiser, Winfried Hohenhorst, Michael Herzog, Werner Hemmert, Björn W. Schuller:
Snoring classified: The Munich-Passau Snore Sound Corpus. Comput. Biol. Medicine 94: 106-118 (2018) - [j108]Björn W. Schuller:
What Affective Computing Reveals about Autistic Children's Facial Expressions of Joy or Fear. Computer 51(6): 7-8 (2018) - [j107]Dagmar Schuller, Björn W. Schuller:
The Age of Artificial Emotional Intelligence. Computer 51(9): 38-46 (2018) - [j106]Zhao Ren, Kun Qian, Yebin Wang, Zixing Zhang, Vedhas Pandit, Alice Baird, Björn W. Schuller:
Deep Scalogram Representations for Acoustic Scene Classification. IEEE CAA J. Autom. Sinica 5(3): 662-669 (2018) - [j105]Björn W. Schuller, Yue Zhang, Felix Weninger:
Three recent trends in Paralinguistics on the way to omniscient machine intelligence. J. Multimodal User Interfaces 12(4): 273-283 (2018) - [j104]George Trigeorgis, Mihalis A. Nicolaou, Björn W. Schuller, Stefanos Zafeiriou:
Deep Canonical Time Warping for Simultaneous Alignment and Representation Learning of Sequences. IEEE Trans. Pattern Anal. Mach. Intell. 40(5): 1128-1138 (2018) - [j103]Jaeryoung Lee, Miles Dai, Björn W. Schuller, Rosalind W. Picard:
Personalized machine learning for robot perception of affect and engagement in autism therapy. Sci. Robotics 3(19) (2018) - [j102]Shaoling Jing, Xia Mao, Lijiang Chen, Maria Colomba Comes, Arianna Mencattini, Grazia Raguso, Fabien Ringeval, Björn W. Schuller, Corrado Di Natale, Eugenio Martinelli:
A closed-form solution to the graph total variation problem for continuous emotion profiling in noisy environment. Speech Commun. 104: 66-72 (2018) - [j101]Björn W. Schuller:
Editorial: Transactions on Affective Computing-Good Reasons for Joy and Excitement. IEEE Trans. Affect. Comput. 9(1): 1-2 (2018) - [j100]Florian Lingenfelser, Johannes Wagner, Jun Deng, Raymond Brueckner, Björn W. Schuller, Elisabeth André:
Asynchronous and Event-Based Fusion Systems for Affect Recognition on Naturalistic Data in Comparison to Conventional Approaches. IEEE Trans. Affect. Comput. 9(4): 410-423 (2018) - [j99]Jun Deng, Xinzhou Xu, Zixing Zhang, Sascha Frühholz, Björn W. Schuller:
Semisupervised Autoencoders for Speech Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 31-43 (2018) - [j98]Stefano Squartini, Björn W. Schuller, Aurelio Uncini, Chuan-Kang Ting:
Guest Editorial Special Issue on Computational Intelligence for End-to-End Audio Processing. IEEE Trans. Emerg. Top. Comput. Intell. 2(2): 89-91 (2018) - [j97]Zixing Zhang, Jürgen T. Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Wenyu Jin, Björn W. Schuller:
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments. ACM Trans. Intell. Syst. Technol. 9(5): 49:1-49:28 (2018) - [j96]Paul Buitelaar, Ian D. Wood, Sapna Negi, Mihael Arcan, John P. McCrae, Andrejs Abele, Cécile Robin, Vladimir Andryushechkin, Housam Ziad, Hesam Sagha, Maximilian Schmitt, Björn W. Schuller, J. Fernando Sánchez-Rada, Carlos Angel Iglesias, Carlos Navarro, Andreas Giefer, Nicolaus Heise, Vincenzo Masucci, Francesco A. Danza, Ciro Caterino, Pavel Smrz, Michal Hradis, Filip Povolný, Marek Klimes, Pavel Matejka, Giovanni Tummarello:
MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis. IEEE Trans. Multim. 20(9): 2454-2465 (2018) - [j95]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Jonathan Gratch, Roddy Cowie, Maja Pantic:
Introduction to the Special Section on Multimedia Computing and Applications of Socio-Affective Behaviors in the Wild. ACM Trans. Multim. Comput. Commun. Appl. 14(1s): 25:1-25:2 (2018) - [c405]Zhao Ren, Nicholas Cummins, Jing Han, Sebastian Schnieder, Jarek Krajewski, Björn W. Schuller:
Evaluation of the Pain Level from Speech: Introducing a Novel Pain Database and Benchmarks. ITG Symposium on Speech Communication 2018: 1-5 - [c404]Alice Baird, Emilia Parada-Cabaleiro, Cameron Fraser, Simone Hantke, Björn W. Schuller:
The Perceived Emotion of Isolated Synthetic Audio: The EmoSynth Dataset and Results. Audio Mostly Conference 2018: 7:1-7:8 - [c403]Zhao Ren, Qiuqiang Kong, Kun Qian, Mark D. Plumbley, Björn W. Schuller:
Attention-based convolutional neural networks for acoustic scene classification. DCASE 2018: 39-43 - [c402]Zhao Ren, Nicholas Cummins, Vedhas Pandit, Jing Han, Kun Qian, Björn W. Schuller:
Learning Image-based Representations for Heart Sound Classification. DH 2018: 143-147 - [c401]Gerhard Hagerer, Nicholas Cummins, Florian Eyben, Björn W. Schuller:
Robust Laughter Detection for Wearable Wellbeing Sensing. DH 2018: 156-157 - [c400]Fatih Demir, Abdulkadir Sengür, Nicholas Cummins, Shahin Amiriparian, Björn W. Schuller:
Low Level Texture Features for Snore Sound Discrimination. EMBC 2018: 413-416 - [c399]Shahin Amiriparian, Maximilian Schmitt, Nicholas Cummins, Kun Qian, Fengquan Dong, Björn W. Schuller:
Deep Unsupervised Representation Learning for Abnormal Heart Sound Classification. EMBC 2018: 4776-4779 - [c398]Shahin Arniriparian, Michael Freitag, Nicholas Cummins, Maurice Gerczuk, Sergey Pugachevskiy, Björn W. Schuller:
A Fusion of Deep Convolutional Generative Adversarial Networks and Sequence to Sequence Autoencoders for Acoustic Scene Classification. EUSIPCO 2018: 977-981 - [c397]Jianhong Wang, Harald Strömfelt, Björn W. Schuller:
A Cnn-Gru Approach to Capture Time-Frequency Pattern Interdependence for Snore Sound Classification. EUSIPCO 2018: 997-1001 - [c396]Nicholas Cummins, Shahin Amiriparian, Sandra Ottl, Maurice Gerczuk, Maximilian Schmitt, Björn W. Schuller:
Multimodal Bag-of-Words for Cross Domains Sentiment Analysis. ICASSP 2018: 4954-4958 - [c395]Panagiotis Tzirakis, Jiehao Zhang, Björn W. Schuller:
End-to-End Speech Emotion Recognition Using Deep Neural Networks. ICASSP 2018: 5089-5093 - [c394]Simone Hantke, Nicholas Cummins, Björn W. Schuller:
What is my Dog Trying to Tell Me? the Automatic Recognition of the Context and Perceived Emotion of Dog Barks. ICASSP 2018: 5134-5138 - [c393]Jing Han, Zixing Zhang, Zhao Ren, Fabien Ringeval, Björn W. Schuller:
Towards Conditional Adversarial Training for Predicting Emotions from Speech. ICASSP 2018: 6822-6826 - [c392]Simone Hantke, Christian Cohrs, Maximilian Schmitt, Benjamin Tannert, Florian Lütkebohmert, Mathias Detmers, Heidi Schelhowe, Björn W. Schuller:
Introducing an Emotion-Driven Assistance System for Cognitively Impaired Individuals. ICCHP (1) 2018: 486-494 - [c391]Gil Keren, Sivan Sabato, Björn W. Schuller:
Fast Single-Class Classification and the Principle of Logit Separation. ICDM 2018: 227-236 - [c390]Simone Hantke, Maximilian Schmitt, Panagiotis Tzirakis, Björn W. Schuller:
EAT -: The ICMI 2018 Eating Analysis and Tracking Challenge. ICMI 2018: 559-563 - [c389]Ya'nan Guo, Jing Han, Zixing Zhang, Björn W. Schuller, Yide Ma:
Exploring A New Method for Food Likability Rating Based on DT-CWT Theory. ICMI 2018: 569-573 - [c388]Benjamin Sertolli, Nicholas Cummins, Abdulkadir Sengür, Björn W. Schuller:
Deep End-to-End Representation Learning for Food Type Recognition from Speech. ICMI 2018: 574-578 - [c387]Hans-Jörg Vögel, Christian Süß, Thomas Hubregtsen, Elisabeth André, Björn W. Schuller, Jérôme Härri, Jörg Conradt, Asaf Adi, Alexander Zadorojniy, Jacques M. B. Terken, Jonas Beskow, Ann Morrison, Kynan Eng, Florian Eyben, Samer Al Moubayed, Susanne Muller, Nicholas Cummins, Viviane S. Ghaderi, Ronee Chadowitz, Raphaël Troncy, Benoit Huet, Melek Önen, Adlen Ksentini:
Emotion-Awareness for Intelligent Vehicle Assistants: A Research Agenda. SEFAIAS@ICSE 2018: 11-15 - [c386]Sicheng Zhao, Guiguang Ding, Qingming Huang, Tat-Seng Chua, Björn W. Schuller, Kurt Keutzer:
Affective Image Content Analysis: A Comprehensive Survey. IJCAI 2018: 5534-5541 - [c385]Shahin Amiriparian, Maurice Gerczuk, Sandra Ottl, Nicholas Cummins, Sergey Pugachevskiy, Björn W. Schuller:
Bag-of-Deep-Features: Noise-Robust Deep Feature Representations for Audio Analysis. IJCNN 2018: 1-7 - [c384]Siyang Song, Shuimei Zhang, Björn W. Schuller, Linlin Shen, Michel F. Valstar:
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification. IJCNN 2018: 1-8 - [c383]Björn W. Schuller, Stefan Steidl, Anton Batliner, Peter B. Marschik, Harald Baumeister, Fengquan Dong, Simone Hantke, Florian B. Pokorny, Eva-Maria Rathner, Katrin D. Bartl-Pokorny, Christa Einspieler, Dajie Zhang, Alice Baird, Shahin Amiriparian, Kun Qian, Zhao Ren, Maximilian Schmitt, Panagiotis Tzirakis, Stefanos Zafeiriou:
The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats. INTERSPEECH 2018: 122-126 - [c382]Zixing Zhang, Jing Han, Kun Qian, Björn W. Schuller:
Evolving Learning for Analysing Mood-Related Infant Vocalisation. INTERSPEECH 2018: 142-146 - [c381]Eva-Maria Rathner, Yannik Terhorst, Nicholas Cummins, Björn W. Schuller, Harald Baumeister:
State of Mind: Classification through Self-reported Affect and Word Use in Speech. INTERSPEECH 2018: 267-271 - [c380]Wenjing Han, Huabin Ruan, Xiaomin Chen, Zhixiang Wang, Haifeng Li, Björn W. Schuller:
Towards Temporal Modelling of Categorical Speech Emotion Recognition. INTERSPEECH 2018: 932-936 - [c379]Shahin Amiriparian, Alice Baird, Sahib Julka, Alyssa Alcorn, Sandra Ottl, Suncica Petrovic, Eloise Ainger, Nicholas Cummins, Björn W. Schuller:
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks. INTERSPEECH 2018: 2334-2338 - [c378]Zixing Zhang, Alejandrina Cristià, Anne S. Warlaumont, Björn W. Schuller:
Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations. INTERSPEECH 2018: 2588-2592 - [c377]Alice Baird, Emilia Parada-Cabaleiro, Simone Hantke, Felix Burkhardt, Nicholas Cummins, Björn W. Schuller:
The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech. INTERSPEECH 2018: 2863-2867 - [c376]Jing Han, Zixing Zhang, Maximilian Schmitt, Zhao Ren, Fabien Ringeval, Björn W. Schuller:
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech. INTERSPEECH 2018: 3082-3086 - [c375]Eva-Maria Rathner, Julia Djamali, Yannik Terhorst, Björn W. Schuller, Nicholas Cummins, Gudrun Salamon, Christina Hunger-Schoppe, Harald Baumeister:
How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives. INTERSPEECH 2018: 3388-3392 - [c374]Simone Hantke, Christoph Stemp, Björn W. Schuller:
Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis. INTERSPEECH 2018: 3504-3508 - [c373]Emilia Parada-Cabaleiro, Giovanni Costantini, Anton Batliner, Alice Baird, Björn W. Schuller:
Categorical vs Dimensional Perception of Italian Emotional Speech. INTERSPEECH 2018: 3638-3642 - [c372]Ognjen Rudovic, Yuria Utsumi, Jaeryoung Lee, Javier Hernandez, Eduardo Castelló Ferrer, Björn W. Schuller, Rosalind W. Picard:
CultureNet: A Deep Learning Approach for Engagement Intensity Estimation from Face Images of Children with Autism. IROS 2018: 339-346 - [c371]Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Simone Hantke, Giovanni Costantini, Klaus R. Scherer, Björn W. Schuller:
Identifying Emotions in Opera Singing: Implications of Adverse Acoustic Conditions. ISMIR 2018: 376-382 - [c370]Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Björn W. Schuller:
Musical-Linguistic Annotations of Il Lauro Secco. ISMIR 2018: 461-467 - [c369]Josef Schmid, Philipp Hes, Alfred Höß, Björn W. Schuller:
Passive monitoring and geo-based prediction of mobile network vehicle-to-server communication. IWCMC 2018: 1483-1488 - [c368]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Heysem Kaya, Maximilian Schmitt, Shahin Amiriparian, Nicholas Cummins, Denis Lalanne, Adrien Michaud, Elvan Çiftçi, Hüseyin Güleç, Albert Ali Salah, Maja Pantic:
AVEC 2018 Workshop and Challenge: Bipolar Disorder and Cross-Cultural Affect Recognition. AVEC@MM 2018: 3-13 - [c367]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Maja Pantic:
Summary for AVEC 2018: Bipolar Disorder and Cross-Cultural Affect Recognition. ACM Multimedia 2018: 2111-2112 - [c366]Dong-Yan Huang, Sicheng Zhao, Björn W. Schuller, Hongxun Yao, Jianhua Tao, Min Xu, Lei Xie, Qingming Huang, Jie Yang:
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop. ACM Multimedia 2018: 2120-2121 - [c365]Björn W. Schuller:
State of Mind Sensing from Speech: State of Matters and What Matters. SMM 2018 - [c364]Jing Han, Maximilian Schmitt, Björn W. Schuller:
You Sound Like Your Counterpart: Interpersonal Speech Analysis. SPECOM 2018: 188-197 - [c363]Vedhas Pandit, Maximilian Schmitt, Nicholas Cummins, Franz Graf, Lucas Paletta, Björn W. Schuller:
How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers. SPECOM 2018: 490-500 - [c362]Florian Jomrich, Josef Schmid, Steffen Knapp, Alfred Höß, Ralf Steinmetz, Björn W. Schuller:
Analysing communication requirements for crowd sourced backend generation of HD Maps used in automated driving. VNC 2018: 1-8 - [p11]Gil Keren, Amr El-Desoky Mousa, Olivier Pietquin, Stefanos Zafeiriou, Björn W. Schuller:
Deep learning for multisensorial and multimodal interaction. The Handbook of Multimodal-Multisensor Interfaces, Volume 2 (2) 2018: 99-128 - [p10]Björn W. Schuller:
Multimodal user state and trait recognition: an overview. The Handbook of Multimodal-Multisensor Interfaces, Volume 2 (2) 2018: 129-165 - [p9]Samy Bengio, Li Deng, Louis-Philippe Morency, Björn W. Schuller:
Perspectives on predictive power of multimodal deep learning: surprises and future directions. The Handbook of Multimodal-Multisensor Interfaces, Volume 2 (2) 2018: 455-472 - [e15]Sharon L. Oviatt, Björn W. Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Krüger:
The Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations - Volume 2. Association for Computing Machinery 2018, ISBN 978-1-970001-71-6 [contents] - [e14]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Maja Pantic:
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, AVEC@MM 2018, Seoul, Republic of Korea, October 22, 2018. ACM 2018, ISBN 978-1-4503-5983-2 [contents] - [i28]Gil Keren, Maximilian Schmitt, Thomas Kehrenberg, Björn W. Schuller:
Weakly Supervised One-Shot Detection with Attention Siamese Networks. CoRR abs/1801.03329 (2018) - [i27]Panagiotis Tzirakis, Stefanos Zafeiriou, Björn W. Schuller:
End2You - The Imperial Toolkit for Multimodal Profiling by End-to-End Learning. CoRR abs/1802.01115 (2018) - [i26]Johannes Wagner, Tobias Baur, Yue Zhang, Michel F. Valstar, Björn W. Schuller, Elisabeth André:
Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora. CoRR abs/1802.02565 (2018) - [i25]Gil Keren, Nicholas Cummins, Björn W. Schuller:
Calibrated Prediction Intervals for Neural Network Regressors. CoRR abs/1803.09546 (2018) - [i24]Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A. Nicolaou, Athanasios Papaioannou, Guoying Zhao, Björn W. Schuller, Irene Kotsia, Stefanos Zafeiriou:
Deep Affect Prediction in-the-wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond. CoRR abs/1804.10938 (2018) - [i23]Andreas Triantafyllopoulos, Hesam Sagha, Florian Eyben, Björn W. Schuller:
audEERING's approach to the One-Minute-Gradual Emotion Challenge. CoRR abs/1805.01222 (2018) - [i22]Siyang Song, Shuimei Zhang, Björn W. Schuller, Linlin Shen, Michel F. Valstar:
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification. CoRR abs/1805.01259 (2018) - [i21]Jing Han, Zixing Zhang, Nicholas Cummins, Björn W. Schuller:
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives. CoRR abs/1809.08927 (2018) - [i20]Zixing Zhang, Jing Han, Eduardo Coutinho, Björn W. Schuller:
Dynamic Difficulty Awareness Training for Continuous Emotion Prediction. CoRR abs/1810.05507 (2018) - [i19]Gil Keren, Jing Han, Björn W. Schuller:
Scaling Speech Enhancement in Unseen Environments with Noise Embeddings. CoRR abs/1810.12757 (2018) - 2017
- [j94]Jun Deng, Sascha Frühholz, Zixing Zhang, Björn W. Schuller:
Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning. IEEE Access 5: 5235-5246 (2017) - [j93]Erik Marchi, Fabio Vesperini, Stefano Squartini, Björn W. Schuller:
Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection. Comput. Intell. Neurosci. 2017: 4694860:1-4694860:14 (2017) - [j92]Vedhas Pandit, Björn W. Schuller:
A Novel Graphical Technique for Combinational Logic Representation and Optimization. Complex. 2017: 9696342:1-9696342:12 (2017) - [j91]Björn W. Schuller:
Can Affective Computing Save Lives? Meet Mobile Health. Computer 50(5): 13 (2017) - [j90]Ognjen Rudovic, Jaeryoung Lee, Lea Mascarell-Maricic, Björn W. Schuller, Rosalind W. Picard:
Measuring Engagement in Robot-Assisted Autism Therapy: A Cross-Cultural Study. Frontiers Robotics AI 4: 36 (2017) - [j89]Mohammad Soleymani, Björn W. Schuller, Shih-Fu Chang:
Guest editorial: Multimodal sentiment analysis and mining in the wild. Image Vis. Comput. 65: 1-2 (2017) - [j88]Mohammad Soleymani, David García, Brendan Jou, Björn W. Schuller, Shih-Fu Chang, Maja Pantic:
A survey of multimodal sentiment analysis. Image Vis. Comput. 65: 3-14 (2017) - [j87]Jing Han, Zixing Zhang, Nicholas Cummins, Fabien Ringeval, Björn W. Schuller:
Strength modelling for real-worldautomatic continuous affect recognition from audiovisual signals. Image Vis. Comput. 65: 76-86 (2017) - [j86]Maximilian Schmitt, Björn W. Schuller:
openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit. J. Mach. Learn. Res. 18: 96:1-96:5 (2017) - [j85]Michael Freitag, Shahin Amiriparian, Sergey Pugachevskiy, Nicholas Cummins, Björn W. Schuller:
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks. J. Mach. Learn. Res. 18: 173:1-173:5 (2017) - [j84]Panagiotis Tzirakis, George Trigeorgis, Mihalis A. Nicolaou, Björn W. Schuller, Stefanos Zafeiriou:
End-to-End Multimodal Emotion Recognition Using Deep Neural Networks. IEEE J. Sel. Top. Signal Process. 11(8): 1301-1309 (2017) - [j83]George Trigeorgis, Konstantinos Bousmalis, Stefanos Zafeiriou, Björn W. Schuller:
A Deep Matrix Factorization Method for Learning Attribute Representations. IEEE Trans. Pattern Anal. Mach. Intell. 39(3): 417-429 (2017) - [j82]Jun Deng, Xinzhou Xu, Zixing Zhang, Sascha Frühholz, Björn W. Schuller:
Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition. IEEE Signal Process. Lett. 24(4): 500-504 (2017) - [j81]Zixing Zhang, Nicholas Cummins, Björn W. Schuller:
Advanced Data Exploitation in Speech Analysis: An overview. IEEE Signal Process. Mag. 34(4): 107-129 (2017) - [j80]Björn W. Schuller:
Editorial: IEEE Transactions on Affective Computing - Challenges and Chances. IEEE Trans. Affect. Comput. 8(1): 1-2 (2017) - [j79]Arianna Mencattini, Eugenio Martinelli, Fabien Ringeval, Björn W. Schuller, Corrado Di Natale:
Continuous Estimation of Emotions in Speech by Dynamic Cooperative Speaker Models. IEEE Trans. Affect. Comput. 8(3): 314-327 (2017) - [j78]Xinzhou Xu, Jun Deng, Nicholas Cummins, Zixing Zhang, Chen Wu, Li Zhao, Björn W. Schuller:
A Two-Dimensional Framework of Multiple Kernel Subspace Learning for Recognizing Emotion in Speech. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1436-1449 (2017) - [j77]Kun Qian, Christoph Janott, Vedhas Pandit, Zixing Zhang, Clemens Heiser, Winfried Hohenhorst, Michael Herzog, Werner Hemmert, Björn W. Schuller:
Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multifeature Analysis. IEEE Trans. Biomed. Eng. 64(8): 1731-1741 (2017) - [j76]Hesam Sagha, Nicholas Cummins, Björn W. Schuller:
Stacked denoising autoencoders for sentiment analysis: a review. WIREs Data Mining Knowl. Discov. 7(5) (2017) - [c361]Gil Keren, Sivan Sabato, Björn W. Schuller:
Tunable Sensitivity to Large Errors in Neural Network Training. AAAI 2017: 2087-2093 - [c360]Jun Deng, Florian Eyben, Björn W. Schuller, Felix Burkhardt:
Deep neural networks for anger detection from real life speech data. ACII Workshops 2017: 1-6 - [c359]Shahin Amiriparian, Nicholas Cummins, Sandra Ottl, Maurice Gerczuk, Björn W. Schuller:
Sentiment analysis using image-based deep spectrum features. ACII Workshops 2017: 26-29 - [c358]Shahin Amiriparian, Michael Freitag, Nicholas Cummins, Björn W. Schuller:
Feature selection in multimodal continuous emotion prediction. ACII Workshops 2017: 30-37 - [c357]Gerhard Hagerer, Florian Eyben, Dagmar Schuller, Klaus R. Scherer, Björn W. Schuller:
VoicePlay - An affective sports game operated by speech emotion recognition based on the component process model. ACII Workshops 2017: 74-76 - [c356]Hesam Sagha, Jun Deng, Björn W. Schuller:
The effect of personality trait, age, and gender on the performance of automatic speech valence recognition. ACII 2017: 86-91 - [c355]J. Fernando Sánchez-Rada, Carlos Angel Iglesias, Hesam Sagha, Björn W. Schuller, Ian D. Wood, Paul Buitelaar:
Multimodal multimodel emotion analysis as linked data. ACII Workshops 2017: 111-116 - [c354]Harald Strömfelt, Yue Zhang, Björn W. Schuller:
Emotion-augmented machine learning: Overview of an emerging domain. ACII 2017: 305-312 - [c353]Shahin Amiriparian, Sergey Pugachevskiy, Nicholas Cummins, Simone Hantke, Jouni Pohjalainen, Gil Keren, Björn W. Schuller:
CAST a database: Rapid targeted large-scale big data acquisition via small-world modelling of social media platforms. ACII 2017: 340-345 - [c352]Nicholas Cummins, Bogdan Vlasenko, Hesam Sagha, Björn W. Schuller:
Enhancing Speech-Based Depression Detection Through Gender Dependent Vowel-Level Formant Features. AIME 2017: 209-214 - [c351]Alice Baird, Stina Hasse Jørgensen, Emilia Parada-Cabaleiro, Simone Hantke, Nicholas Cummins, Björn W. Schuller:
Perception of Paralinguistic Traits in Synthesized Voices. Audio Mostly Conference 2017: 17:1-17:5 - [c350]Björn W. Schuller:
Reading the Author and Speaker: Towards a Holistic and Deep Approach on Automatic Assessment of What is in One's Words. CICLing (2) 2017: 275-288 - [c349]Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Björn W. Schuller, Maja Pantic:
Deep Structured Learning for Facial Action Unit Intensity Estimation. CVPR 2017: 5709-5718 - [c348]Shahin Amiriparian, Michael Freitag, Nicholas Cummins, Björn W. Schuller:
Sequence to Sequence Autoencoders for Unsupervised Representation Learning from Audio. DCASE 2017: 17-21 - [c347]Kun Qian, Zhao Ren, Vedhas Pandit, Zijiang Yang, Zixing Zhang, Björn W. Schuller:
Wavelets Revisited for the Classification of Acoustic Scenes. DCASE 2017: 108-112 - [c346]Zhao Ren, Vedhas Pandit, Kun Qian, Zijiang Yang, Zixing Zhang, Björn W. Schuller:
Deep Sequential Image Features on Acoustic Scene Classification. DCASE 2017: 113-117 - [c345]Amr El-Desoky Mousa, Björn W. Schuller:
Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis. EACL (1) 2017: 1023-1032 - [c344]Jun Deng, Nicholas Cummins, Maximilian Schmitt, Kun Qian, Fabien Ringeval, Björn W. Schuller:
Speech-based Diagnosis of Autism Spectrum Condition by Generative Adversarial Network Representations. DH 2017: 53-57 - [c343]Kun Qian, Christoph Janott, Jun Deng, Clemens Heiser, Winfried Hohenhorst, Michael Herzog, Nicholas Cummins, Björn W. Schuller:
Snore sound recognition: On wavelets and classifiers from deep nets to kernels. EMBC 2017: 3737-3740 - [c342]Nicholas Cummins, Maximilian Schmitt, Shahin Amiriparian, Jarek Krajewski, Björn W. Schuller:
"You sound ill, take the day off": Automatic recognition of speech affected by upper respiratory tract infection. EMBC 2017: 3806-3809 - [c341]Tobias Geib, Maximilian Schmitt, Björn W. Schuller:
Automatic Guitar String Detection by String-Inverse Frequency Estimation. GI-Jahrestagung 2017: 127-138 - [c340]Maximilian Schmitt, Björn W. Schuller:
Recognising Guitar Effects - Which Acoustic Features Really Matter? GI-Jahrestagung 2017: 177-190 - [c339]Felix Burkhardt, Benjamin Weiss, Florian Eyben, Jun Deng, Björn W. Schuller:
Detecting Vocal Irony. GSCL 2017: 11-22 - [c338]Christian Kohlschein, Maximilian Schmitt, Björn W. Schuller, Sabina Jeschke, Cornelius J. Werner:
A machine learning based system for the automatic evaluation of aphasia speech. Healthcom 2017: 1-6 - [c337]Jing Han, Zixing Zhang, Fabien Ringeval, Björn W. Schuller:
Reconstruction-error-based learning for continuous emotion recognition in speech. ICASSP 2017: 2367-2371 - [c336]Yue Zhang, Yifan Liu, Felix Weninger, Björn W. Schuller:
Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations. ICASSP 2017: 4990-4994 - [c335]Jing Han, Zixing Zhang, Fabien Ringeval, Björn W. Schuller:
Prediction-based learning for continuous emotion recognition in speech. ICASSP 2017: 5005-5009 - [c334]Florian Eyben, Matthias Unfried, Gerhard Hagerer, Björn W. Schuller:
Automatic multi-lingual arousal detection from voice applied to real product testing applications. ICASSP 2017: 5155-5159 - [c333]Dieu Linh Tran, Robert Walecki, Ognjen Rudovic, Stefanos Eleftheriadis, Björn W. Schuller, Maja Pantic:
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding. ICCV 2017: 3209-3218 - [c332]Gil Keren, Tobias Kirschstein, Erik Marchi, Fabien Ringeval, Björn W. Schuller:
End-to-end learning for dimensional emotion recognition from physiological signals. ICME 2017: 985-990 - [c331]Emilia Parada-Cabaleiro, Alice Baird, Nicholas Cummins, Björn W. Schuller:
Stimulation of psychological listener experiences by semi-automatically composed electroacoustic environments. ICME 2017: 1051-1056 - [c330]Björn W. Schuller:
Keynote Lecture 1: NLP in Tomorrow's Profiling - Words May Fail You. ICON 2017: 1 - [c329]Zixing Zhang, Felix Weninger, Martin Wöllmer, Jing Han, Björn W. Schuller:
Towards intoxicated speech recognition. IJCNN 2017: 1555-1559 - [c328]Johanna Bohm, Florian Eyben, Maximilian Schmitt, Harald Kosch, Björn W. Schuller:
Seeking the SuperStar: Automatic assessment of perceived singing quality. IJCNN 2017: 1560-1569 - [c327]Romain Sabathe, Eduardo Coutinho, Björn W. Schuller:
Deep recurrent music writer: Memory-enhanced variational autoencoder-based musical score composition and an objective measure. IJCNN 2017: 3467-3474 - [c326]Florian B. Pokorny, Björn W. Schuller, Peter B. Marschik, Raymond Brueckner, Pär Nyström, Nicholas Cummins, Sven Bölte, Christa Einspieler, Terje Falck-Ytter:
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach. INTERSPEECH 2017: 309-313 - [c325]Alice Baird, Shahin Amiriparian, Nicholas Cummins, Alyssa M. Alcorn, Anton Batliner, Sergey Pugachevskiy, Michael Freitag, Maurice Gerczuk, Björn W. Schuller:
Automatic Classification of Autistic Child Vocalisations: A Novel Database and Results. INTERSPEECH 2017: 849-853 - [c324]Gerhard Hagerer, Nicholas Cummins, Florian Eyben, Björn W. Schuller:
"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers. INTERSPEECH 2017: 2044-2045 - [c323]Raymond Brueckner, Maximilian Schmitt, Maja Pantic, Björn W. Schuller:
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective. INTERSPEECH 2017: 2371-2375 - [c322]Simone Hantke, Hesam Sagha, Nicholas Cummins, Björn W. Schuller:
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings. INTERSPEECH 2017: 3137-3141 - [c321]Yue Zhang, Felix Weninger, Björn W. Schuller:
Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation. INTERSPEECH 2017: 3152-3156 - [c320]Emilia Parada-Cabaleiro, Alice Baird, Anton Batliner, Nicholas Cummins, Simone Hantke, Björn W. Schuller:
The Perception of Emotions in Noisified Nonsense Speech. INTERSPEECH 2017: 3246-3250 - [c319]Björn W. Schuller, Anton Batliner:
Discussion. INTERSPEECH 2017 - [c318]Bogdan Vlasenko, Hesam Sagha, Nicholas Cummins, Björn W. Schuller:
Implementing Gender-Dependent Vowel-Level Analysis for Boosting Speech-Based Depression Recognition. INTERSPEECH 2017: 3266-3270 - [c317]Björn W. Schuller, Stefan Steidl, Anton Batliner, Elika Bergelson, Jarek Krajewski, Christoph Janott, Andrei Amatuni, Marisa Casillas, Amanda Seidl, Melanie Soderstrom, Anne S. Warlaumont, Guillermo Hidalgo, Sebastian Schnieder, Clemens Heiser, Winfried Hohenhorst, Michael Herzog, Maximilian Schmitt, Kun Qian, Yue Zhang, George Trigeorgis, Panagiotis Tzirakis, Stefanos Zafeiriou:
The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, Cold & Snoring. INTERSPEECH 2017: 3442-3446 - [c316]Michael Freitag, Shahin Amiriparian, Nicholas Cummins, Maurice Gerczuk, Björn W. Schuller:
An 'End-to-Evolution' Hybrid Approach for Snore Sound Classification. INTERSPEECH 2017: 3507-3511 - [c315]Shahin Amiriparian, Maurice Gerczuk, Sandra Ottl, Nicholas Cummins, Michael Freitag, Sergey Pugachevskiy, Alice Baird, Björn W. Schuller:
Snore Sound Classification Using Image-Based Deep Spectrum Features. INTERSPEECH 2017: 3512-3516 - [c314]Simone Hantke, Zixing Zhang, Björn W. Schuller:
Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World. INTERSPEECH 2017: 3951-3955 - [c313]Emilia Parada-Cabaleiro, Alice Baird, Anton Batliner, Nicholas Cummins, Simone Hantke, Björn W. Schuller:
The Perception of Emotion in the Singing Voice: The Understanding of Music Mood for Music Organisation. DLfM 2017: 29-36 - [c312]Emilia Parada-Cabaleiro, Anton Batliner, Alice Baird, Björn W. Schuller:
The SEILS Dataset: Symbolically Encoded Scores in Modern-Early Notation for Computational Musicology. ISMIR 2017: 575-581 - [c311]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Jonathan Gratch, Roddy Cowie, Stefan Scherer, Sharon Mozgai, Nicholas Cummins, Maximilian Schmitt, Maja Pantic:
AVEC 2017: Real-life Depression, and Affect Recognition Workshop and Challenge. AVEC@ACM Multimedia 2017: 3-9 - [c310]Yue Zhang, Felix Weninger, Boqing Liu, Maximilian Schmitt, Florian Eyben, Björn W. Schuller:
A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits. ACM Multimedia 2017: 387-392 - [c309]Nicholas Cummins, Shahin Amiriparian, Gerhard Hagerer, Anton Batliner, Stefan Steidl, Björn W. Schuller:
An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional Speech. ACM Multimedia 2017: 478-484 - [c308]Jing Han, Zixing Zhang, Maximilian Schmitt, Maja Pantic, Björn W. Schuller:
From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty. ACM Multimedia 2017: 890-897 - [c307]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Jonathan Gratch, Roddy Cowie, Maja Pantic:
Summary for AVEC 2017: Real-life Depression and Affect Challenge and Workshop. ACM Multimedia 2017: 1963-1964 - [c306]Gerhard Hagerer, Vedhas Pandit, Florian Eyben, Björn W. Schuller:
Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data. Semantic Audio 2017 - [c305]Björn W. Schuller:
Big Data, Deep Learning - At the Edge of X-Ray Speaker Analysis. SPECOM 2017: 20-34 - [c304]Björn W. Schuller:
Automatic speaker analysis 2.0: Hearing the bigger picture. SpeD 2017: 1-6 - [p8]Klaus R. Scherer, Björn W. Schuller, Aaron Elkins:
Computational Analysis of Vocal Expression of Affect: Trends and Challenges. Social Signal Processing 2017: 56-68 - [p7]Hatice Gunes, Björn W. Schuller:
Automatic Analysis of Aesthetics: Human Beauty, Attractiveness, and Likability. Social Signal Processing 2017: 183-201 - [p6]Hatice Gunes, Björn W. Schuller:
Automatic Analysis of Social Emotions. Social Signal Processing 2017: 213-224 - [p5]Björn W. Schuller:
Acquisition of Affect. Emotions and Personality in Personalized Services 2017: 57-80 - [e13]Sharon L. Oviatt, Björn W. Schuller, Philip R. Cohen, Daniel Sonntag, Gerasimos Potamianos, Antonio Krüger:
The Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations - Volume 1. ACM 2017, ISBN 978-1-970001-67-9 [contents] - [e12]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Jonathan Gratch, Roddy Cowie, Maja Pantic:
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23 - 27, 2017. ACM 2017, ISBN 978-1-4503-5502-5 [contents] - [i18]Dieu Linh Tran, Robert Walecki, Ognjen Rudovic, Stefanos Eleftheriadis, Björn W. Schuller, Maja Pantic:
DeepCoder: Semi-parametric Variational Autoencoders for Facial Action Unit Intensity Estimation. CoRR abs/1704.02206 (2017) - [i17]Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Björn W. Schuller, Maja Pantic:
Deep Structured Learning for Facial Action Unit Intensity Estimation. CoRR abs/1704.04481 (2017) - [i16]Panagiotis Tzirakis, George Trigeorgis, Mihalis A. Nicolaou, Björn W. Schuller, Stefanos Zafeiriou:
End-to-End Multimodal Emotion Recognition using Deep Neural Networks. CoRR abs/1704.08619 (2017) - [i15]Gil Keren, Sivan Sabato, Björn W. Schuller:
Fast Single-Class Classification and the Principle of Logit Separation. CoRR abs/1705.10246 (2017) - [i14]Zixing Zhang, Jürgen T. Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Björn W. Schuller:
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments. CoRR abs/1705.10874 (2017) - [i13]Zixing Zhang, Ding Liu, Jing Han, Björn W. Schuller:
Learning Audio Sequence Representations for Acoustic Event Classification. CoRR abs/1707.08729 (2017) - [i12]Michael Freitag, Shahin Amiriparian, Sergey Pugachevskiy, Nicholas Cummins, Björn W. Schuller:
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks. CoRR abs/1712.04382 (2017) - 2016
- [j75]Jun Deng, Xinzhou Xu, Zixing Zhang, Sascha Frühholz, Björn W. Schuller:
Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition. IEEE Access 4: 4299-4309 (2016) - [j74]Sascha Frühholz, Erik Marchi, Björn W. Schuller:
The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations. IEEE Access 4: 6059-6072 (2016) - [j73]Björn W. Schuller, Jian Pei:
Using Computer Intelligence for Depression Diagnosis and Crowdsourcing. Computer 49(7): 8-9 (2016) - [j72]Hesam Sagha, Feipeng Li, Ehsan Variani, José del R. Millán, Ricardo Chavarriaga, Björn W. Schuller:
Stream fusion for multi-stream automatic speech recognition. Int. J. Speech Technol. 19(4): 669-675 (2016) - [j71]Erik Cambria, Björn W. Schuller, Yunqing Xia, Bebo White:
New avenues in knowledge bases for natural language processing. Knowl. Based Syst. 108: 1-4 (2016) - [j70]Björn W. Schuller:
Editorial: Transactions on Affective Computing - Changes and Continuance. IEEE Trans. Affect. Comput. 7(1): 1-2 (2016) - [j69]Florian Eyben, Klaus R. Scherer, Björn W. Schuller, Johan Sundberg, Elisabeth André, Carlos Busso, Laurence Y. Devillers, Julien Epps, Petri Laukka, Shrikanth S. Narayanan, Khiet P. Truong:
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing. IEEE Trans. Affect. Comput. 7(2): 190-202 (2016) - [j68]Florian Groß, Justus Jordan, Felix Weninger, Felix Klanner, Björn W. Schuller:
Route and Stopping Intent Prediction at Intersections From Car Fleet Data. IEEE Trans. Intell. Veh. 1(2): 177-186 (2016) - [c303]Maximilian Schmitt, Christoph Janott, Vedhas Pandit, Kun Qian, Clemens Heiser, Werner Hemmert, Björn W. Schuller:
A Bag-of-Audio-Words Approach for Snore Sounds' Excitation Localisation. ITG Symposium on Speech Communication 2016: 1-5 - [c302]Maximilian Schmitt, Erik Marchi, Fabien Ringeval, Björn W. Schuller:
Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices. ITG Symposium on Speech Communication 2016: 1-5 - [c301]Jun Deng, Nicholas Cummins, Jing Han, Xinzhou Xu, Zhao Ren, Vedhas Pandit, Zixing Zhang, Björn W. Schuller:
The University of Passau Open Emotion Recognition System for the Multimodal Emotion Challenge. CCPR (2) 2016: 652-666 - [c300]Ya Li, Jianhua Tao, Björn W. Schuller, Shiguang Shan, Dongmei Jiang, Jia Jia:
MEC 2016: The Multimodal Emotion Recognition Challenge of CCPR 2016. CCPR (2) 2016: 667-678 - [c299]Erik Cambria, Soujanya Poria, Rajiv Bajpai, Björn W. Schuller:
SenticNet 4: A Semantic Resource for Sentiment Analysis Based on Conceptual Primitives. COLING 2016: 2666-2677 - [c298]George Trigeorgis, Mihalis A. Nicolaou, Stefanos Zafeiriou, Björn W. Schuller:
Deep Canonical Time Warping. CVPR 2016: 5110-5118 - [c297]Erik Marchi, Dario Tonelli, Xinzhou Xu, Fabien Ringeval, Jun Deng, Stefano Squartini, Björn W. Schuller:
Pairwise Decomposition with Deep Neural Networks and Multiscale Kernel Subspace Learning for Acoustic Scene Classification. DCASE 2016: 65-69 - [c296]Jian Guo, Kun Qian, Huijie Xu, Christoph Janott, Björn W. Schuller, Satoshi Matsuoka:
GPU-based fast signal processing for large amounts of snore sound data. GCCE 2016: 1-2 - [c295]Kun Qian, Christoph Janott, Zixing Zhang, Clemens Heiser, Björn W. Schuller:
Wavelet features for classification of vote snore sounds. ICASSP 2016: 221-225 - [c294]Marius Telespan, Björn W. Schuller:
Audio watermarking based on empirical mode decomposition and beat detection. ICASSP 2016: 2124-2128 - [c293]Zixing Zhang, Fabien Ringeval, Bin Dong, Eduardo Coutinho, Erik Marchi, Björn W. Schuller:
Enhanced semi-supervised learning for multimodal emotion recognition. ICASSP 2016: 5185-5189 - [c292]George Trigeorgis, Fabien Ringeval, Raymond Brueckner, Erik Marchi, Mihalis A. Nicolaou, Björn W. Schuller, Stefanos Zafeiriou:
Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network. ICASSP 2016: 5200-5204 - [c291]Hesam Sagha, Jun Deng, Maryna Gavryukova, Jing Han, Björn W. Schuller:
Cross lingual speech emotion recognition using canonical correlation analysis on principal component subspace. ICASSP 2016: 5800-5804 - [c290]Yue Zhang, Yuxiang Zhou, Jie Shen, Björn W. Schuller:
Semi-autonomous data enrichment based on cross-task labelling of missing targets for holistic speech analysis. ICASSP 2016: 6090-6094 - [c289]Xinzhou Xu, Jun Deng, Maryna Gavryukova, Zixing Zhang, Li Zhao, Björn W. Schuller:
Multiscale kernel locally penalised discriminant analysis exemplified by emotion recognition in speech. ICMI 2016: 233-237 - [c288]Yue Zhang, Felix Weninger, Anton Batliner, Florian Hönig, Björn W. Schuller:
Language proficiency assessment of English L2 speakers based on joint analysis of prosody and native language. ICMI 2016: 274-278 - [c287]Michel F. Valstar, Tobias Baur, Angelo Cafaro, Alexandru Ghitulescu, Blaise Potard, Johannes Wagner, Elisabeth André, Laurent Durieu, Matthew P. Aylett, Soumia Dermouche, Catherine Pelachaud, Eduardo Coutinho, Björn W. Schuller, Yue Zhang, Dirk Heylen, Mariët Theune, Jelte van Waterschoot:
Ask Alice: an artificial retrieval of information agent. ICMI 2016: 419-420 - [c286]Irman Abdic, Lex Fridman, Daniel E. Brown, William Angell, Bryan Reimer, Erik Marchi, Björn W. Schuller:
Detecting road surface wetness from audio: A deep learning approach. ICPR 2016: 3458-3463 - [c285]Irman Abdic, Lex Fridman, Daniel McDuff, Erik Marchi, Bryan Reimer, Björn W. Schuller:
Driver Frustration Detection from Audio and Video in the Wild. IJCAI 2016: 1354-1360 - [c284]Felix Weninger, Fabien Ringeval, Erik Marchi, Björn W. Schuller:
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio. IJCAI 2016: 2196-2202 - [c283]Gil Keren, Björn W. Schuller:
Convolutional RNN: An enhanced model for extracting features from sequential data. IJCNN 2016: 3412-3419 - [c282]Maximilian Schmitt, Fabien Ringeval, Björn W. Schuller:
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech. INTERSPEECH 2016: 495-499 - [c281]Erik Marchi, Florian Eyben, Gerhard Hagerer, Björn W. Schuller:
Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms. INTERSPEECH 2016: 1182-1183 - [c280]Fabien Ringeval, Erik Marchi, Charline Grossard, Jean Xavier, Mohamed Chetouani, David Cohen, Björn W. Schuller:
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children. INTERSPEECH 2016: 1210-1214 - [c279]Florian B. Pokorny, Peter B. Marschik, Christa Einspieler, Björn W. Schuller:
Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis. INTERSPEECH 2016: 1953-1957 - [c278]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron C. Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini:
The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language. INTERSPEECH 2016: 2001-2005 - [c277]Shahin Amiriparian, Jouni Pohjalainen, Erik Marchi, Sergey Pugachevskiy, Björn W. Schuller:
Is Deception Emotional? An Emotion-Driven Predictive Approach. INTERSPEECH 2016: 2011-2015 - [c276]Yue Zhang, Felix Weninger, Zhao Ren, Björn W. Schuller:
Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective. INTERSPEECH 2016: 2041-2045 - [c275]Gil Keren, Jun Deng, Jouni Pohjalainen, Björn W. Schuller:
Convolutional Neural Networks with Data Augmentation for Classifying Speakers' Native Language. INTERSPEECH 2016: 2393-2397 - [c274]Amr El-Desoky Mousa, Björn W. Schuller:
Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments. INTERSPEECH 2016: 2836-2840 - [c273]Hesam Sagha, Pavel Matejka, Maryna Gavryukova, Filip Povolný, Erik Marchi, Björn W. Schuller:
Enhancing Multilingual Recognition of Emotion in Speech by Language Identification. INTERSPEECH 2016: 2949-2953 - [c272]Florian B. Pokorny, Robert Peharz, Wolfgang Roth, Matthias Zöhrer, Franz Pernkopf, Peter B. Marschik, Björn W. Schuller:
Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development. INTERSPEECH 2016: 2997-3001 - [c271]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron C. Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini:
The Deception Sub-Challenge: The Data. INTERSPEECH 2016 - [c270]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini:
The Sincerity Sub-Challenge: The Data. INTERSPEECH 2016 - [c269]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini:
The Native Language Sub-Challenge: The Data. INTERSPEECH 2016 - [c268]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini:
The INTERSPEECH 2016 Computational Paralinguistics Challenge: A Summary of Results. INTERSPEECH 2016 - [c267]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julia Hirschberg, Judee K. Burgoon, Alice Baird, Aaron Elkins, Yue Zhang, Eduardo Coutinho, Keelan Evanini:
Discussion. INTERSPEECH 2016 - [c266]Zixing Zhang, Fabien Ringeval, Jing Han, Jun Deng, Erik Marchi, Björn W. Schuller:
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks. INTERSPEECH 2016: 3593-3597 - [c265]Jun Deng, Xinzhou Xu, Zixing Zhang, Sascha Frühholz, Didier Grandjean, Björn W. Schuller:
Fisher Kernels on Phase-Based Features for Speech Emotion Recognition. IWSDS 2016: 195-203 - [c264]Eduardo Coutinho, Florian Hönig, Yue Zhang, Simone Hantke, Anton Batliner, Elmar Nöth, Björn W. Schuller:
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets. LREC 2016 - [c263]Simone Hantke, Erik Marchi, Björn W. Schuller:
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification. LREC 2016 - [c262]Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth:
Tendencies regarding the effect of emotional intensity in inter corpus phoneme-level speech emotion modelling. MLSP 2016: 1-6 - [c261]Michel F. Valstar, Jonathan Gratch, Björn W. Schuller, Fabien Ringeval, Denis Lalanne, Mercedes Torres, Stefan Scherer, Giota Stratou, Roddy Cowie, Maja Pantic:
AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge. AVEC@ACM Multimedia 2016: 3-10 - [c260]Jouni Pohjalainen, Fabien Ringeval, Zixing Zhang, Björn W. Schuller:
Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition. ACM Multimedia 2016: 670-674 - [c259]Maja Pantic, Vanessa Evers, Marc Peter Deisenroth, Luis Merino, Björn W. Schuller:
Social and Affective Robotics Tutorial. ACM Multimedia 2016: 1477-1478 - [c258]Michel F. Valstar, Jonathan Gratch, Björn W. Schuller, Fabien Ringeval, Roddy Cowie, Maja Pantic:
Summary for AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge. ACM Multimedia 2016: 1483-1484 - [p4]Björn W. Schuller:
A Decade of Encouraging Speech Processing "Outside of the Box" - A Foreword. Recent Advances in Nonlinear Speech Processing 2016: 3-4 - [e11]Michel F. Valstar, Jonathan Gratch, Björn W. Schuller, Fabien Ringeval, Roddy Cowie, Maja Pantic:
Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, AVEC@MM 2016, Amsterdam, The Netherlands, October 16, 2016. ACM 2016, ISBN 978-1-4503-4516-3 [contents] - [i11]Gil Keren, Björn W. Schuller:
Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data. CoRR abs/1602.05875 (2016) - [i10]Michel F. Valstar, Jonathan Gratch, Björn W. Schuller, Fabien Ringeval, Denis Lalanne, Mercedes Torres, Stefan Scherer, Giota Stratou, Roddy Cowie, Maja Pantic:
AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge. CoRR abs/1605.01600 (2016) - [i9]Maximilian Schmitt, Björn W. Schuller:
openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit. CoRR abs/1605.06778 (2016) - [i8]Gil Keren, Sivan Sabato, Björn W. Schuller:
Tunable Sensitivity to Large Errors in Neural Network Training. CoRR abs/1611.07743 (2016) - 2015
- [j67]Björn W. Schuller:
Do Computers Have Personality? Computer 48(3): 6-7 (2015) - [j66]Björn W. Schuller, Stefan Steidl, Anton Batliner, Alessandro Vinciarelli, Felix Burkhardt, Rob van Son:
Introduction. Comput. Speech Lang. 29(1): 98-99 (2015) - [j65]Björn W. Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob van Son, Felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi, Benjamin Weiss:
A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge. Comput. Speech Lang. 29(1): 100-131 (2015) - [j64]Florian Eyben, Gláucia L. Salomão, Johan Sundberg, Klaus R. Scherer, Björn W. Schuller:
Emotion in the singing voice - a deeperlook at acoustic features in the light ofautomatic classification. EURASIP J. Audio Speech Music. Process. 2015: 19 (2015) - [j63]Felix Weninger, Johannes Bergmann, Björn W. Schuller:
Introducing CURRENNT: the munich open-source CUDA recurrent neural network toolkit. J. Mach. Learn. Res. 16: 547-551 (2015) - [j62]Fabien Ringeval, Florian Eyben, Eleni Kroupi, Anil Yüce, Jean-Philippe Thiran, Touradj Ebrahimi, Denis Lalanne, Björn W. Schuller:
Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data. Pattern Recognit. Lett. 66: 22-30 (2015) - [j61]Zixing Zhang, Eduardo Coutinho, Jun Deng, Björn W. Schuller:
Cooperative Learning and its Application to Emotion Recognition from Speech. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 115-126 (2015) - [j60]Björn W. Schuller, Amr El-Desoky Mousa, Vasileios Vryniotis:
Sentiment analysis and opinion mining: on optimal parameters and performances. WIREs Data Mining Knowl. Discov. 5(5): 255-263 (2015) - [c257]Nicolas Sabouret, Björn W. Schuller, Lucas Paletta, Erik Marchi, Hazaël Jones, Atef Ben Youssef:
Intelligent user interfaces in digital games for empowerment and inclusion. Advances in Computer Entertainment 2015: 8:1-8:8 - [c256]Yue Zhang, Eduardo Coutinho, Björn W. Schuller, Zixing Zhang, Michael G. Adam:
On rater reliability and agreement based dynamic active learning. ACII 2015: 70-76 - [c255]Silvia Monica Feraru, Dagmar Schuller, Björn W. Schuller:
Cross-language acoustic emotion recognition: An overview and some tendencies. ACII 2015: 125-131 - [c254]Marc Schröder, Elisabetta Bevacqua, Roddy Cowie, Florian Eyben, Hatice Gunes, Dirk Heylen, Mark ter Maat, Gary McKeown, Sathish Pammi, Maja Pantic, Catherine Pelachaud, Björn W. Schuller, Etienne de Sevin, Michel F. Valstar, Martin Wöllmer:
Building autonomous sensitive artificial listeners (Extended abstract). ACII 2015: 456-462 - [c253]Angeliki Metallinou, Athanasios Katsamanis, Martin Wöllmer, Florian Eyben, Björn W. Schuller, Shrikanth S. Narayanan:
Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract). ACII 2015: 463-469 - [c252]Björn W. Schuller, Bogdan Vlasenko, Florian Eyben, Martin Wöllmer, André Stuhlsatz, Andreas Wendemuth, Gerhard Rigoll:
Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract). ACII 2015: 470-476 - [c251]Florian Eyben, Bernd Huber, Erik Marchi, Dagmar Schuller, Björn W. Schuller:
Real-time robust recognition of speakers' emotions and characteristics on mobile platforms. ACII 2015: 778-780 - [c250]Florian B. Pokorny, Franz Graf, Franz Pernkopf, Björn W. Schuller:
Detection of negative emotions in speech signals using bags-of-audio-words. ACII 2015: 879-884 - [c249]Simone Hantke, Florian Eyben, Tobias Appel, Björn W. Schuller:
iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing. ACII 2015: 891-897 - [c248]Kun Qian, Zixing Zhang, Fabien Ringeval, Björn W. Schuller:
Bird sounds classification by large scale acoustic features and extreme learning machine. GlobalSIP 2015: 1317-1321 - [c247]Felix Weninger, Hakan Erdogan, Shinji Watanabe, Emmanuel Vincent, Jonathan Le Roux, John R. Hershey, Björn W. Schuller:
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR. LVA/ICA 2015: 91-99 - [c246]Erik Marchi, Fabio Vesperini, Florian Eyben, Stefano Squartini, Björn W. Schuller:
A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks. ICASSP 2015: 1996-2000 - [c245]Kim Hartmann, Ingo Siegert, Björn W. Schuller, Louis-Philippe Morency, Albert Ali Salah, Ronald Böck:
ERM4CT 2015: Workshop on Emotion Representations and Modelling for Companion Systems. ERM4CT@ICMI 2015: 1-2 - [c244]Yue Zhang, Eduardo Coutinho, Zixing Zhang, Caijiao Quan, Björn W. Schuller:
Dynamic Active Learning Based on Agreement and Applied to Emotion Recognition in Spoken Interactions. ICMI 2015: 275-278 - [c243]Erik Marchi, Fabio Vesperini, Felix Weninger, Florian Eyben, Stefano Squartini, Björn W. Schuller:
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection. IJCNN 2015: 1-7 - [c242]Erik Marchi, Björn W. Schuller, Simon Baron-Cohen, Ofer Golan, Sven Bölte, Prerna Arora, Reinhold Häb-Umbach:
Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages. INTERSPEECH 2015: 115-119 - [c241]Björn W. Schuller, Stefan Steidl, Anton Batliner, Simone Hantke, Florian Hönig, Juan Rafael Orozco-Arroyave, Elmar Nöth, Yue Zhang, Felix Weninger:
The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition. INTERSPEECH 2015: 478-482 - [c240]Xinzhou Xu, Jun Deng, Wenming Zheng, Li Zhao, Björn W. Schuller:
Dimensionality reduction for speech emotion features by multiscale kernels. INTERSPEECH 2015: 1532-1536 - [c239]Fabien Ringeval, Erik Marchi, Marc Mehu, Klaus R. Scherer, Björn W. Schuller:
Face reading from speech - predicting facial action units from audio cues. INTERSPEECH 2015: 1977-1981 - [c238]Lucas Azaïs, Adrien Payan, Tianjiao Sun, Guillaume Vidal, Tina Zhang, Eduardo Coutinho, Florian Eyben, Björn W. Schuller:
Does my speech rock? automatic assessment of public speaking skills. INTERSPEECH 2015: 2519-2523 - [c237]Björn W. Schuller:
Modelling User Affect and Sentiment in Intelligent User Interfaces: A Tutorial Overview. IUI 2015: 443-446 - [c236]Lucas Paletta, Björn W. Schuller, Peter Robinson, Nicolas Sabouret:
IDGEI 2015: 3rd International Workshop on Intelligent Digital Games for Empowerment and Inclusion. IUI 2015: 450-452 - [c235]Eduardo Coutinho, George Trigeorgis, Stefanos Zafeiriou, Björn W. Schuller:
Automatically Estimating Emotion in Music with Deep Long-Short Term Memory Recurrent Neural Networks. MediaEval 2015 - [c234]George Trigeorgis, Eduardo Coutinho, Fabien Ringeval, Erik Marchi, Stefanos Zafeiriou, Björn W. Schuller:
The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task. MediaEval 2015 - [c233]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Shashank Jaiswal, Erik Marchi, Denis Lalanne, Roddy Cowie, Maja Pantic:
AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data. AVEC@ACM Multimedia 2015: 3-8 - [c232]Hesam Sagha, Eduardo Coutinho, Björn W. Schuller:
Exploring the Importance of Individual Differences to the Automatic Estimation of Emotions Induced by Music. AVEC@ACM Multimedia 2015: 57-63 - [c231]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Maja Pantic:
AVEC 2015: The 5th International Audio/Visual Emotion Challenge and Workshop. ACM Multimedia 2015: 1335-1336 - [c230]Björn W. Schuller:
Speech Analysis in the Big Data Era. TSD 2015: 3-11 - [p3]Björn W. Schuller:
Emotional Expressions and Daily Cognitive Functions. Advances in Neural Networks 2015: 339-346 - [e10]Kim Hartmann, Ingo Siegert, Björn W. Schuller, Louis-Philippe Morency, Albert Ali Salah, Ronald Böck:
Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies, ERM4CT@ICMI 2015, Seattle, Washington, USA, November 13, 2015. ACM 2015, ISBN 978-1-4503-3988-9 [contents] - [e9]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Maja Pantic:
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, AVEC 2015, Brisbane, Australia, October 26, 2015. ACM 2015, ISBN 978-1-4503-3743-4 [contents] - [i7]George Trigeorgis, Konstantinos Bousmalis, Stefanos Zafeiriou, Björn W. Schuller:
A deep matrix factorization method for learning attribute representations. CoRR abs/1509.03248 (2015) - [i6]Amr El-Desoky Mousa, Erik Marchi, Björn W. Schuller:
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models. CoRR abs/1510.00268 (2015) - 2014
- [j59]Björn W. Schuller, Stefan Steidl, Anton Batliner, Florian Schiel, Jarek Krajewski:
Introduction to the Special Issue on Broadening the View on Speaker Analysis. Comput. Speech Lang. 28(2): 343-345 (2014) - [j58]Björn W. Schuller, Stefan Steidl, Anton Batliner, Florian Schiel, Jarek Krajewski, Felix Weninger, Florian Eyben:
Medium-term speaker states - A review on intoxication, sleepiness and the first challenge. Comput. Speech Lang. 28(2): 346-374 (2014) - [j57]Felix Weninger, Jürgen T. Geiger, Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll:
Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments. Comput. Speech Lang. 28(4): 888-902 (2014) - [j56]Martin Wöllmer, Björn W. Schuller:
Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks. Neurocomputing 132: 113-120 (2014) - [j55]Martin Hofmann, Jürgen T. Geiger, Sebastian Bachmann, Björn W. Schuller, Gerhard Rigoll:
The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits. J. Vis. Commun. Image Represent. 25(1): 195-206 (2014) - [j54]Amir Hussain, Erik Cambria, Björn W. Schuller, Newton Howard:
Affective neural networks and cognitive learning systems for big data analysis. Neural Networks 58: 1-3 (2014) - [j53]Jun Deng, Zixing Zhang, Florian Eyben, Björn W. Schuller:
Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition. IEEE Signal Process. Lett. 21(9): 1068-1072 (2014) - [j52]Zixing Zhang, Eduardo Coutinho, Jun Deng, Björn W. Schuller:
Distributing Recognition in Computational Paralinguistics. IEEE Trans. Affect. Comput. 5(4): 406-417 (2014) - [j51]Jürgen T. Geiger, Felix Weninger, Jort F. Gemmeke, Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll:
Memory-Enhanced Neural Networks and NMF for Robust ASR. IEEE ACM Trans. Audio Speech Lang. Process. 22(6): 1037-1046 (2014) - [j50]Zixing Zhang, Joel Pinto, Christian Plahl, Björn W. Schuller, Daniel Willett:
Channel mapping using bidirectional long short-term memory for dereverberation in hands-free voice controlled devices. IEEE Trans. Consumer Electron. 60(3): 525-533 (2014) - [c229]Felix Weninger, John R. Hershey, Jonathan Le Roux, Björn W. Schuller:
Discriminatively trained recurrent neural networks for single-channel speech separation. GlobalSIP 2014: 577-581 - [c228]Rui Xia, Jun Deng, Björn W. Schuller, Yang Liu:
Modeling gender information for emotion recognition using Denoising autoencoder. ICASSP 2014: 990-994 - [c227]Erik Marchi, Giacomo Ferroni, Florian Eyben, Leonardo Gabrielli, Stefano Squartini, Björn W. Schuller:
Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks. ICASSP 2014: 2164-2168 - [c226]Felix Weninger, Florian Eyben, Björn W. Schuller:
Single-channel speech separation with memory-enhanced recurrent neural networks. ICASSP 2014: 3709-3713 - [c225]Heysem Kaya, Florian Eyben, Albert Ali Salah, Björn W. Schuller:
CCA based feature selection with application to continuous depression recognition from acoustic speech features. ICASSP 2014: 3729-3733 - [c224]Felix Weninger, Shinji Watanabe, Yuuki Tachioka, Björn W. Schuller:
Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition. ICASSP 2014: 4623-4627 - [c223]Jun Deng, Rui Xia, Zixing Zhang, Yang Liu, Björn W. Schuller:
Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition. ICASSP 2014: 4818-4822 - [c222]Raymond Brueckner, Björn W. Schuller:
Social signal classification using deep blstm recurrent neural networks. ICASSP 2014: 4823-4827 - [c221]Felix Weninger, Florian Eyben, Björn W. Schuller:
On-line continuous-time music mood regression with deep recurrent neural networks. ICASSP 2014: 5412-5416 - [c220]Oya Çeliktutan, Florian Eyben, Evangelos Sariyanidi, Hatice Gunes, Björn W. Schuller:
MAPTRAITS 2014: The First Audio/Visual Mapping Personality Traits Challenge. MAPTRAITS@ICMI 2014: 3-9 - [c219]Jürgen T. Geiger, Maximilian Kneißl, Björn W. Schuller, Gerhard Rigoll:
Acoustic Gait-based Person Identification using Hidden Markov Models. MAPTRAITS@ICMI 2014: 25-30 - [c218]Fabien Ringeval, Shahin Amiriparian, Florian Eyben, Klaus R. Scherer, Björn W. Schuller:
Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion. ICMI 2014: 473-480 - [c217]Kim Hartmann, Björn W. Schuller, Ronald Böck:
ERM4HCI 2014: The 2nd Workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems. ICMI 2014: 525-526 - [c216]Oya Çeliktutan, Florian Eyben, Evangelos Sariyanidi, Hatice Gunes, Björn W. Schuller:
MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions. ICMI 2014: 529-530 - [c215]George Trigeorgis, Konstantinos Bousmalis, Stefanos Zafeiriou, Björn W. Schuller:
A Deep Semi-NMF Model for Learning Hidden Representations. ICML 2014: 1692-1700 - [c214]Jun Deng, Zixing Zhang, Björn W. Schuller:
Linked Source and Target Domain Subspace Feature Transfer Learning - Exemplified by Speech Emotion Recognition. ICPR 2014: 761-766 - [c213]Erik Marchi, Giacomo Ferroni, Florian Eyben, Stefano Squartini, Björn W. Schuller:
Audio onset detection: A wavelet packet based approach with recurrent neural networks. IJCNN 2014: 3585-3591 - [c212]Eduardo Coutinho, Jun Deng, Björn W. Schuller:
Transfer learning emotion manifestation across music and speech. IJCNN 2014: 3592-3598 - [c211]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julien Epps, Florian Eyben, Fabien Ringeval, Erik Marchi, Yue Zhang:
The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. INTERSPEECH 2014: 427-431 - [c210]Jürgen T. Geiger, Zixing Zhang, Felix Weninger, Björn W. Schuller, Gerhard Rigoll:
Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling. INTERSPEECH 2014: 631-635 - [c209]Jürgen T. Geiger, Jort F. Gemmeke, Björn W. Schuller, Gerhard Rigoll:
Investigating NMF speech enhancement for neural network based acoustic models. INTERSPEECH 2014: 2405-2409 - [c208]Lucas Paletta, Björn W. Schuller, Peter Robinson, Nicolas Sabouret:
IDGEI 2014: 2nd international workshop on intelligent digital games for empowerment and inclusion. IUI Companion 2014: 49-50 - [c207]Björn W. Schuller, Felix Friedmann, Florian Eyben:
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production. LREC 2014: 1506-1510 - [c206]Eduardo Coutinho, Felix Weninger, Björn W. Schuller, Klaus R. Scherer:
The Munich LSTM-RNN Approach to the MediaEval 2014 "Emotion in Music'" Task. MediaEval 2014 - [c205]Michel F. Valstar, Björn W. Schuller, Kirsty Smith, Timur R. Almaev, Florian Eyben, Jarek Krajewski, Roddy Cowie, Maja Pantic:
AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge. AVEC@MM 2014: 3-10 - [c204]Mohammad Soleymani, Anna Aljanaki, Yi-Hsuan Yang, Michael N. Caro, Florian Eyben, Konstantin Markov, Björn W. Schuller, Remco C. Veltkamp, Felix Weninger, Frans Wiering:
Emotional Analysis of Music: A Comparison of Methods. ACM Multimedia 2014: 1161-1164 - [c203]Michel F. Valstar, Björn W. Schuller, Jarek Krajewski, Roddy Cowie, Maja Pantic:
AVEC 2014: the 4th international audio/visual emotion challenge and workshop. ACM Multimedia 2014: 1243-1244 - [c202]Jürgen T. Geiger, Boxin Zhang, Björn W. Schuller, Gerhard Rigoll:
On the Influence of Alcohol Intoxication on Speaker Recognition. Semantic Audio 2014 - [c201]Christian Kirst, Felix Weninger, Cyril Joder, Peter Grosche, Jürgen T. Geiger, Björn W. Schuller:
On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise. Semantic Audio 2014 - [e8]Albert Ali Salah, Jeffrey F. Cohn, Björn W. Schuller, Oya Aran, Louis-Philippe Morency, Philip R. Cohen:
Proceedings of the 16th International Conference on Multimodal Interaction, ICMI 2014, Istanbul, Turkey, November 12-16, 2014. ACM 2014, ISBN 978-1-4503-2885-2 [contents] - [e7]Kim Hartmann, Björn W. Schuller, Klaus R. Scherer, Ronald Böck:
Proceedings of the 2014 workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems, ERM4HCI@ICMI 2014, Istanbul, Turkey, November 16, 2014. ACM 2014, ISBN 978-1-4503-0124-4 [contents] - [e6]Hatice Gunes, Björn W. Schuller, Oya Çeliktutan, Evangelos Sariyanidi, Florian Eyben:
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, MAPTRAITS@ICMI 2014, Istanbul, Turkey, November 12, 2014. ACM 2014, ISBN 978-1-4503-0480-1 [contents] - [e5]Michel F. Valstar, Björn W. Schuller, Jarek Krajewski, Roddy Cowie, Maja Pantic:
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, AVEC '14, Orlando, Florida, USA, November 7, 2014. ACM 2014, ISBN 978-1-4503-3119-7 [contents] - [i5]Björn W. Schuller, Erik Marchi, Simon Baron-Cohen, Helen O'Reilly, Delia Pigat, Peter Robinson, Ian Davies:
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions. CoRR abs/1403.5912 (2014) - [i4]Jürgen T. Geiger, Maximilian Kneißl, Björn W. Schuller, Gerhard Rigoll:
Acoustic Gait-based Person Identification using Hidden Markov Models. CoRR abs/1406.2895 (2014) - [i3]Felix Weninger, Björn W. Schuller, Florian Eyben, Martin Wöllmer, Gerhard Rigoll:
A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems. CoRR abs/1412.4616 (2014) - 2013
- [b3]Björn W. Schuller:
Intelligent Audio Analysis. Signals and communication technology, Springer 2013, ISBN 978-3-642-36805-9, pp. I-XXVIII, 1-345 - [j49]Rudy Rotili, Emanuele Principi, Stefano Squartini, Björn W. Schuller:
A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios. Cogn. Comput. 5(4): 504-516 (2013) - [j48]Björn W. Schuller, Stefan Steidl, Anton Batliner:
Introduction to the special issue on Paralinguistics in Naturalistic Speech and Language. Comput. Speech Lang. 27(1): 1-3 (2013) - [j47]Björn W. Schuller, Stefan Steidl, Anton Batliner, Felix Burkhardt, Laurence Devillers, Christian A. Müller, Shrikanth S. Narayanan:
Paralinguistics in speech and language - State-of-the-art and the challenge. Comput. Speech Lang. 27(1): 4-39 (2013) - [j46]Martin Wöllmer, Felix Weninger, Jürgen T. Geiger, Björn W. Schuller, Gerhard Rigoll:
Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory. Comput. Speech Lang. 27(3): 780-797 (2013) - [j45]Erik Cambria, Björn W. Schuller, Bing Liu, Haixun Wang, Catherine Havasi:
Knowledge-Based Approaches to Concept-Level Sentiment Analysis. IEEE Intell. Syst. 28(2): 12-14 (2013) - [j44]Erik Cambria, Björn W. Schuller, Yunqing Xia, Catherine Havasi:
New Avenues in Opinion Mining and Sentiment Analysis. IEEE Intell. Syst. 28(2): 15-21 (2013) - [j43]Erik Cambria, Björn W. Schuller, Bing Liu, Haixun Wang, Catherine Havasi:
Statistical Approaches to Concept-Level Sentiment Analysis. IEEE Intell. Syst. 28(3): 6-9 (2013) - [j42]Martin Wöllmer, Felix Weninger, Tobias Knaup, Björn W. Schuller, Congkai Sun, Kenji Sagae, Louis-Philippe Morency:
YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context. IEEE Intell. Syst. 28(3): 46-53 (2013) - [j41]Felix Weninger, Pascal Staudt, Björn W. Schuller:
Words that Fascinate the Listener: Predicting Affective Ratings of On-Line Lectures. Int. J. Distance Educ. Technol. 11(2): 110-123 (2013) - [j40]Hatice Gunes, Björn W. Schuller:
Introduction To The Special Issue On Affect Analysis In Continuous Input. Image Vis. Comput. 31(2): 118-119 (2013) - [j39]Hatice Gunes, Björn W. Schuller:
Categorical and dimensional affect analysis in continuous input: Current trends and future directions. Image Vis. Comput. 31(2): 120-136 (2013) - [j38]Martin Wöllmer, Moritz Kaiser, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework. Image Vis. Comput. 31(2): 153-163 (2013) - [j37]Björn W. Schuller, Ian Dunwell, Felix Weninger, Lucas Paletta:
Serious Gaming for Behavior Change: The State of Play. IEEE Pervasive Comput. 12(3): 48-55 (2013) - [j36]Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll:
Keyword spotting exploiting Long Short-Term Memory. Speech Commun. 55(2): 252-265 (2013) - [c200]Jun Deng, Zixing Zhang, Erik Marchi, Björn W. Schuller:
Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition. ACII 2013: 511-516 - [c199]Raymond Brueckner, Björn W. Schuller:
Hierarchical neural networks and enhanced class posteriors for social signal classification. ASRU 2013: 362-367 - [c198]Felix Weninger, Christian Kirst, Björn W. Schuller, Hans-Joachim Bungartz:
A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization. ICASSP 2013: 6-10 - [c197]Cyril Joder, Felix Weninger, David Virette, Björn W. Schuller:
Integrating noise estimation and factorization-based speech separation: A novel hybrid approach. ICASSP 2013: 131-135 - [c196]Cyril Joder, Björn W. Schuller:
Off-line refinement of audio-to-score alignment by observation template adaptation. ICASSP 2013: 206-210 - [c195]Björn W. Schuller, Florian B. Pokorny, Stefan Ladstaetter, Maria Fellner, Franz Graf, Lucas Paletta:
Acoustic Geo-Sensing: Recognising cyclists' route, route direction, and route progress from cell-phone audio. ICASSP 2013: 453-457 - [c194]Jürgen T. Geiger, Martin Hofmann, Björn W. Schuller, Gerhard Rigoll:
Gait-based person identification by spectral, cepstral and energy-related audio features. ICASSP 2013: 458-462 - [c193]Florian Eyben, Felix Weninger, Stefano Squartini, Björn W. Schuller:
Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies. ICASSP 2013: 483-487 - [c192]Cyril Joder, Felix Weninger, David Virette, Björn W. Schuller:
A comparative study on sparsity penalties for NMF-based speech separation: Beyond LP-norms. ICASSP 2013: 858-862 - [c191]Felix Weninger, Claudia Wagner, Martin Wöllmer, Björn W. Schuller, Louis-Philippe Morency:
Speaker trait characterization in web videos: Uniting speech, language, and facial features. ICASSP 2013: 3647-3651 - [c190]Martin Wöllmer, Zixing Zhang, Felix Weninger, Björn W. Schuller, Gerhard Rigoll:
Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise. ICASSP 2013: 6822-6826 - [c189]Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll:
Probabilistic asr feature extraction applying context-sensitive connectionist temporal classification networks. ICASSP 2013: 7125-7129 - [c188]Björn W. Schuller, Felix Friedmann, Florian Eyben:
Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance. ICASSP 2013: 7219-7223 - [c187]Zixing Zhang, Jun Deng, Björn W. Schuller:
Co-training succeeds in Computational Paralinguistics. ICASSP 2013: 8505-8509 - [c186]Florian Eyben, Felix Weninger, Lucas Paletta, Björn W. Schuller:
The acoustics of eye contact: detecting visual attention from conversational audio cues. GazeIn@ICMI 2013: 7-12 - [c185]Kim Hartmann, Ronald Böck, Christian Becker-Asano, Jonathan Gratch, Björn W. Schuller, Klaus R. Scherer:
ERM4HCI 2013: the 1st workshop on emotion representation and modelling in human-computer-interaction-systems. ICMI 2013: 607-608 - [c184]Aldona Rosner, Felix Weninger, Björn W. Schuller, Marcin Michalak, Bozena Kostek:
Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification. ICMMI 2013: 467-473 - [c183]Björn W. Schuller, Stefan Steidl, Anton Batliner, Alessandro Vinciarelli, Klaus R. Scherer, Fabien Ringeval, Mohamed Chetouani, Felix Weninger, Florian Eyben, Erik Marchi, Marcello Mortillaro, Hugues Salamin, Anna Polychroniou, Fabio Valente, Samuel Kim:
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism. INTERSPEECH 2013: 148-152 - [c182]Jürgen T. Geiger, Florian Eyben, Nicholas W. D. Evans, Björn W. Schuller, Gerhard Rigoll:
Using linguistic information to detect overlapping speech. INTERSPEECH 2013: 690-694 - [c181]Jürgen T. Geiger, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Detecting overlapping speech with long short-term memory recurrent neural networks. INTERSPEECH 2013: 1668-1672 - [c180]Florian Eyben, Felix Weninger, Björn W. Schuller:
Affect recognition in real-life acoustic conditions - a new perspective on feature selection. INTERSPEECH 2013: 2044-2048 - [c179]Wenjing Han, Haifeng Li, Huabin Ruan, Lin Ma, Jiayin Sun, Björn W. Schuller:
Active learning for dimensional speech emotion recognition. INTERSPEECH 2013: 2841-2845 - [c178]Zixing Zhang, Jun Deng, Erik Marchi, Björn W. Schuller:
Active learning by label uncertainty for acoustic emotion recognition. INTERSPEECH 2013: 2856-2860 - [c177]Felix Weninger, Florian Eyben, Björn W. Schuller:
The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features. MediaEval 2013 - [c176]Michel F. Valstar, Björn W. Schuller, Kirsty Smith, Florian Eyben, Bihan Jiang, Sanjay Bilakhia, Sebastian Schnieder, Roddy Cowie, Maja Pantic:
AVEC 2013: the continuous audio/visual emotion and depression recognition challenge. AVEC@ACM Multimedia 2013: 3-10 - [c175]Florian Eyben, Felix Weninger, Florian Groß, Björn W. Schuller:
Recent developments in openSMILE, the munich open-source multimedia feature extractor. ACM Multimedia 2013: 835-838 - [c174]Michel F. Valstar, Björn W. Schuller, Jarek Krajewski, Roddy Cowie, Maja Pantic:
Workshop summary for the 3rd international audio/visual emotion challenge and workshop (AVEC'13). ACM Multimedia 2013: 1085-1086 - [c173]Jürgen T. Geiger, Björn W. Schuller, Gerhard Rigoll:
Large-scale audio feature extraction and SVM for acoustic scene classification. WASPAA 2013: 1-4 - [c172]Florian Eyben, Felix Weninger, Erik Marchi, Björn W. Schuller:
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation. WIAMIS 2013: 1-4 - [e4]Julien Epps, Fang Chen, Sharon L. Oviatt, Kenji Mase, Andrew Sears, Kristiina Jokinen, Björn W. Schuller:
2013 International Conference on Multimodal Interaction, ICMI '13, Sydney, NSW, Australia, December 9-13, 2013. ACM 2013, ISBN 978-1-4503-2129-7 [contents] - [e3]Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Jarek Krajewski, Maja Pantic:
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, AVEC@ACM Multimedia 2013, Barcelona, Spain, October 21, 2013. ACM 2013, ISBN 978-1-4503-2395-6 [contents] - [i2]Lucas Paletta, Laurent Itti, Björn W. Schuller, Fang Fang:
6th International Symposium on Attention in Cognitive Systems 2013. CoRR abs/1307.6170 (2013) - [i1]Meinard Müller, Shrikanth S. Narayanan, Björn W. Schuller:
Computational Audio Analysis (Dagstuhl Seminar 13451). Dagstuhl Reports 3(11): 1-28 (2013) - 2012
- [j35]Stefano Squartini, Björn W. Schuller, Amir Hussain:
Cognitive and Emotional Information Processing for Human-Machine Interaction. Cogn. Comput. 4(4): 383-385 (2012) - [j34]Emanuele Principi, Rudy Rotili, Martin Wöllmer, Florian Eyben, Stefano Squartini, Björn W. Schuller:
Real-Time Activity Detection in a Multi-Talker Reverberated Environment. Cogn. Comput. 4(4): 386-397 (2012) - [j33]Julien Epps, Roddy Cowie, Shrikanth S. Narayanan, Björn W. Schuller, Jianhua Tao:
Emotion and mental state recognition from speech. EURASIP J. Adv. Signal Process. 2012: 15 (2012) - [j32]Jarek Krajewski, Sebastian Schnieder, David Sommer, Anton Batliner, Björn W. Schuller:
Applying multiple classifiers and non-linear dynamics features for detecting sleepiness from speech. Neurocomputing 84: 65-75 (2012) - [j31]Björn W. Schuller, Zixing Zhang, Felix Weninger, Felix Burkhardt:
Synthesized speech for model training in cross-corpus recognition of human emotion. Int. J. Speech Technol. 15(3): 313-323 (2012) - [j30]Björn W. Schuller:
The Computational Paralinguistics Challenge [Social Sciences]. IEEE Signal Process. Mag. 29(4): 97-101 (2012) - [j29]Björn W. Schuller, Ellen Douglas-Cowie, Anton Batliner:
Guest Editorial: Special Section on Naturalistic Affect Resources for System Building and Evaluation. IEEE Trans. Affect. Comput. 3(1): 3-4 (2012) - [j28]Marc Schröder, Elisabetta Bevacqua, Roddy Cowie, Florian Eyben, Hatice Gunes, Dirk Heylen, Mark ter Maat, Gary McKeown, Sathish Pammi, Maja Pantic, Catherine Pelachaud, Björn W. Schuller, Etienne de Sevin, Michel François Valstar, Martin Wöllmer:
Building Autonomous Sensitive Artificial Listeners. IEEE Trans. Affect. Comput. 3(2): 165-183 (2012) - [j27]Angeliki Metallinou, Martin Wöllmer, Athanasios Katsamanis, Florian Eyben, Björn W. Schuller, Shrikanth S. Narayanan:
Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification. IEEE Trans. Affect. Comput. 3(2): 184-198 (2012) - [j26]Felix Weninger, Jarek Krajewski, Anton Batliner, Björn W. Schuller:
The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches. IEEE Trans. Affect. Comput. 3(4): 496-508 (2012) - [j25]Florian Eyben, Martin Wöllmer, Björn W. Schuller:
A multitask approach to continuous five-dimensional affect sensing in natural speech. ACM Trans. Interact. Intell. Syst. 2(1): 6:1-6:29 (2012) - [j24]Bart Baesens, Pantelis Bouboulis, Sergio Cruces, Carlotta Domeniconi, Shiro Ikeda, Xuelong Li, Patricia Melin, Vadrevu Sree Hari Rao, Björn W. Schuller, Yi Shen, Huajin Tang, Cong Wang, Jian Yang, Derong Zhao, Derong Liu:
Neural Networks and Learning Systems Come Together. IEEE Trans. Neural Networks Learn. Syst. 23(1): 1-6 (2012) - [j23]Felix Weninger, Björn W. Schuller:
Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit. J. Signal Process. Syst. 69(3): 267-277 (2012) - [c171]Jun Deng, Wenjing Han, Björn W. Schuller:
Confidence Measures for Speech Emotion Recognition: A Start. ITG Conference on Speech Communication 2012: 1-4 - [c170]Cyril Joder, Björn W. Schuller:
Exploring Nonnegative Matrix Factorization for Audio Classification: Application to Speaker Recognition. ITG Conference on Speech Communication 2012: 1-4 - [c169]Felix Weninger, Martin Wöllmer, Björn W. Schuller:
Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech. ITG Conference on Speech Communication 2012: 1-4 - [c168]Martin Wöllmer, Moritz Kaiser, Florian Eyben, Felix Weninger, Björn W. Schuller, Gerhard Rigoll:
Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face. ITG Conference on Speech Communication 2012: 1-4 - [c167]Zixing Zhang, Felix Weninger, Björn W. Schuller:
Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments. ITG Conference on Speech Communication 2012: 1-4 - [c166]Jürgen T. Geiger, Ravichander Vipperla, Nicholas W. D. Evans, Björn W. Schuller, Gerhard Rigoll:
Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights. EUSIPCO 2012: 340-344 - [c165]Cyril Joder, Felix Weninger, Florian Eyben, David Virette, Björn W. Schuller:
Real-Time Speech Separation by Semi-supervised Nonnegative Matrix Factorization. LVA/ICA 2012: 322-329 - [c164]Felix Weninger, Jordi Feliu, Björn W. Schuller:
Supervised and semi-supervised suppression of background music in monaural speech recordings. ICASSP 2012: 61-64 - [c163]Felix Weninger, Noam Amir, Ofer Amir, Irit Ronen, Florian Eyben, Björn W. Schuller:
Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music. ICASSP 2012: 85-88 - [c162]Zixing Zhang, Björn W. Schuller:
Semi-supervised learning helps in sound event classification. ICASSP 2012: 333-336 - [c161]Björn W. Schuller, Simone Hantke, Felix Weninger, Wenjing Han, Zixing Zhang, Shrikanth S. Narayanan:
Automatic recognition of emotion evoked by general sound events. ICASSP 2012: 341-344 - [c160]Martin Wöllmer, Angeliki Metallinou, Nassos Katsamanis, Björn W. Schuller, Shrikanth S. Narayanan:
Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions. ICASSP 2012: 4157-4160 - [c159]Ravichander Vipperla, Jürgen T. Geiger, Simon Bozonnet, Dong Wang, Nicholas W. D. Evans, Björn W. Schuller, Gerhard Rigoll:
Speech overlap detection and attribution using convolutive non-negative sparse coding. ICASSP 2012: 4181-4184 - [c158]Dmytro Prylipko, Björn W. Schuller, Andreas Wendemuth:
Fine-tuning HMMS for nonverbal vocalizations in spontaneous speech: A multicorpus perspective. ICASSP 2012: 4625-4628 - [c157]Felix Weninger, Martin Wöllmer, Jürgen T. Geiger, Björn W. Schuller, Jort F. Gemmeke, Antti Hurmalainen, Tuomas Virtanen, Gerhard Rigoll:
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize? ICASSP 2012: 4681-4684 - [c156]Florian Eyben, Stavros Petridis, Björn W. Schuller, Maja Pantic:
Audiovisual vocal outburst classification in noisy acoustic conditions. ICASSP 2012: 5097-5100 - [c155]Björn W. Schuller, Michel François Valstar, Roddy Cowie, Maja Pantic:
AVEC 2012: the continuous audio/visual emotion challenge - an introduction. ICMI 2012: 361-362 - [c154]Björn W. Schuller, Michel F. Valstar, Florian Eyben, Roddy Cowie, Maja Pantic:
AVEC 2012: the continuous audio/visual emotion challenge. ICMI 2012: 449-456 - [c153]Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Improving generalisation and robustness of acoustic affect recognition. ICMI 2012: 517-522 - [c152]Wenjing Han, Haifeng Li, Florian Eyben, Lin Ma, Jiayin Sun, Björn W. Schuller:
Preserving actual dynamic trend of emotion in dimensional speech emotion recognition. ICMI 2012: 523-528 - [c151]Felix Weninger, Björn W. Schuller:
Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives. INTERSPEECH 2012: 102-105 - [c150]Björn W. Schuller, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob van Son, Felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi, Benjamin Weiss:
The INTERSPEECH 2012 Speaker Trait Challenge. INTERSPEECH 2012: 254-257 - [c149]Raymond Brueckner, Björn W. Schuller:
Likability Classification - A Not so Deep Neural Network Approach. INTERSPEECH 2012: 290-293 - [c148]Felix Weninger, Martin Wöllmer, Björn W. Schuller:
Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise. INTERSPEECH 2012: 302-305 - [c147]Fabien Ringeval, Mohamed Chetouani, Björn W. Schuller:
Novel Metrics of Speech Rhythm for the Assessment of Emotion. INTERSPEECH 2012: 346-349 - [c146]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings. INTERSPEECH 2012: 350-353 - [c145]Zixing Zhang, Björn W. Schuller:
Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition. INTERSPEECH 2012: 362-365 - [c144]Felix Weninger, Erik Marchi, Björn W. Schuller:
Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender. INTERSPEECH 2012: 1159-1162 - [c143]Jürgen T. Geiger, Ravichander Vipperla, Simon Bozonnet, Nicholas W. D. Evans, Björn W. Schuller, Gerhard Rigoll:
Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization. INTERSPEECH 2012: 2154-2157 - [c142]Jun Deng, Björn W. Schuller:
Confidence Measures in Speech Emotion Recognition Based on Semi-supervised Learning. INTERSPEECH 2012: 2226-2229 - [c141]Wenjing Han, Zixing Zhang, Jun Deng, Martin Wöllmer, Felix Weninger, Björn W. Schuller:
Towards distributed recognition of emotion from speech. ISCCSP 2012: 1-4 - [c140]Cyril Joder, Björn W. Schuller:
Score-Informed Leading Voice Separation from Monaural Audio. ISMIR 2012: 277-282 - [c139]Emanuele Principi, Rudy Rotili, Martin Wöllmer, Stefano Squartini, Björn W. Schuller:
Dominance Detection in a Reverberated Acoustic Scenario. ISNN (1) 2012: 394-402 - [c138]Florian Eyben, Felix Weninger, Nicolas H. Lehment, Gerhard Rigoll, Björn W. Schuller:
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets. MediaEval 2012 - [c137]Hatice Gunes, Björn W. Schuller:
Dimensional and continuous analysis of emotions for multimedia applications: a tutorial overview. ACM Multimedia 2012: 1531-1532 - [c136]Erik Marchi, Anton Batliner, Björn W. Schuller, Shimrit Fridenzon, Shahar Tal, Ofer Golan:
Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance. SocialCom/PASSAT 2012: 961-968 - [c135]Erik Marchi, Björn W. Schuller, Anton Batliner, Shimrit Fridenzon, Shahar Tal, Ofer Golan:
Emotion in the speech of children with autism spectrum conditions: prosody and everything else. WOCCI 2012: 17-24 - [p2]Felix Weninger, Björn W. Schuller, Cynthia C. S. Liem, Frank Kurth, Alan Hanjalic:
Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines. Multimodal Music Processing 2012: 195-216 - 2011
- [j22]Anton Batliner, Stefan Steidl, Björn W. Schuller, Dino Seppi, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Vered Aharonson, Loïc Kessous, Noam Amir:
Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech. Comput. Speech Lang. 25(1): 4-28 (2011) - [j21]Felix Weninger, Björn W. Schuller, Anton Batliner, Stefan Steidl, Dino Seppi:
Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization. EURASIP J. Adv. Signal Process. 2011 (2011) - [j20]Björn W. Schuller:
Affective speaker state analysis in the presence of reverberation. Int. J. Speech Technol. 14(2): 77-87 (2011) - [j19]Martin Wöllmer, Felix Weninger, Florian Eyben, Björn W. Schuller:
Computational Assessment of Interest in Speech - Facing the Real-Life Challenge. Künstliche Intell. 25(3): 225-234 (2011) - [j18]Björn W. Schuller, Anton Batliner, Stefan Steidl:
Introduction to the special issue on sensing emotion and affect - Facing realism in speech processing. Speech Commun. 53(9-10): 1059-1061 (2011) - [j17]Björn W. Schuller, Anton Batliner, Stefan Steidl, Dino Seppi:
Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge. Speech Commun. 53(9-10): 1062-1087 (2011) - [j16]Björn W. Schuller:
Recognizing Affect from Linguistic Information in 3D Continuous Space. IEEE Trans. Affect. Comput. 2(4): 192-205 (2011) - [j15]Martin Wöllmer, Christoph Blaschke, Thomas Schindl, Björn W. Schuller, Berthold Färber, Stefan Mayer, Benjamin Trefflich:
Online Driver Distraction Detection Using Long Short-Term Memory. IEEE Trans. Intell. Transp. Syst. 12(2): 574-582 (2011) - [j14]Martin Wöllmer, Björn W. Schuller, Anton Batliner, Stefan Steidl, Dino Seppi:
Tandem decoding of children's speech for keyword detection in a child-robot interaction scenario. ACM Trans. Speech Lang. Process. 7(4): 12:1-12:22 (2011) - [c134]Björn W. Schuller, Michel François Valstar, Roddy Cowie, Maja Pantic:
The First Audio/Visual Emotion Challenge and Workshop - An Introduction. ACII (2) 2011: 322 - [c133]Björn W. Schuller, Michel François Valstar, Florian Eyben, Gary McKeown, Roddy Cowie, Maja Pantic:
AVEC 2011-The First International Audio/Visual Emotion Challenge. ACII (2) 2011: 415-424 - [c132]Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll:
A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition. ASRU 2011: 36-41 - [c131]Zixing Zhang, Felix Weninger, Martin Wöllmer, Björn W. Schuller:
Unsupervised learning in cross-corpus acoustic emotion recognition. ASRU 2011: 523-528 - [c130]Björn W. Schuller, Felix Weninger:
Ten Recent Trends in Computational Paralinguistics. COST 2102 Training School 2011: 35-49 - [c129]Rudy Rotili, Emanuele Principi, Martin Wöllmer, Stefano Squartini, Björn W. Schuller:
Conversational Speech Recognition in Non-stationary Reverberated Environments. COST 2102 Training School 2011: 50-59 - [c128]Florian Eyben, Martin Wöllmer, Michel François Valstar, Hatice Gunes, Björn W. Schuller, Maja Pantic:
String-based audiovisual fusion of behavioural events for the assessment of dimensional affect. FG 2011: 322-329 - [c127]Marc Schröder, Sathish Pammi, Hatice Gunes, Maja Pantic, Michel François Valstar, Roddy Cowie, Gary McKeown, Dirk Heylen, Mark ter Maat, Florian Eyben, Björn W. Schuller, Martin Wöllmer, Elisabetta Bevacqua, Catherine Pelachaud, Etienne de Sevin:
Come and have an emotional workout with sensitive artificial listeners! FG 2011: 646 - [c126]Hatice Gunes, Björn W. Schuller, Maja Pantic, Roddy Cowie:
Emotion representation, analysis and synthesis in continuous space: A survey. FG 2011: 827-834 - [c125]Felix Weninger, Björn W. Schuller:
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations. ICASSP 2011: 337-340 - [c124]Felix Weninger, Alexander Lehmann, Björn W. Schuller:
OpenBliSSART: Design and evaluation of a research toolkit for Blind Source Separation in Audio Recognition Tasks. ICASSP 2011: 1625-1628 - [c123]Felix Weninger, Jean-Louis Durrieu, Florian Eyben, Gaël Richard, Björn W. Schuller:
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition. ICASSP 2011: 2196-2199 - [c122]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
A multi-stream ASR framework for BLSTM modeling of conversational speech. ICASSP 2011: 4860-4863 - [c121]Christian Landsiedel, Jens Edlund, Florian Eyben, Daniel Neiberg, Björn W. Schuller:
Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks. ICASSP 2011: 5256-5259 - [c120]André Stuhlsatz, Christine Meyer, Florian Eyben, Thomas Zielke, Hans-Günter Meier, Björn W. Schuller:
Deep neural networks for acoustic emotion recognition: Raising the benchmarks. ICASSP 2011: 5688-5691 - [c119]Felix Weninger, Björn W. Schuller, Martin Wöllmer, Gerhard Rigoll:
Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory. ICASSP 2011: 5840-5843 - [c118]Florian Eyben, Stavros Petridis, Björn W. Schuller, George Tzimiropoulos, Stefanos Zafeiriou, Maja Pantic:
Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks. ICASSP 2011: 5844-5847 - [c117]Rudy Rotili, Emanuele Principi, Stefano Squartini, Björn W. Schuller:
Real-Time Speech Recognition in a Multi-talker Reverberated Acoustic Scenario. ICIC (2) 2011: 379-386 - [c116]Martin Wöllmer, Felix Weninger, Florian Eyben, Björn W. Schuller:
Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets. INTERSPEECH 2011: 77-80 - [c115]Jürgen T. Geiger, Mohamed Anouar Lakhal, Björn W. Schuller, Gerhard Rigoll:
Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation. INTERSPEECH 2011: 293-296 - [c114]Martin Wöllmer, Björn W. Schuller, Gerhard Rigoll:
Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context. INTERSPEECH 2011: 1233-1236 - [c113]Björn W. Schuller, Zixing Zhang, Felix Weninger, Gerhard Rigoll:
Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote? INTERSPEECH 2011: 1553-1556 - [c112]Felix Burkhardt, Björn W. Schuller, Benjamin Weiss, Felix Weninger:
"Would You Buy a Car from Me?" - On the Likability of Telephone Voices. INTERSPEECH 2011: 1557-1560 - [c111]Martin Wöllmer, Felix Weninger, Stefan Steidl, Anton Batliner, Björn W. Schuller:
Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments. INTERSPEECH 2011: 3113-3104 - [c110]Björn W. Schuller, Stefan Steidl, Anton Batliner, Florian Schiel, Jarek Krajewski:
The INTERSPEECH 2011 Speaker State Challenge. INTERSPEECH 2011: 3201-3204 - [c109]Elisabetta Bevacqua, Florian Eyben, Dirk Heylen, Mark ter Maat, Sathish Pammi, Catherine Pelachaud, Marc Schröder, Björn W. Schuller, Etienne de Sevin, Martin Wöllmer:
Interacting with Emotional Virtual Agents. INTETAIN 2011: 243-245 - [c108]Felix Weninger, Martin Wöllmer, Björn W. Schuller:
Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race. ISMIR 2011: 37-42 - [c107]Björn W. Schuller, Felix Weninger, Johannes Dorfner:
Multi-Modal Non-Prototypical Music Mood Analysis in Continuous Space: Reliability and Performances. ISMIR 2011: 759-764 - [c106]Martin Wöllmer, Erik Marchi, Stefano Squartini, Björn W. Schuller:
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents. ISNN (2) 2011: 496-505 - [c105]Martin Wöllmer, Björn W. Schuller:
Enhancing Spontaneous Speech Recognition with BLSTM Features. NOLISP 2011: 17-24 - [c104]Rudy Rotili, Emanuele Principi, Stefano Squartini, Björn W. Schuller:
A Real-Time Speech Enhancement Framework for Multi-party Meetings. NOLISP 2011: 80-87 - [c103]Björn W. Schuller, Martin Wöllmer, Florian Eyben, Gerhard Rigoll, Dejan Arsic:
Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits. Semantic Audio 2011 - [p1]Björn W. Schuller:
Voice and Speech Analysis in Search of States and Traits. Computer Analysis of Human Behavior 2011: 227-253 - [e2]Sidney K. D'Mello, Arthur C. Graesser, Björn W. Schuller, Jean-Claude Martin:
Affective Computing and Intelligent Interaction - 4th International Conference, ACII 2011, Memphis, TN, USA, October 9-12, 2011, Proceedings, Part I. Lecture Notes in Computer Science 6974, Springer 2011, ISBN 978-3-642-24599-2 [contents] - [e1]Sidney K. D'Mello, Arthur C. Graesser, Björn W. Schuller, Jean-Claude Martin:
Affective Computing and Intelligent Interaction - Fourth International Conference, ACII 2011, Memphis, TN, USA, October 9-12, 2011, Proceedings, Part II. Lecture Notes in Computer Science 6975, Springer 2011, ISBN 978-3-642-24570-1 [contents] - 2010
- [j13]Florian Eyben, Martin Wöllmer, Tony Poitschke, Björn W. Schuller, Christoph Blaschke, Berthold Färber, Nhu Nguyen-Thien:
Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car. Adv. Hum. Comput. Interact. 2010: 263593:1-263593:17 (2010) - [j12]Anton Batliner, Dino Seppi, Stefan Steidl, Björn W. Schuller:
Segmenting into Adequate Units for Automatic Recognition of Emotion-Related Episodes: A Speech-Based Approach. Adv. Hum. Comput. Interact. 2010: 782802:1-782802:15 (2010) - [j11]Martin Wöllmer, Florian Eyben, Alex Graves, Björn W. Schuller, Gerhard Rigoll:
Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework. Cogn. Comput. 2(3): 180-190 (2010) - [j10]Björn W. Schuller, Johannes Dorfner, Gerhard Rigoll:
Determination of Nonprototypical Valence and Arousal in Popular Music: Features and Performances. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j9]Stefan Steidl, Anton Batliner, Dino Seppi, Björn W. Schuller:
On the Impact of Children's Emotional Speech on Acoustic and Language Models. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j8]Florian Eyben, Martin Wöllmer, Alex Graves, Björn W. Schuller, Ellen Douglas-Cowie, Roddy Cowie:
On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues. J. Multimodal User Interfaces 3(1-2): 7-19 (2010) - [j7]Martin Wöllmer, Björn W. Schuller, Florian Eyben, Gerhard Rigoll:
Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening. IEEE J. Sel. Top. Signal Process. 4(5): 867-881 (2010) - [j6]Björn W. Schuller, Bogdan Vlasenko, Florian Eyben, Martin Wöllmer, André Stuhlsatz, Andreas Wendemuth, Gerhard Rigoll:
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies. IEEE Trans. Affect. Comput. 1(2): 119-131 (2010) - [c102]Martin Wöllmer, Nikolaj Klebert, Björn W. Schuller:
Switching Linear Dynamic Models for Recognition of Emotionally Colored and Noisy Speech. Sprachkommunikation 2010: 1-4 - [c101]Dejan Arsic, Björn W. Schuller:
Real Time Person Tracking and Behavior Interpretation in Multi Camera Scenarios Applying Homography and Coupled HMMs. COST 2102 Conference 2010: 1-18 - [c100]Björn W. Schuller, Tobias Knaup:
Learning and Knowledge-Based Sentiment Analysis in Movie Review Key Excerpts. COST 2102 Training School 2010: 448-472 - [c99]Björn W. Schuller, Felix Weninger, Martin Wöllmer, Yang Sun, Gerhard Rigoll:
Non-negative matrix factorization as noise-robust feature extractor for speech recognition. ICASSP 2010: 4562-4565 - [c98]Björn W. Schuller, Felix Weninger:
Discrimination of speech and non-linguistic vocalizations by Non-Negative Matrix Factorization. ICASSP 2010: 5054-5057 - [c97]Björn W. Schuller, Felix Burkhardt:
Learning with synthesized speech for automatic emotion recognition. ICASSP 2010: 5150-5153 - [c96]Björn W. Schuller, Florian Metze, Stefan Steidl, Anton Batliner, Florian Eyben, Tim Polzehl:
Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote. ICASSP 2010: 5230-5233 - [c95]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder. ICASSP 2010: 5274-5277 - [c94]Florian Metze, Anton Batliner, Florian Eyben, Tim Polzehl, Björn W. Schuller, Stefan Steidl:
Emotion recognition using imperfect speech recognition. INTERSPEECH 2010: 478-481 - [c93]Björn W. Schuller, Laurence Devillers:
Incremental acoustic valence recognition: an inter-corpus perspective on features, matching, and performance in a gating paradigm. INTERSPEECH 2010: 801-804 - [c92]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Recognition of spontaneous conversational speech using long short-term memory phoneme predictions. INTERSPEECH 2010: 1946-1949 - [c91]Martin Wöllmer, Angeliki Metallinou, Florian Eyben, Björn W. Schuller, Shrikanth S. Narayanan:
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling. INTERSPEECH 2010: 2362-2365 - [c90]Björn W. Schuller, Stefan Steidl, Anton Batliner, Felix Burkhardt, Laurence Devillers, Christian A. Müller, Shrikanth S. Narayanan:
The INTERSPEECH 2010 paralinguistic challenge. INTERSPEECH 2010: 2794-2797 - [c89]Martin Wöllmer, Yang Sun, Florian Eyben, Björn W. Schuller:
Long short-term memory networks for noise robust speech recognition. INTERSPEECH 2010: 2966-2969 - [c88]Florian Eyben, Sebastian Böck, Björn W. Schuller, Alex Graves:
Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks. ISMIR 2010: 589-594 - [c87]Björn W. Schuller, Christoph Kozielski, Felix Weninger, Florian Eyben, Gerhard Rigoll:
Vocalist Gender Recognition in Recorded Popular Music. ISMIR 2010: 613-618 - [c86]Björn W. Schuller, Riccardo Zaccarelli, Nicolas Rollet, Laurence Devillers:
CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines. LREC 2010 - [c85]Dejan Arsic, Luis Roalter, Martin Wöllmer, Florian Eyben, Björn W. Schuller, Moritz Kaiser, Matthias Kranz, Gerhard Rigoll:
3d gesture recognition applying long short-term memory and contextual knowledge in a CAVE. MPVA@MM 2010: 33-36 - [c84]Florian Eyben, Martin Wöllmer, Björn W. Schuller:
Opensmile: the munich versatile and fast open-source audio feature extractor. ACM Multimedia 2010: 1459-1462
2000 – 2009
- 2009
- [j5]Björn W. Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll:
Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement. EURASIP J. Audio Speech Music. Process. 2009 (2009) - [j4]Martin Wöllmer, Marc A. Al-Hames, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams. Neurocomputing 73(1-3): 366-380 (2009) - [j3]Björn W. Schuller, Ronald Müller, Florian Eyben, Jürgen Gast, Benedikt Hörnler, Martin Wöllmer, Gerhard Rigoll, Anja Höthker, Hitoshi Konosu:
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application. Image Vis. Comput. 27(12): 1760-1774 (2009) - [c83]Florian Eyben, Martin Wöllmer, Björn W. Schuller:
OpenEAR - Introducing the munich open-source emotion and affect recognition toolkit. ACII 2009: 1-6 - [c82]Marc Schröder, Elisabetta Bevacqua, Florian Eyben, Hatice Gunes, Dirk Heylen, Mark ter Maat, Sathish Pammi, Maja Pantic, Catherine Pelachaud, Björn W. Schuller, Etienne de Sevin, Michel F. Valstar, Martin Wöllmer:
A demonstration of audiovisual sensitive artificial listeners. ACII 2009: 1-2 - [c81]Stefan Steidl, Anton Batliner, Björn W. Schuller, Dino Seppi:
The hinterland of emotions: Facing the open-microphone challenge. ACII 2009: 1-8 - [c80]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll:
Robust vocabulary independent keyword spotting with graphical models. ASRU 2009: 349-353 - [c79]Florian Eyben, Martin Wöllmer, Björn W. Schuller, Alex Graves:
From speech to letters - using a novel neural network architecture for grapheme based ASR. ASRU 2009: 376-380 - [c78]Björn W. Schuller, Bogdan Vlasenko, Florian Eyben, Gerhard Rigoll, Andreas Wendemuth:
Acoustic emotion recognition: A benchmark comparison of performances. ASRU 2009: 552-557 - [c77]Martin Wöllmer, Florian Eyben, Joseph Keshet, Alex Graves, Björn W. Schuller, Gerhard Rigoll:
Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks. ICASSP 2009: 3949-3952 - [c76]Björn W. Schuller, Anton Batliner, Stefan Steidl, Dino Seppi:
Emotion recognition from speech: Putting ASR in the loop. ICASSP 2009: 4585-4588 - [c75]Björn W. Schuller, Joachim Schenk, Gerhard Rigoll, Tobias Knaup:
"The Godfather" vs. "Chaos": Comparing Linguistic Analysis Based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation. ICDAR 2009: 858-862 - [c74]Joachim Schenk, Benedikt Hörnler, Björn W. Schuller, Artur Braun, Gerhard Rigoll:
GMs in On-Line Handwritten Whiteboard Note Recognition: The Influence of Implementation and Modeling. ICDAR 2009: 877-880 - [c73]Dejan Arsic, Benedikt Hörnler, Björn W. Schuller, Gerhard Rigoll:
A hierarchical approach for visual suspicious behavior detection in aircrafts. DPS 2009: 1-7 - [c72]Dejan Arsic, Björn W. Schuller, Benedikt Hörnler, Gerhard Rigoll:
Resolving partial occlusions in crowded environments utilizing range data and video cameras. DPS 2009: 1-6 - [c71]Benedikt Hörnler, Dejan Arsic, Björn W. Schuller, Gerhard Rigoll:
Graphical models for multi-modal automatic video editing in meetings. DPS 2009: 1-8 - [c70]Björn W. Schuller, Benedikt Hörnler, Dejan Arsic, Gerhard Rigoll:
Audio chord labeling by musiological modeling and beat-synchronization. ICME 2009: 526-529 - [c69]Björn W. Schuller, Salman Can, Hubertus Feußner, Martin Wöllmer, Dejan Arsic, Benedikt Hörnler:
Speech control in surgery: A field analysis and strategies. ICME 2009: 1214-1217 - [c68]Benedikt Hörnler, Dejan Arsic, Björn W. Schuller, Gerhard Rigoll:
Boosting multi-modal camera selection with semantic features. ICME 2009: 1298-1301 - [c67]Björn W. Schuller, Stefan Steidl, Anton Batliner:
The INTERSPEECH 2009 emotion challenge. INTERSPEECH 2009: 312-315 - [c66]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Ellen Douglas-Cowie, Roddy Cowie:
Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks. INTERSPEECH 2009: 1595-1598 - [c65]Björn W. Schuller, Gerhard Rigoll:
Recognising interest in conversational speech - comparing bag of frames and supra-segmental features. INTERSPEECH 2009: 1999-2002 - [c64]Martin Wöllmer, Florian Eyben, Björn W. Schuller, Yang Sun, Tobias Moosmayr, Nhu Nguyen-Thien:
Robust in-car spelling recognition - a tandem BLSTM-HMM approach. INTERSPEECH 2009: 2507-2510 - [c63]Martin Wöllmer, Florian Eyben, Alex Graves, Björn W. Schuller, Gerhard Rigoll:
Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture. NOLISP 2009: 68-75 - [c62]Dejan Arsic, Atanas Lyutskanov, Moritz Kaiser, Björn W. Schuller, Gerhard Rigoll:
Applying Bayes Markov chains for the detection of ATM related scenarios. WACV 2009: 1-8 - 2008
- [j2]Björn W. Schuller, Florian Eyben, Gerhard Rigoll:
Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection. EURASIP J. Audio Speech Music. Process. 2008 (2008) - [c61]Björn W. Schuller, Florian Dibiasi, Florian Eyben, Gerhard Rigoll:
Music Thumbnailing Incorporating Harmony- and Rhythm Structure. Adaptive Multimedia Retrieval 2008: 78-88 - [c60]Björn W. Schuller, Martin Wöllmer, Tobias Moosmayr, Günther Ruske, Gerhard Rigoll:
Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition. DAGM-Symposium 2008: 244-253 - [c59]Anton Batliner, Björn W. Schuller, Sonja Schaeffler, Stefan Steidl:
Mothers, adults, children, pets - towards the acoustics of intimacy. ICASSP 2008: 4497-4500 - [c58]Björn W. Schuller, Matthias Wimmer, Lorenz Mösenlechner, Christian Kern, Dejan Arsic, Gerhard Rigoll:
Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space? ICASSP 2008: 4501-4504 - [c57]Dejan Arsic, Encho Hristov, Nicolas H. Lehment, Benedikt Hörnler, Björn W. Schuller, Gerhard Rigoll:
Applying multi layer homography for multi camera person tracking. ICDSC 2008: 1-9 - [c56]Björn W. Schuller, Bogdan Vlasenko, Dejan Arsic, Gerhard Rigoll, Andreas Wendemuth:
Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition. ICME 2008: 1333-1336 - [c55]Björn W. Schuller, Matthias Wimmer, Dejan Arsic, Tobias Moosmayr, Gerhard Rigoll:
Detection of security related affect and behaviour in passenger transport. INTERSPEECH 2008: 265-268 - [c54]Martin Wöllmer, Florian Eyben, Stephan Reiter, Björn W. Schuller, Cate Cox, Ellen Douglas-Cowie, Roddy Cowie:
Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies. INTERSPEECH 2008: 597-600 - [c53]Dino Seppi, Anton Batliner, Björn W. Schuller, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Vered Aharonson:
Patterns, prototypes, performance: classifying emotional user states. INTERSPEECH 2008: 601-604 - [c52]Bogdan Vlasenko, Björn W. Schuller, Kinfe Tadesse Mengistu, Gerhard Rigoll, Andreas Wendemuth:
Balancing spoken content adaptation and unit length in the recognition of emotion and interest. INTERSPEECH 2008: 805-808 - [c51]Björn W. Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll:
Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement. INTERSPEECH 2008: 1789-1792 - [c50]Björn W. Schuller, Xiaohua Zhang, Gerhard Rigoll:
Prosodic and spectral features within segment-based acoustic modeling. INTERSPEECH 2008: 2370-2373 - [c49]Björn W. Schuller, Florian Eyben, Gerhard Rigoll:
Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech. PIT 2008: 99-110 - [c48]Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth, Gerhard Rigoll:
On the Influence of Phonetic Content Variation for Acoustic Emotion Recognition. PIT 2008: 217-220 - [c47]Björn W. Schuller, Gerhard Rigoll, Salman Can, Hubertus Feußner:
Emotion sensitive speech control for human-robot interaction in minimal invasive surgery. RO-MAN 2008: 453-458 - [c46]Matthias Wimmer, Björn W. Schuller, Dejan Arsic, Gerhard Rigoll, Bernd Radig:
Low-Level Fusion of Audio, Video Feature for Multi-Modal Emotion Recognition. VISAPP (2) 2008: 145-151 - [c45]Björn W. Schuller, Anton Batliner, Stefan Steidl, Dino Seppi:
Does affect affect automatic recognition of children2s speech? WOCCI 2008: 14 - [c44]Dino Seppi, Matteo Gerosa, Björn W. Schuller, Anton Batliner, Stefan Steidl:
Detecting problems in spoken child-computer interaction. WOCCI 2008: 15 - 2007
- [b2]Björn W. Schuller:
Mensch, Maschine, Emotion: Erkennung aus sprachlicher und manueller Interaktion. Technical University Munich, VDM 2007, ISBN 978-3-8364-1522-4, pp. 1-239 - [c43]Michael Grimm, Kristian Kroschel, Helen Harris, Clifford Nass, Björn W. Schuller, Gerhard Rigoll, Tobias Moosmayr:
On the Necessity and Feasibility of Detecting a Driver's Emotional State While Driving. ACII 2007: 126-138 - [c42]Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth, Gerhard Rigoll:
Frame vs. Turn-Level: Emotion Recognition from Speech Considering Static and Dynamic Processing. ACII 2007: 139-147 - [c41]Marc Schröder, Laurence Devillers, Kostas Karpouzis, Jean-Claude Martin, Catherine Pelachaud, Christian Peter, Hannes Pirker, Björn W. Schuller, Jianhua Tao, Ian Wilson:
What Should a Generic Emotion Markup Language Be Able to Represent? ACII 2007: 440-451 - [c40]Björn W. Schuller, Bogdan Vlasenko, Ricardo Minguez, Gerhard Rigoll, Andreas Wendemuth:
Comparing one and two-stage acoustic modeling in the recognition of emotion in speech. ASRU 2007: 596-600 - [c39]Björn W. Schuller, Florian Eyben, Gerhard Rigoll:
Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles. ICASSP (1) 2007: 217-220 - [c38]Björn W. Schuller, Dejan Arsic, Gerhard Rigoll, Matthias Wimmer, Bernd Radig:
Audiovisual Behavior Modeling by Combined Feature Spaces. ICASSP (2) 2007: 733-736 - [c37]Björn W. Schuller, Dino Seppi, Anton Batliner, Andreas K. Maier, Stefan Steidl:
Towards More Reality in the Recognition of Emotional Speech. ICASSP (4) 2007: 941-944 - [c36]Florian Eyben, Björn W. Schuller, Stephan Reiter, Gerhard Rigoll:
Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification. ICME 2007: 92-95 - [c35]Stephan Reiter, Björn W. Schuller, Gerhard Rigoll:
Hidden Conditional Random Fields for Meeting Segmentation. ICME 2007: 639-642 - [c34]Dejan Arsic, Björn W. Schuller, Gerhard Rigoll:
Suspicious Behavior Detection in Public Transport by Fusion of Low-Level Video Descriptors. ICME 2007: 2018-2021 - [c33]Björn W. Schuller, Ronald Müller, Benedikt Hörnler, Anja Höthker, Hitoshi Konosu, Gerhard Rigoll:
Audiovisual recognition of spontaneous interest within conversations. ICMI 2007: 30-37 - [c32]Bogdan Vlasenko, Björn W. Schuller, Andreas Wendemuth, Gerhard Rigoll:
Combining frame and turn-level information for robust recognition of emotions within speech. INTERSPEECH 2007: 2249-2252 - [c31]Björn W. Schuller, Anton Batliner, Dino Seppi, Stefan Steidl, Thurid Vogt, Johannes Wagner, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loïc Kessous, Vered Aharonson:
The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals. INTERSPEECH 2007: 2253-2256 - 2006
- [c30]Stephan Reiter, Björn W. Schuller, Gerhard Rigoll:
A Combined LSTM-RNN - HMM - Approach for Meeting Event Segmentation and Recognition. ICASSP (2) 2006: 393-396 - [c29]Dejan Arsic, Joachim Schenk, Björn W. Schuller, Frank Wallhoff, Gerhard Rigoll:
Submotions for Hidden Markov Model Based Dynamic Facial Action Recognition. ICIP 2006: 673-676 - [c28]Björn W. Schuller, Stephan Reiter, Gerhard Rigoll:
Evolutionary Feature Generation in Speech Emotion Recognition. ICME 2006: 5-8 - [c27]Marc A. Al-Hames, Stefan Zettl, Frank Wallhoff, Stephan Reiter, Björn W. Schuller, Gerhard Rigoll:
A Two-Layer Graphical Model for Combined Video Shot and Scene Boundary Detection. ICME 2006: 261-264 - [c26]Frank Wallhoff, Björn W. Schuller, Michael Hawellek, Gerhard Rigoll:
Efficient Recognition of Authentic Dynamic Facial Expressions on the Feedtum Database. ICME 2006: 493-496 - [c25]Stephan Reiter, Björn W. Schuller, Gerhard Rigoll:
Segmentation and Recognition of Meeting Events using a Two-Layered HMM and a Combined MLP-HMM Approach. ICME 2006: 953-956 - [c24]Björn W. Schuller, Frank Wallhoff, Dejan Arsic, Gerhard Rigoll:
Musical Signal Type Discrimination based on Large Open Feature Sets. ICME 2006: 1089-1092 - [c23]Björn W. Schuller, Niels Köhler, Ronald Müller, Gerhard Rigoll:
Recognition of interest in human conversational speech. INTERSPEECH 2006 - [c22]Björn W. Schuller, Gerhard Rigoll:
Timing levels in segment-based speech emotion recognition. INTERSPEECH 2006 - 2005
- [b1]Björn W. Schuller:
Automatische Emotionserkennung aus sprachlicher und manueller Interaktion. Technical University Munich, Germany, 2005, pp. 1-232 - [c21]Björn W. Schuller, Raquel Jiménez Villar, Gerhard Rigoll, Manfred K. Lang:
Meta-Classifiers in Acoustic and Linguistic Feature Fusion-Based Affect Recognition. ICASSP (1) 2005: 325-328 - [c20]Dejan Arsic, Frank Wallhoff, Björn W. Schuller, Gerhard Rigoll:
Video based online behavior detection using probabilistic multi stream fusion. ICIP (2) 2005: 606-609 - [c19]Björn W. Schuller, Brüning J. B. Schmitt, Dejan Arsic, Stephan Reiter, Manfred K. Lang, Gerhard Rigoll:
Feature Selection and Stacking for Robust Discrimination of Speech, Monophonic Singing, and Polyphonic Music. ICME 2005: 840-843 - [c18]Björn W. Schuller, Stephan Reiter, Ronald Müller, Marc A. Al-Hames, Manfred K. Lang, Gerhard Rigoll:
Speaker Independent Speech Emotion Recognition by Ensemble Classification. ICME 2005: 864-867 - [c17]Dejan Arsic, Frank Wallhoff, Björn W. Schuller, Gerhard Rigoll:
Video Based Online Behavior Detection Using Probabilistic Multi Stream Fusion. ICME 2005: 1354-1357 - [c16]Björn W. Schuller, Ronald Müller, Manfred K. Lang, Gerhard Rigoll:
Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles. INTERSPEECH 2005: 805-808 - 2004
- [c15]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. ICASSP (1) 2004: 577-580 - [c14]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Multimodal music retrieval for large databases. ICME 2004: 755-758 - [c13]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Emotion recognition in the manual interaction with graphical user interfaces. ICME 2004: 1215-1218 - [c12]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Discrimination of speech and monophonic singing in continuous audio streams applying multi-layer support vector machines. ICME 2004: 1655-1658 - [c11]Björn W. Schuller, Ronald Müller, Gerhard Rigoll, Manfred K. Lang:
Applying Bayesian belief networks in approximate string matching for robust keyword-based retrieval. ICME 2004: 1999-2002 - 2003
- [c10]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. ICASSP (2) 2003: 1-4 - [c9]Björn W. Schuller, Martin Zobl, Gerhard Rigoll, Manfred K. Lang:
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge. ICME 2003: 57-60 - [c8]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. ICME 2003: 401-404 - [c7]Martin Zobl, Michael Geiger, Björn W. Schuller, Manfred K. Lang, Gerhard Rigoll:
A real-time system for hand gesture controlled operation of in-car devices. ICME 2003: 541-544 - [c6]Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
HMM-based music retrieval using stereophonic feature information and framelength adaptation. ICME 2003: 713-716 - 2002
- [j1]Ralf Nieschulz, Björn W. Schuller, Michael Geiger, Robert Neuss:
Aspekte effizienten Usability Engineerings (Aspects of Efficient Usability Engineering). Informationstechnik Tech. Inform. 44(1): 23-30 (2002) - [c5]Gregor McGlaun, Frank Althoff, Björn W. Schuller, Manfred K. Lang:
A new technique for adjusting distraction moments in multitasking non-field usability tests. CHI Extended Abstracts 2002: 666-667 - [c4]Frank Althoff, Karla Geiss, Gregor McGlaun, Björn W. Schuller, Manfred K. Lang:
Experimental evaluation of user errors at the skill-based level in an automative environment. CHI Extended Abstracts 2002: 782-783 - [c3]Björn W. Schuller, Manfred K. Lang, Gerhard Rigoll:
Multimodal emotion recognition in audiovisual communication. ICME (1) 2002: 745-748 - [c2]Björn W. Schuller:
Towards intuitive speech interaction by the integration of emotional aspects. SMC 2002: 6 - 2001
- [c1]Frank Althoff, Gregor McGlaun, Björn W. Schuller, Peter Morguet, Manfred K. Lang:
Using multimodal interaction to navigate in arbitrary virtual VRML worlds. PUI 2001: 10:1-10:8
Coauthor Index
aka: Alice E. Baird
aka: Alan S. Cowen
aka: Laurence Y. Devillers
aka: Eva-Maria Meßner
aka: Ognjen (Oggi) Rudovic
aka: Dagmar M. Schuller
aka: Michel François Valstar
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-27 22:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint