2022 |
Schuller, Björn; Batliner, Anton; Amiriparian, Shahin; Bergler, Christian; Gerczuk, Maurice; Holz, Natalie; Larrouy-Maestri, Pauline; Bayerl, Sebastien; Riedhammer, Korbinian; Mallol-Ragolta, Adria; Pateraki, Maria; Coppock, Harry; Kiskin, Ivan; Sinka, Marianne; Roberts, Stephen The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes Proceedings Article In: Proceedings of the 30th ACM International Conference on Multimedia, pp. 7120–7124, Association for Computing Machinery, Lisboa, Portugal, 2022, ISBN: 9781450392037. Abstract | Links | BibTeX | Tags: Benchmark, Challenge, Computational Paralinguistics, Human Activity Recognition, Mosquito Detection, Stuttering, Vocalisations @inproceedings{Schuller2022, The ACM Multimedia 2022 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: In the Vocalisations and Stuttering Sub-Challenges, a classification on human non-verbal vocalisations and speech has to be made; the Activity Sub-Challenge aims at beyond-audio human activity recognition from smartwatch sensor data; and in the Mosquitoes Sub-Challenge, mosquitoes need to be detected. We describe the Sub-Challenges, baseline feature extraction, and classifiers based on the 'usual' ComParE and BoAW features, the auDeep toolkit, and deep feature extraction from pre-trained CNNs using the DeepSpectrum toolkit; in addition, we add end-to-end sequential modelling, and a log-mel-128-BNN. |
Mallol-Ragolta, Adria; Semertzidou, Anastasia; Pateraki, Maria; Schuller, Björn Outer Product-Based Fusion of Smartwatch Sensor Data for Human Activity Recognition Journal Article In: Frontiers in Computer Science, vol. 4, 2022, ISSN: 2624-9898. Abstract | Links | BibTeX | Tags: artificial intelligence, Human Activity Recognition, multimodal fusion, smartwatch sensor data, ubiquitous computing @article{Mallol2022, The advent of IoT devices in combination with Human Activity Recognition (HAR) technologies can contribute to battle with sedentariness by continuously monitoring the users' daily activities. With this information, autonomous systems could detect users' physical weaknesses and plan personalized training routines to improve them. This work investigates the multimodal fusion of smartwatch sensor data for HAR. Specifically, we exploit pedometer, heart rate, and accelerometer information to train unimodal and multimodal models for the task at hand. The models are trained end-to-end, and we compare the performance of dedicated Recurrent Neural Network-based (RNN) and Convolutional Neural Network-based (CNN) architectures to extract deep learnt representations from the input modalities. To fuse the embedded representations when training the multimodal models, we investigate a concatenation-based and an outer product-based approach. This work explores the harAGE dataset, a new dataset for HAR collected using a Garmin Vivoactive 3 device with more than 17 h of data. Our best models obtain an Unweighted Average Recall (UAR) of 95.6, 69.5, and 60.8% when tackling the task as a 2-class, 7-class, and 10-class classification problem, respectively. These performances are obtained using multimodal models that fuse the embedded representations extracted with dedicated CNN-based architectures from the pedometer, heart rate, and accelerometer modalities. The concatenation-based fusion scores the highest UAR in the 2-class classification problem, while the outer product-based fusion obtains the best performances in the 7-class and the 10-class classification problems. |
2022 |
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes Proceedings Article In: Proceedings of the 30th ACM International Conference on Multimedia, pp. 7120–7124, Association for Computing Machinery, Lisboa, Portugal, 2022, ISBN: 9781450392037. |
Outer Product-Based Fusion of Smartwatch Sensor Data for Human Activity Recognition Journal Article In: Frontiers in Computer Science, vol. 4, 2022, ISSN: 2624-9898. |