Deep learning

ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale

August 16, 2024
Natural language processing Deep learning Cross-modal adaptation Sentence embeddings Semantic matching Probing classifiers

Analysis of Joint Speech-Text Embeddings for Semantic Matching

January 9, 2022
Natural language processing Deep learning Cross-modal adaptation Sentence embeddings Semantic matching Probing classifiers

Polyphonic Sound Event Detection: Phonetic Features and Environmental Sounds

December 10, 2021
Deep learning Speech processing Multi-label classification Speech articulatory attributes Acoustic event detection Maximal figure-of-merit Non-decomposable objective functions

DigiMo - Towards Developing an Emotional Intelligent Chatbot in Singapore

April 25, 2020
Natural language interaction Deep learning Data annotation Emotion Chatbot Expert evaluation

The I2R’s Submission To VOiCES Distance Speaker Recognition Challenge 2019

September 15, 2019
Audio tagging Multi-label classification Equal error rate EER Convolutional neural networks Convolutional recurrent neural networks Maximal figure-of-merit MFoM Deep learning t-SNE