Alert button

"speech": models, code, and papers
Alert button

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Add code
Bookmark button
Alert button
Feb 20, 2024
Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialung Zuo, Shulei Wang, Zhou Zhao

Viaarxiv icon

Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance

Mar 17, 2024
Suryam Arnav Kalra, Arindam Biswas, Pabitra Mitra, Biswajit Basu

Figure 1 for Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance
Figure 2 for Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance
Figure 3 for Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance
Figure 4 for Graph Expansion in Pruned Recurrent Neural Network Layers Preserve Performance
Viaarxiv icon

Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models

Add code
Bookmark button
Alert button
Mar 18, 2024
Linus Nwankwo, Elmar Rueckert

Figure 1 for Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models
Figure 2 for Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models
Viaarxiv icon

ScanTalk: 3D Talking Heads from Unregistered Scans

Mar 19, 2024
Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, Mohamed Daoudi

Figure 1 for ScanTalk: 3D Talking Heads from Unregistered Scans
Figure 2 for ScanTalk: 3D Talking Heads from Unregistered Scans
Figure 3 for ScanTalk: 3D Talking Heads from Unregistered Scans
Figure 4 for ScanTalk: 3D Talking Heads from Unregistered Scans
Viaarxiv icon

HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models

Add code
Bookmark button
Alert button
Mar 18, 2024
Huy Nghiem, Hal Daumé III

Figure 1 for HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
Figure 2 for HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
Figure 3 for HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
Figure 4 for HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
Viaarxiv icon

Don't Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection

Feb 26, 2024
Min Zhang, Jianfeng He, Taoran Ji, Chang-Tien Lu

Viaarxiv icon

Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations

Feb 10, 2024
Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

Viaarxiv icon

The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning

Add code
Bookmark button
Alert button
Feb 21, 2024
Nik Vaessen, David A. van Leeuwen

Viaarxiv icon

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey

Mar 02, 2024
Hamza Kheddar, Mustapha Hemis, Yassine Himeur

Figure 1 for Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Figure 2 for Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Figure 3 for Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Figure 4 for Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Viaarxiv icon

Beyond Hate Speech: NLP's Challenges and Opportunities in Uncovering Dehumanizing Language

Feb 21, 2024
Hezhao Zhang, Lasana Harris, Nafise Sadat Moosavi

Viaarxiv icon