Alert button
Picture for Huaming Wang

Huaming Wang

Alert button

PAM: Prompting Audio-Language Models for Audio Quality Assessment

Add code
Bookmark button
Alert button
Feb 01, 2024
Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang

Viaarxiv icon

NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription

Add code
Bookmark button
Alert button
Jan 16, 2024
Alon Vinnikov, Amir Ivry, Aviv Hurvitz, Igor Abramovski, Sharon Koubi, Ilya Gurvich, Shai Pe`er, Xiong Xiao, Benjamin Martinez Elizalde, Naoyuki Kanda, Xiaofei Wang, Shalev Shaer, Stav Yagev, Yossi Asher, Sunit Sivasankaran, Yifan Gong, Min Tang, Huaming Wang, Eyal Krupka

Viaarxiv icon

Prompting Audios Using Acoustic Properties For Emotion Representation

Add code
Bookmark button
Alert button
Oct 05, 2023
Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

Figure 1 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 2 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 3 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 4 for Prompting Audios Using Acoustic Properties For Emotion Representation
Viaarxiv icon

Training Audio Captioning Models without Audio

Add code
Bookmark button
Alert button
Sep 14, 2023
Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang

Figure 1 for Training Audio Captioning Models without Audio
Figure 2 for Training Audio Captioning Models without Audio
Figure 3 for Training Audio Captioning Models without Audio
Figure 4 for Training Audio Captioning Models without Audio
Viaarxiv icon

Natural Language Supervision for General-Purpose Audio Representations

Add code
Bookmark button
Alert button
Sep 11, 2023
Benjamin Elizalde, Soham Deshmukh, Huaming Wang

Viaarxiv icon

Pengi: An Audio Language Model for Audio Tasks

Add code
Bookmark button
Alert button
May 19, 2023
Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang

Figure 1 for Pengi: An Audio Language Model for Audio Tasks
Figure 2 for Pengi: An Audio Language Model for Audio Tasks
Figure 3 for Pengi: An Audio Language Model for Audio Tasks
Figure 4 for Pengi: An Audio Language Model for Audio Tasks
Viaarxiv icon

Real-Time Audio-Visual End-to-End Speech Enhancement

Add code
Bookmark button
Alert button
Mar 13, 2023
Zirun Zhu, Hemin Yang, Min Tang, Ziyi Yang, Sefik Emre Eskimez, Huaming Wang

Figure 1 for Real-Time Audio-Visual End-to-End Speech Enhancement
Figure 2 for Real-Time Audio-Visual End-to-End Speech Enhancement
Figure 3 for Real-Time Audio-Visual End-to-End Speech Enhancement
Viaarxiv icon

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

Add code
Bookmark button
Alert button
Mar 07, 2023
Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Figure 1 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Figure 2 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Figure 3 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Figure 4 for Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Viaarxiv icon

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

Add code
Bookmark button
Alert button
Jan 05, 2023
Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Figure 1 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Figure 2 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Figure 3 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Figure 4 for Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Viaarxiv icon

Learning to mask: Towards generalized face forgery detection

Add code
Bookmark button
Alert button
Dec 29, 2022
Jianwei Fei, Yunshu Dai, Huaming Wang, Zhihua Xia

Figure 1 for Learning to mask: Towards generalized face forgery detection
Figure 2 for Learning to mask: Towards generalized face forgery detection
Figure 3 for Learning to mask: Towards generalized face forgery detection
Figure 4 for Learning to mask: Towards generalized face forgery detection
Viaarxiv icon