"speech": models, code, and papers

CPPF: A contextual and post-processing-free model for automatic speech recognition

Sep 14, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model

Jul 29, 2023
S. Rijal, R. Neupane, S. P. Mainali, S. K. Regmi, S. Maharjan

Prosody Analysis of Audiobooks

Oct 10, 2023
Charuta Pethe, Yunting Yin, Steven Skiena

Task-Agnostic Low-Rank Adapters for Unseen English Dialects

Nov 02, 2023
Zedian Xiao, William Held, Yanchen Liu, Diyi Yang

Identifying Context-Dependent Translations for Evaluation Set Production

Nov 04, 2023
Rachel Wicks, Matt Post

On the Opportunities of Green Computing: A Survey

Nov 06, 2023
You Zhou, Xiujing Lin, Xiang Zhang, Maolin Wang, Gangwei Jiang, Huakang Lu, Yupeng Wu, Kai Zhang, Zhe Yang, Kehang Wang, Yongduo Sui, Fengwei Jia, Zuoli Tang, Yao Zhao, Hongxuan Zhang, Tiannuo Yang, Weibo Chen, Yunong Mao, Yi Li, De Bao, Yu Li, Hongrui Liao, Ting Liu, Jingwen Liu, Jinchi Guo, Jin Zhao, Xiangyu Zhao, Ying Wei, Hong Qian, Qi Liu, Xiang Wang, Wai Kin Chan, Chenliang Li, Yusen Li, Shiyu Yang, Jining Yan, Chao Mou, Shuai Han, Wuxia Jin, Guannan Zhang, Xiaodong Zeng

Securing Voice Biometrics: One-Shot Learning Approach for Audio Deepfake Detection

Oct 05, 2023
Awais Khan, Khalid Mahmood Malik

MM-VID: Advancing Video Understanding with GPT-4V(ision)

Oct 30, 2023
Kevin Lin, Faisal Ahmed, Linjie Li, Chung-Ching Lin, Ehsan Azarnasab, Zhengyuan Yang, Jianfeng Wang, Lin Liang, Zicheng Liu, Yumao Lu, Ce Liu, Lijuan Wang

Challenges and Insights: Exploring 3D Spatial Features and Complex Networks on the MISP Dataset

Oct 05, 2023
Yiwen Shao

Speech-Driven 3D Face Animation with Composite and Regional Facial Movements

Aug 10, 2023
Haozhe Wu, Songtao Zhou, Jia Jia, Junliang Xing, Qi Wen, Xiang Wen
