Chao-Han Huck Yang

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models

Oct 16, 2023
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting

Oct 10, 2023
Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke

Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Oct 10, 2023
Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

Generative Speech Recognition Error Correction with Large Language Models

Sep 27, 2023
Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Sep 26, 2023
Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko

Can Whisper perform speech-based in-context learning

Sep 13, 2023
Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang

Causal Video Summarizer for Video Exploration

Jul 04, 2023
Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Andrew Brown, Marcel Worring

How to Estimate Model Transferability of Pre-Trained Speech Models?

Jun 01, 2023
Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath

A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models

Jun 01, 2023
Pin-Jui Ku, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Chin-Hui Lee
