Alert button
Picture for Yangyang Shi

Yangyang Shi

Alert button

Enhance audio generation controllability through representation similarity regularization

Add code
Bookmark button
Alert button
Sep 15, 2023
Yangyang Shi, Gael Le Lan, Varun Nagaraja, Zhaoheng Ni, Xinhao Mei, Ernie Chang, Forrest Iandola, Yang Liu, Vikas Chandra

Figure 1 for Enhance audio generation controllability through representation similarity regularization
Figure 2 for Enhance audio generation controllability through representation similarity regularization
Figure 3 for Enhance audio generation controllability through representation similarity regularization
Figure 4 for Enhance audio generation controllability through representation similarity regularization
Viaarxiv icon

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

Add code
Bookmark button
Alert button
Aug 25, 2023
Mei-Yuh Hwang, Yangyang Shi, Ankit Ramchandani, Guan Pang, Praveen Krishnan, Lucas Kabela, Frank Seide, Samyak Datta, Jun Liu

Figure 1 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 2 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 3 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 4 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Viaarxiv icon

Revisiting Sample Size Determination in Natural Language Understanding

Add code
Bookmark button
Alert button
Jul 01, 2023
Ernie Chang, Muhammad Hassan Rashid, Pin-Jie Lin, Changsheng Zhao, Vera Demberg, Yangyang Shi, Vikas Chandra

Figure 1 for Revisiting Sample Size Determination in Natural Language Understanding
Figure 2 for Revisiting Sample Size Determination in Natural Language Understanding
Figure 3 for Revisiting Sample Size Determination in Natural Language Understanding
Figure 4 for Revisiting Sample Size Determination in Natural Language Understanding
Viaarxiv icon

Binary and Ternary Natural Language Generation

Add code
Bookmark button
Alert button
Jun 02, 2023
Zechun Liu, Barlas Oguz, Aasish Pappu, Yangyang Shi, Raghuraman Krishnamoorthi

Figure 1 for Binary and Ternary Natural Language Generation
Figure 2 for Binary and Ternary Natural Language Generation
Figure 3 for Binary and Ternary Natural Language Generation
Figure 4 for Binary and Ternary Natural Language Generation
Viaarxiv icon

LLM-QAT: Data-Free Quantization Aware Training for Large Language Models

Add code
Bookmark button
Alert button
May 29, 2023
Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra

Figure 1 for LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Figure 2 for LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Figure 3 for LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Figure 4 for LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Viaarxiv icon

Multi-Head State Space Model for Speech Recognition

Add code
Bookmark button
Alert button
May 25, 2023
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales

Figure 1 for Multi-Head State Space Model for Speech Recognition
Figure 2 for Multi-Head State Space Model for Speech Recognition
Figure 3 for Multi-Head State Space Model for Speech Recognition
Figure 4 for Multi-Head State Space Model for Speech Recognition
Viaarxiv icon

Improving Fast-slow Encoder based Transducer with Streaming Deliberation

Add code
Bookmark button
Alert button
Dec 15, 2022
Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le

Figure 1 for Improving Fast-slow Encoder based Transducer with Streaming Deliberation
Figure 2 for Improving Fast-slow Encoder based Transducer with Streaming Deliberation
Figure 3 for Improving Fast-slow Encoder based Transducer with Streaming Deliberation
Figure 4 for Improving Fast-slow Encoder based Transducer with Streaming Deliberation
Viaarxiv icon

LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting

Add code
Bookmark button
Alert button
Nov 09, 2022
Haichuan Yang, Zhaojun Yang, Li Wan, Biqiao Zhang, Yangyang Shi, Yiteng Huang, Ivaylo Enchev, Limin Tang, Raziel Alvarez, Ming Sun, Xin Lei, Raghuraman Krishnamoorthi, Vikas Chandra

Figure 1 for LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting
Figure 2 for LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting
Figure 3 for LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting
Figure 4 for LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting
Viaarxiv icon

Biased Self-supervised learning for ASR

Add code
Bookmark button
Alert button
Nov 04, 2022
Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdelrahman Mohamed, Philip C. Woodland

Figure 1 for Biased Self-supervised learning for ASR
Figure 2 for Biased Self-supervised learning for ASR
Figure 3 for Biased Self-supervised learning for ASR
Viaarxiv icon

SCA: Streaming Cross-attention Alignment for Echo Cancellation

Add code
Bookmark button
Alert button
Nov 01, 2022
Yang Liu, Yangyang Shi, Yun Li, Kaustubh Kalgaonkar, Sriram Srinivasan, Xin Lei

Figure 1 for SCA: Streaming Cross-attention Alignment for Echo Cancellation
Figure 2 for SCA: Streaming Cross-attention Alignment for Echo Cancellation
Figure 3 for SCA: Streaming Cross-attention Alignment for Echo Cancellation
Figure 4 for SCA: Streaming Cross-attention Alignment for Echo Cancellation
Viaarxiv icon