Picture for Hung-yi Lee

Hung-yi Lee

LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play

Add code
May 10, 2024
Figure 1 for LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Figure 2 for LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Figure 3 for LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Figure 4 for LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Viaarxiv icon

A Large-Scale Evaluation of Speech Foundation Models

Add code
Apr 15, 2024
Viaarxiv icon

Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations

Add code
Feb 23, 2024
Figure 1 for Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Figure 2 for Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Figure 3 for Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Figure 4 for Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Viaarxiv icon

Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations

Add code
Feb 20, 2024
Viaarxiv icon

Towards audio language modeling -- an overview

Add code
Feb 20, 2024
Viaarxiv icon

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Add code
Feb 20, 2024
Figure 1 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 2 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 3 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 4 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Viaarxiv icon

SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

Add code
Feb 10, 2024
Viaarxiv icon

Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model

Add code
Feb 08, 2024
Viaarxiv icon

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

Add code
Feb 06, 2024
Viaarxiv icon

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

Add code
Jan 24, 2024
Figure 1 for SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Figure 2 for SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Figure 3 for SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Viaarxiv icon