Picture for Tom Ko

Tom Ko

Learning Retrieval Augmentation for Personalized Dialogue Generation

Add code
Jun 27, 2024
Figure 1 for Learning Retrieval Augmentation for Personalized Dialogue Generation
Figure 2 for Learning Retrieval Augmentation for Personalized Dialogue Generation
Figure 3 for Learning Retrieval Augmentation for Personalized Dialogue Generation
Figure 4 for Learning Retrieval Augmentation for Personalized Dialogue Generation
Viaarxiv icon

Selective Prompting Tuning for Personalized Conversations with LLMs

Add code
Jun 26, 2024
Figure 1 for Selective Prompting Tuning for Personalized Conversations with LLMs
Figure 2 for Selective Prompting Tuning for Personalized Conversations with LLMs
Figure 3 for Selective Prompting Tuning for Personalized Conversations with LLMs
Figure 4 for Selective Prompting Tuning for Personalized Conversations with LLMs
Viaarxiv icon

Speech Translation with Large Language Models: An Industrial Practice

Add code
Dec 21, 2023
Viaarxiv icon

RepCodec: A Speech Representation Codec for Speech Tokenization

Add code
Aug 31, 2023
Figure 1 for RepCodec: A Speech Representation Codec for Speech Tokenization
Figure 2 for RepCodec: A Speech Representation Codec for Speech Tokenization
Figure 3 for RepCodec: A Speech Representation Codec for Speech Tokenization
Figure 4 for RepCodec: A Speech Representation Codec for Speech Tokenization
Viaarxiv icon

Recent Advances in Direct Speech-to-text Translation

Add code
Jun 20, 2023
Figure 1 for Recent Advances in Direct Speech-to-text Translation
Figure 2 for Recent Advances in Direct Speech-to-text Translation
Viaarxiv icon

MOSPC: MOS Prediction Based on Pairwise Comparison

Add code
Jun 18, 2023
Figure 1 for MOSPC: MOS Prediction Based on Pairwise Comparison
Figure 2 for MOSPC: MOS Prediction Based on Pairwise Comparison
Figure 3 for MOSPC: MOS Prediction Based on Pairwise Comparison
Figure 4 for MOSPC: MOS Prediction Based on Pairwise Comparison
Viaarxiv icon

PolyVoice: Language Models for Speech to Speech Translation

Add code
Jun 13, 2023
Figure 1 for PolyVoice: Language Models for Speech to Speech Translation
Figure 2 for PolyVoice: Language Models for Speech to Speech Translation
Figure 3 for PolyVoice: Language Models for Speech to Speech Translation
Figure 4 for PolyVoice: Language Models for Speech to Speech Translation
Viaarxiv icon

CTC-based Non-autoregressive Speech Translation

Add code
May 27, 2023
Figure 1 for CTC-based Non-autoregressive Speech Translation
Figure 2 for CTC-based Non-autoregressive Speech Translation
Figure 3 for CTC-based Non-autoregressive Speech Translation
Figure 4 for CTC-based Non-autoregressive Speech Translation
Viaarxiv icon

DUB: Discrete Unit Back-translation for Speech Translation

Add code
May 19, 2023
Figure 1 for DUB: Discrete Unit Back-translation for Speech Translation
Figure 2 for DUB: Discrete Unit Back-translation for Speech Translation
Figure 3 for DUB: Discrete Unit Back-translation for Speech Translation
Figure 4 for DUB: Discrete Unit Back-translation for Speech Translation
Viaarxiv icon

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Add code
Mar 30, 2023
Figure 1 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Figure 2 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Figure 3 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Figure 4 for WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
Viaarxiv icon