Yifan Ding

SimulTron: On-Device Simultaneous Speech to Speech Translation

Jun 04, 2024
Via arXiv

Span-Oriented Information Extraction -- A Unifying Perspective on Information Extraction

Mar 18, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

ChatEL: Entity Linking with Chatbots

Feb 20, 2024
Via arXiv

EntGPT: Linking Generative Large Language Models with Knowledge Bases

Feb 09, 2024
Via arXiv

Multi-modal Domain Adaptation for REG via Relation Transfer

Sep 23, 2023
Via arXiv

Translatotron 3: Speech to Speech Translation with Monolingual Data

Jun 01, 2023
Via arXiv

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus

May 30, 2023
Via arXiv

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations

Mar 03, 2023
Via arXiv

Residual Adapters for Few-Shot Text-to-Speech Speaker Adaptation

Oct 28, 2022
Via arXiv