Picture for Xianyu Zhao

Xianyu Zhao

Phonemes vs. Projectors: An Investigation of Speech-Language Interfaces for LLM-based ASR

Add code
Apr 10, 2026
Viaarxiv icon

Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition

Add code
Mar 31, 2026
Viaarxiv icon

Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation

Add code
Aug 25, 2025
Figure 1 for Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation
Figure 2 for Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation
Figure 3 for Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation
Figure 4 for Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation
Viaarxiv icon

An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought

Add code
Jul 22, 2024
Figure 1 for An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
Figure 2 for An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
Figure 3 for An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
Figure 4 for An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
Viaarxiv icon