Picture for Yang Xiang

Yang Xiang

Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations

Add code
Jul 16, 2025
Figure 1 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Figure 2 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Figure 3 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Figure 4 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Viaarxiv icon

KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model

Add code
Jun 26, 2025
Figure 1 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Figure 2 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Figure 3 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Figure 4 for KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model
Viaarxiv icon

XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration

Add code
May 27, 2025
Viaarxiv icon

Evaluating and Steering Modality Preferences in Multimodal Large Language Model

Add code
May 27, 2025
Viaarxiv icon

Enhancing Generalization of Speech Large Language Models with Multi-Task Behavior Imitation and Speech-Text Interleaving

Add code
May 24, 2025
Viaarxiv icon

A Semantic Information-based Hierarchical Speech Enhancement Method Using Factorized Codec and Diffusion Model

Add code
May 20, 2025
Viaarxiv icon

ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation

Add code
Mar 10, 2025
Figure 1 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Figure 2 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Figure 3 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Figure 4 for ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Viaarxiv icon

ASurvey: Spatiotemporal Consistency in Video Generation

Add code
Feb 25, 2025
Viaarxiv icon

Exploiting Epistemic Uncertainty in Cold-Start Recommendation Systems

Add code
Feb 22, 2025
Figure 1 for Exploiting Epistemic Uncertainty in Cold-Start Recommendation Systems
Figure 2 for Exploiting Epistemic Uncertainty in Cold-Start Recommendation Systems
Figure 3 for Exploiting Epistemic Uncertainty in Cold-Start Recommendation Systems
Viaarxiv icon

Learning-based A Posteriori Speech Presence Probability Estimation and Applications

Add code
Jan 23, 2025
Figure 1 for Learning-based A Posteriori Speech Presence Probability Estimation and Applications
Figure 2 for Learning-based A Posteriori Speech Presence Probability Estimation and Applications
Figure 3 for Learning-based A Posteriori Speech Presence Probability Estimation and Applications
Figure 4 for Learning-based A Posteriori Speech Presence Probability Estimation and Applications
Viaarxiv icon