Picture for Kai Yu

Kai Yu

Sherman

Joint decoding method for controllable contextual speech recognition based on Speech LLM

Add code
Aug 12, 2025
Viaarxiv icon

ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge

Add code
Jul 30, 2025
Viaarxiv icon

Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

Add code
Jul 23, 2025
Viaarxiv icon

Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning

Add code
Jun 12, 2025
Viaarxiv icon

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning

Add code
Jun 06, 2025
Viaarxiv icon

Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding

Add code
May 30, 2025
Viaarxiv icon

Towards General Discrete Speech Codec for Complex Acoustic Environments: A Study of Reconstruction and Downstream Task Consistency

Add code
May 28, 2025
Viaarxiv icon

HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer

Add code
May 28, 2025
Viaarxiv icon

NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering

Add code
May 26, 2025
Viaarxiv icon

Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling

Add code
May 26, 2025
Viaarxiv icon