Picture for Jinyu Li

Jinyu Li

Beijing Institute of Technology, China

Is Text All You Need? Text as a Universal Information Bottleneck for Speech LLMs

Add code
Jun 08, 2026
Viaarxiv icon

LLM can Read Spectrogram: Encoder-free Speech-Language Modeling

Add code
Jun 08, 2026
Viaarxiv icon

Reasoning Matters: Mitigate Hallucination in Multimodal Large Reasoning Models via Reasoning-Conditioned Preference Optimization

Add code
May 27, 2026
Viaarxiv icon

UNIQUE: Universal Top-k Sparse Attention for Training-free Inference and Sparsity-aware Training

Add code
May 26, 2026
Viaarxiv icon

Speech LLMs are Contextual Reasoning Transcribers

Add code
Apr 01, 2026
Viaarxiv icon

Prior Knowledge-enhanced Spatio-temporal Epidemic Forecasting

Add code
Feb 25, 2026
Viaarxiv icon

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Add code
Feb 15, 2026
Viaarxiv icon

Seeing Through the Chain: Mitigate Hallucination in Multimodal Reasoning Models via CoT Compression and Contrastive Preference Optimization

Add code
Feb 03, 2026
Viaarxiv icon

RLBR: Reinforcement Learning with Biasing Rewards for Contextual Speech Large Language Models

Add code
Jan 19, 2026
Viaarxiv icon

Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model

Add code
Jan 12, 2026
Viaarxiv icon