Picture for Jingbo Zhu

Jingbo Zhu

Dissecting Long Reasoning Models: An Empirical Study

Add code
Jun 05, 2025
Viaarxiv icon

Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation

Add code
May 21, 2025
Viaarxiv icon

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Add code
Mar 09, 2025
Viaarxiv icon

Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders

Add code
Feb 21, 2025
Viaarxiv icon

Foundations of Large Language Models

Add code
Jan 16, 2025
Viaarxiv icon

Optimizing Speech Multi-View Feature Fusion through Conditional Computation

Add code
Jan 14, 2025
Viaarxiv icon

Boosting Text-To-Image Generation via Multilingual Prompting in Large Multimodal Models

Add code
Jan 13, 2025
Viaarxiv icon

SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment

Add code
Jan 07, 2025
Viaarxiv icon

Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization

Add code
Dec 02, 2024
Figure 1 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Figure 2 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Figure 3 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Figure 4 for Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization
Viaarxiv icon

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Add code
Nov 05, 2024
Viaarxiv icon