Picture for Bo Zheng

Bo Zheng

additional authors not shown

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Add code
Jun 06, 2025
Viaarxiv icon

Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding

Add code
May 30, 2025
Viaarxiv icon

Weight Spectra Induced Efficient Model Adaptation

Add code
May 29, 2025
Viaarxiv icon

Differentiable Solver Search for Fast Diffusion Sampling

Add code
May 27, 2025
Viaarxiv icon

Beyond Safe Answers: A Benchmark for Evaluating True Risk Awareness in Large Reasoning Models

Add code
May 26, 2025
Viaarxiv icon

USB: A Comprehensive and Unified Safety Evaluation Benchmark for Multimodal Large Language Models

Add code
May 26, 2025
Viaarxiv icon

LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization

Add code
May 23, 2025
Viaarxiv icon

EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications

Add code
May 23, 2025
Viaarxiv icon

NAN: A Training-Free Solution to Coefficient Estimation in Model Merging

Add code
May 22, 2025
Viaarxiv icon

Think-J: Learning to Think for Generative LLM-as-a-Judge

Add code
May 20, 2025
Viaarxiv icon