Bo Zheng

additional authors not shown

Complementary Reinforcement Learning

Mar 18, 2026
Via arXiv

SecAgent: Efficient Mobile GUI Agent with Semantic Context

Mar 09, 2026
Via arXiv

Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning

Mar 02, 2026
Via arXiv

MAC: A Conversion Rate Prediction Benchmark Featuring Labels Under Multiple Attribution Mechanisms

Mar 02, 2026
Via arXiv

RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning

Feb 25, 2026
Via arXiv

Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search

Feb 14, 2026
Via arXiv

SpiralFormer: Looped Transformers Can Learn Hierarchical Dependencies via Multi-Resolution Recursion

Feb 12, 2026
Via arXiv

Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty

Feb 12, 2026
Via arXiv

Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning

Feb 10, 2026
Via arXiv

E-VAds: An E-commerce Short Videos Understanding Benchmark for MLLMs

Feb 09, 2026
Via arXiv