Li Shen

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

May 30, 2025
Via arXiv

Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging

May 26, 2025
Via arXiv

Decision Flow Policy Optimization

May 26, 2025
Via arXiv

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

May 26, 2025
Via arXiv

MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval

May 26, 2025
Via arXiv

Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval

May 26, 2025
Via arXiv

Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning

May 26, 2025
Via arXiv

R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO

May 22, 2025
Via arXiv

FreshRetailNet-50K: A Stockout-Annotated Censored Demand Dataset for Latent Demand Recovery and Forecasting in Fresh Retail

May 22, 2025
Via arXiv

R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search

May 22, 2025
Via arXiv