Picture for Zhuoran Li

Zhuoran Li

Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning

Add code
Jan 12, 2026
Viaarxiv icon

Scoring, Reasoning, and Selecting the Best! Ensembling Large Language Models via a Peer-Review Process

Add code
Dec 29, 2025
Viaarxiv icon

Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network

Add code
Dec 22, 2025
Figure 1 for Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network
Figure 2 for Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network
Figure 3 for Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network
Figure 4 for Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network
Viaarxiv icon

Chirp Delay-Doppler Domain Modulation Based Joint Communication and Radar for Autonomous Vehicles

Add code
Dec 20, 2025
Viaarxiv icon

Radiation Pattern Reconfigurable FAS-Empowered Interference-Resilient UAV Communication

Add code
Oct 01, 2025
Viaarxiv icon

OM2P: Offline Multi-Agent Mean-Flow Policy

Add code
Aug 08, 2025
Viaarxiv icon

Reparameterization Proximal Policy Optimization

Add code
Aug 08, 2025
Figure 1 for Reparameterization Proximal Policy Optimization
Figure 2 for Reparameterization Proximal Policy Optimization
Figure 3 for Reparameterization Proximal Policy Optimization
Figure 4 for Reparameterization Proximal Policy Optimization
Viaarxiv icon

Proxy-Free GFlowNet

Add code
May 26, 2025
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

Chirp Delay-Doppler Domain Modulation: A New Paradigm of Integrated Sensing and Communication for Autonomous Vehicles

Add code
May 22, 2025
Viaarxiv icon