Yisheng Lv

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Apr 21, 2025

Offline Reinforcement Learning with Discrete Diffusion Skills

Mar 26, 2025

Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving

Mar 09, 2025

UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility

Jan 04, 2025

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Oct 01, 2024

MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving

Sep 11, 2024

MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering

Aug 21, 2024

Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models

May 08, 2024

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

Mar 20, 2024

BjTT: A Large-scale Multimodal Dataset for Traffic Prediction

Mar 14, 2024