Picture for Jinghan Li

Jinghan Li

Think, then Score: Decoupled Reasoning and Scoring for Video Reward Modeling

Add code
May 07, 2026
Viaarxiv icon

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Add code
Mar 27, 2026
Viaarxiv icon

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Add code
Mar 26, 2026
Viaarxiv icon

Enhancing Multi-Modal LLMs Reasoning via Difficulty-Aware Group Normalization

Add code
Feb 26, 2026
Viaarxiv icon

Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations

Add code
Dec 24, 2025
Figure 1 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 2 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 3 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Figure 4 for Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
Viaarxiv icon

AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization

Add code
Apr 22, 2025
Viaarxiv icon

DAMO: Data- and Model-aware Alignment of Multi-modal LLMs

Add code
Feb 04, 2025
Figure 1 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Figure 2 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Figure 3 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Figure 4 for DAMO: Data- and Model-aware Alignment of Multi-modal LLMs
Viaarxiv icon

DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector

Add code
Oct 09, 2024
Figure 1 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Figure 2 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Figure 3 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Figure 4 for DiffGAD: A Diffusion-based Unsupervised Graph Anomaly Detector
Viaarxiv icon

Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling

Add code
Oct 02, 2024
Figure 1 for Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling
Figure 2 for Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling
Figure 3 for Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling
Figure 4 for Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling
Viaarxiv icon