Picture for Jianhua Han

Jianhua Han

MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation

Add code
Mar 20, 2026
Viaarxiv icon

Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization

Add code
Mar 10, 2026
Viaarxiv icon

AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots

Add code
Mar 08, 2026
Viaarxiv icon

CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Add code
Mar 03, 2026
Viaarxiv icon

RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training

Add code
Feb 13, 2026
Viaarxiv icon

Thinking with Geometry: Active Geometry Integration for Spatial Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM

Add code
Feb 03, 2026
Viaarxiv icon

Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI

Add code
Oct 06, 2025
Figure 1 for Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Figure 2 for Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Figure 3 for Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Figure 4 for Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI
Viaarxiv icon

C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning

Add code
Jul 22, 2025
Figure 1 for C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning
Figure 2 for C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning
Figure 3 for C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning
Figure 4 for C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning
Viaarxiv icon

Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs

Add code
Jun 06, 2025
Figure 1 for Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
Figure 2 for Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
Figure 3 for Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
Figure 4 for Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
Viaarxiv icon