Zhaoxiang Zhang

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Jul 03, 2025
Via arXiv

Unified Vision-Language-Action Model

Jun 24, 2025
Via arXiv

TC-Light: Temporally Consistent Relighting for Dynamic Long Videos

Jun 23, 2025
Via arXiv

MLLM-CL: Continual Learning for Multimodal Large Language Models

Jun 05, 2025
Via arXiv

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

May 21, 2025
Via arXiv

Semi-parametric Memory Consolidation: Towards Brain-like Deep Continual Learning

Apr 20, 2025
Via arXiv

TrustLoRA: Low-Rank Adaptation for Failure Detection under Out-of-distribution Data

Apr 20, 2025
Via arXiv

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness

Apr 02, 2025
Via arXiv

End-to-End Driving with Online Trajectory Evaluation via BEV World Model

Apr 02, 2025
Via arXiv

EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models

Mar 14, 2025
Via arXiv