Picture for Zhao Jin

Zhao Jin

Self-Supervised Multi-Part Articulated Objects Modeling via Deformable Gaussian Splatting and Progressive Primitive Segmentation

Add code
Jun 11, 2025
Viaarxiv icon

ArtVIP: Articulated Digital Assets of Visual Realism, Modular Interaction, and Physical Fidelity for Robot Learning

Add code
Jun 06, 2025
Viaarxiv icon

MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval

Add code
May 26, 2025
Viaarxiv icon

VORTA: Efficient Video Diffusion via Routing Sparse Attention

Add code
May 24, 2025
Viaarxiv icon

Where is this coming from? Making groundedness count in the evaluation of Document VQA models

Add code
Mar 24, 2025
Viaarxiv icon

Correctness Learning: Deductive Verification Guided Learning for Human-AI Collaboration

Add code
Mar 10, 2025
Viaarxiv icon

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation

Add code
Dec 18, 2024
Viaarxiv icon

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

Add code
Dec 16, 2024
Viaarxiv icon

SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing

Add code
Nov 28, 2024
Viaarxiv icon

SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects

Add code
Jan 17, 2024
Figure 1 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Figure 2 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Figure 3 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Figure 4 for SM$^3$: Self-Supervised Multi-task Modeling with Multi-view 2D Images for Articulated Objects
Viaarxiv icon