Picture for Zhiyuan Zhang

Zhiyuan Zhang

High-Fidelity Differential-information Driven Binary Vision Transformer

Add code
Jul 03, 2025
Figure 1 for High-Fidelity Differential-information Driven Binary Vision Transformer
Figure 2 for High-Fidelity Differential-information Driven Binary Vision Transformer
Figure 3 for High-Fidelity Differential-information Driven Binary Vision Transformer
Figure 4 for High-Fidelity Differential-information Driven Binary Vision Transformer
Viaarxiv icon

Time-Lapse Video-Based Embryo Grading via Complementary Spatial-Temporal Pattern Mining

Add code
Jun 05, 2025
Viaarxiv icon

ManiFeel: Benchmarking and Understanding Visuotactile Manipulation Policy Learning

Add code
May 24, 2025
Viaarxiv icon

Canonical Policy: Learning Canonical 3D Representation for Equivariant Policy

Add code
May 24, 2025
Figure 1 for Canonical Policy: Learning Canonical 3D Representation for Equivariant Policy
Figure 2 for Canonical Policy: Learning Canonical 3D Representation for Equivariant Policy
Figure 3 for Canonical Policy: Learning Canonical 3D Representation for Equivariant Policy
Figure 4 for Canonical Policy: Learning Canonical 3D Representation for Equivariant Policy
Viaarxiv icon

ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition

Add code
May 23, 2025
Figure 1 for ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition
Figure 2 for ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition
Figure 3 for ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition
Figure 4 for ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition
Viaarxiv icon

TS-Diff: Two-Stage Diffusion Model for Low-Light RAW Image Enhancement

Add code
May 07, 2025
Viaarxiv icon

HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation

Add code
May 07, 2025
Viaarxiv icon

RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics

Add code
Apr 02, 2025
Viaarxiv icon

MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving

Add code
Apr 01, 2025
Figure 1 for MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
Figure 2 for MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
Figure 3 for MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
Figure 4 for MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
Viaarxiv icon

GenM$^3$: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation

Add code
Mar 19, 2025
Viaarxiv icon