Zhenhua Feng

PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

Sep 05, 2025 (via arXiv)

DASViT: Differentiable Architecture Search for Vision Transformer

Jul 17, 2025 (via arXiv)

Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing

Apr 11, 2025 (via arXiv)

DGFM: Full Body Dance Generation Driven by Music Foundation Models

Feb 27, 2025 (via arXiv)

One Model for ALL: Low-Level Task Interaction Is a Key to Task-Agnostic Image Fusion

Feb 27, 2025 (via arXiv)

GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music

Feb 25, 2025 (via arXiv)

Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation

Feb 25, 2025 (via arXiv)

PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation

Dec 10, 2024 (via arXiv)

Rethinking Positive Pairs in Contrastive Learning

Oct 23, 2024 (via arXiv)

SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion

Sep 26, 2024 (via arXiv)