Picture for Xiangyang Xue

Xiangyang Xue

Fudan University

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Add code
Apr 21, 2025
Viaarxiv icon

CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image

Add code
Apr 15, 2025
Viaarxiv icon

DecoFuse: Decomposing and Fusing the "What", "Where", and "How" for Brain-Inspired fMRI-to-Video Decoding

Add code
Apr 01, 2025
Viaarxiv icon

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning

Add code
Mar 30, 2025
Viaarxiv icon

EmoHead: Emotional Talking Head via Manipulating Semantic Expression Parameters

Add code
Mar 25, 2025
Viaarxiv icon

ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models

Add code
Feb 27, 2025
Viaarxiv icon

Global Semantic-Guided Sub-image Feature Weight Allocation in High-Resolution Large Vision-Language Models

Add code
Jan 24, 2025
Viaarxiv icon

SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images

Add code
Dec 03, 2024
Viaarxiv icon

When Large Vision-Language Models Meet Person Re-Identification

Add code
Nov 27, 2024
Figure 1 for When Large Vision-Language Models Meet Person Re-Identification
Figure 2 for When Large Vision-Language Models Meet Person Re-Identification
Figure 3 for When Large Vision-Language Models Meet Person Re-Identification
Figure 4 for When Large Vision-Language Models Meet Person Re-Identification
Viaarxiv icon

MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

Add code
Nov 26, 2024
Viaarxiv icon