Picture for Dan Xu

Dan Xu

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Add code
Jul 28, 2025
Viaarxiv icon

UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation

Add code
Jul 03, 2025
Viaarxiv icon

HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning

Add code
May 21, 2025
Viaarxiv icon

Learning Heterogeneous Mixture of Scene Experts for Large-scale Neural Radiance Fields

Add code
May 04, 2025
Viaarxiv icon

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Add code
Apr 03, 2025
Viaarxiv icon

GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping

Add code
Mar 13, 2025
Viaarxiv icon

Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations

Add code
Mar 13, 2025
Viaarxiv icon

Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective

Add code
Mar 08, 2025
Viaarxiv icon

Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation

Add code
Mar 08, 2025
Figure 1 for Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation
Figure 2 for Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation
Figure 3 for Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation
Figure 4 for Vision-aware Multimodal Prompt Tuning for Uploadable Multi-source Few-shot Domain Adaptation
Viaarxiv icon

Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs

Add code
Mar 07, 2025
Viaarxiv icon