Picture for Xingqun Qi

Xingqun Qi

SemiTooth: a Generalizable Semi-supervised Framework for Multi-Source Tooth Segmentation

Add code
Mar 12, 2026
Viaarxiv icon

RA-SSU: Towards Fine-Grained Audio-Visual Learning with Region-Aware Sound Source Understanding

Add code
Mar 10, 2026
Viaarxiv icon

DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions

Add code
Aug 24, 2025
Figure 1 for DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Figure 2 for DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Figure 3 for DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Figure 4 for DanceEditor: Towards Iterative Editable Music-driven Dance Generation with Open-Vocabulary Descriptions
Viaarxiv icon

ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension

Add code
Jul 22, 2025
Figure 1 for ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension
Figure 2 for ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension
Figure 3 for ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension
Figure 4 for ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension
Viaarxiv icon

Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion

Add code
May 03, 2025
Figure 1 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Figure 2 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Figure 3 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Figure 4 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Viaarxiv icon

VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction

Add code
Apr 30, 2025
Figure 1 for VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction
Figure 2 for VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction
Figure 3 for VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction
Figure 4 for VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction
Viaarxiv icon

EVA: An Embodied World Model for Future Video Anticipation

Add code
Oct 20, 2024
Figure 1 for EVA: An Embodied World Model for Future Video Anticipation
Figure 2 for EVA: An Embodied World Model for Future Video Anticipation
Figure 3 for EVA: An Embodied World Model for Future Video Anticipation
Figure 4 for EVA: An Embodied World Model for Future Video Anticipation
Viaarxiv icon

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Add code
Sep 16, 2024
Figure 1 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Figure 2 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Figure 3 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Figure 4 for PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
Viaarxiv icon

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Add code
Jul 30, 2024
Figure 1 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 2 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 3 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 4 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Viaarxiv icon

M-LRM: Multi-view Large Reconstruction Model

Add code
Jun 11, 2024
Figure 1 for M-LRM: Multi-view Large Reconstruction Model
Figure 2 for M-LRM: Multi-view Large Reconstruction Model
Figure 3 for M-LRM: Multi-view Large Reconstruction Model
Figure 4 for M-LRM: Multi-view Large Reconstruction Model
Viaarxiv icon