Changsheng Xu

Libra: Building Decoupled Vision System on Large Language Models

May 16, 2024
Via arXiv

HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding

Apr 20, 2024
Via arXiv

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion

Apr 09, 2024
Via arXiv

Music Style Transfer with Time-Varying Inversion of Diffusion Models

Feb 21, 2024
Via arXiv

Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models

Jan 31, 2024
Via arXiv

CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion

Jan 30, 2024
Via arXiv

Hierarchical Prompts for Rehearsal-free Continual Learning

Jan 21, 2024
Via arXiv

Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition

Jan 11, 2024
Via arXiv

Three Heads Are Better Than One: Complementary Experts for Long-Tailed Semi-supervised Learning

Dec 25, 2023
Via arXiv

Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking

Dec 13, 2023
Via arXiv