Aist


V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation

Add code
Mar 11, 2026
Viaarxiv icon

Not Like Transformers: Drop the Beat Representation for Dance Generation with Mamba-Based Diffusion Model

Add code
Mar 09, 2026
Viaarxiv icon

AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

Add code
Feb 17, 2026
Viaarxiv icon

QuaMo: Quaternion Motions for Vision-based 3D Human Kinematics Capture

Add code
Jan 27, 2026
Viaarxiv icon

The CMU-AIST submission for the ICME 2025 Audio Encoder Challenge

Add code
Jan 22, 2026
Viaarxiv icon

Reframing Music-Driven 2D Dance Pose Generation as Multi-Channel Image Generation

Add code
Dec 12, 2025
Viaarxiv icon

DanceChat: Large Language Model-Guided Music-to-Dance Generation

Add code
Jun 12, 2025
Viaarxiv icon

MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation

Add code
May 23, 2025
Figure 1 for MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation
Figure 2 for MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation
Figure 3 for MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation
Figure 4 for MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation
Viaarxiv icon

Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model

Add code
Mar 28, 2025
Viaarxiv icon

AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs

Add code
Mar 28, 2025
Figure 1 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Figure 2 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Figure 3 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Figure 4 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Viaarxiv icon