Aist


DanceChat: Large Language Model-Guided Music-to-Dance Generation

Add code
Jun 12, 2025
Viaarxiv icon

MEGADance: Mixture-of-Experts Architecture for Genre-Aware 3D Dance Generation

Add code
May 23, 2025
Viaarxiv icon

AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs

Add code
Mar 28, 2025
Figure 1 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Figure 2 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Figure 3 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Figure 4 for AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs
Viaarxiv icon

Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model

Add code
Mar 28, 2025
Viaarxiv icon

Motion Anything: Any to Motion Generation

Add code
Mar 10, 2025
Viaarxiv icon

GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music

Add code
Feb 25, 2025
Viaarxiv icon

LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh

Add code
Feb 13, 2025
Figure 1 for LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
Figure 2 for LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
Figure 3 for LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
Figure 4 for LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh
Viaarxiv icon

ABCI 3.0: Evolution of the leading AI infrastructure in Japan

Add code
Nov 14, 2024
Viaarxiv icon

MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation

Add code
Oct 02, 2024
Viaarxiv icon

AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation

Add code
Jun 11, 2024
Figure 1 for AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Figure 2 for AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Figure 3 for AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Figure 4 for AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Viaarxiv icon