Picture for Yike Guo

Yike Guo

Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion

Add code
May 03, 2025
Figure 1 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Figure 2 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Figure 3 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Figure 4 for Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
Viaarxiv icon

Benchmarking Multi-National Value Alignment for Large Language Models

Add code
Apr 19, 2025
Figure 1 for Benchmarking Multi-National Value Alignment for Large Language Models
Figure 2 for Benchmarking Multi-National Value Alignment for Large Language Models
Figure 3 for Benchmarking Multi-National Value Alignment for Large Language Models
Figure 4 for Benchmarking Multi-National Value Alignment for Large Language Models
Viaarxiv icon

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs

Add code
Mar 17, 2025
Viaarxiv icon

AudioX: Diffusion Transformer for Anything-to-Audio Generation

Add code
Mar 13, 2025
Viaarxiv icon

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Add code
Mar 11, 2025
Viaarxiv icon

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Add code
Mar 03, 2025
Figure 1 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 2 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 3 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Figure 4 for Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Viaarxiv icon

Delta Decompression for MoE-based LLMs Compression

Add code
Feb 24, 2025
Figure 1 for Delta Decompression for MoE-based LLMs Compression
Figure 2 for Delta Decompression for MoE-based LLMs Compression
Figure 3 for Delta Decompression for MoE-based LLMs Compression
Figure 4 for Delta Decompression for MoE-based LLMs Compression
Viaarxiv icon

Audio-FLAN: A Preliminary Release

Add code
Feb 23, 2025
Figure 1 for Audio-FLAN: A Preliminary Release
Figure 2 for Audio-FLAN: A Preliminary Release
Figure 3 for Audio-FLAN: A Preliminary Release
Figure 4 for Audio-FLAN: A Preliminary Release
Viaarxiv icon

Machine learning for modelling unstructured grid data in computational physics: a review

Add code
Feb 13, 2025
Figure 1 for Machine learning for modelling unstructured grid data in computational physics: a review
Figure 2 for Machine learning for modelling unstructured grid data in computational physics: a review
Figure 3 for Machine learning for modelling unstructured grid data in computational physics: a review
Figure 4 for Machine learning for modelling unstructured grid data in computational physics: a review
Viaarxiv icon

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

Add code
Feb 09, 2025
Viaarxiv icon