Picture for Meng Meng

Meng Meng

TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis

Add code
Apr 24, 2026
Viaarxiv icon

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

Add code
Apr 16, 2026
Viaarxiv icon

Borderless Long Speech Synthesis

Add code
Mar 20, 2026
Viaarxiv icon

Pareto-guided Pipeline for Distilling Featherweight AI Agents in Mobile MOBA Games

Add code
Feb 07, 2026
Viaarxiv icon

One Ring to Rule Them All: Unifying Group-Based RL via Dynamic Power-Mean Geometry

Add code
Jan 30, 2026
Viaarxiv icon

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Add code
Dec 19, 2025
Figure 1 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 2 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 3 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 4 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Viaarxiv icon

DiffRhythm 2: Efficient and High Fidelity Song Generation via Block Flow Matching

Add code
Oct 27, 2025
Viaarxiv icon

Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling

Add code
Jun 11, 2024
Figure 1 for Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
Figure 2 for Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
Figure 3 for Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
Figure 4 for Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
Viaarxiv icon

Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning

Add code
Jun 06, 2024
Viaarxiv icon

In-Session Personalization for Talent Search

Add code
Sep 18, 2018
Figure 1 for In-Session Personalization for Talent Search
Figure 2 for In-Session Personalization for Talent Search
Figure 3 for In-Session Personalization for Talent Search
Figure 4 for In-Session Personalization for Talent Search
Viaarxiv icon