Picture for Wen Wang

Wen Wang

Fun-Audio-Chat Technical Report

Add code
Dec 23, 2025
Viaarxiv icon

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Add code
Dec 18, 2025
Viaarxiv icon

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Add code
Dec 08, 2025
Viaarxiv icon

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Add code
Oct 23, 2025
Viaarxiv icon

FunAudio-ASR Technical Report

Add code
Sep 15, 2025
Figure 1 for FunAudio-ASR Technical Report
Figure 2 for FunAudio-ASR Technical Report
Figure 3 for FunAudio-ASR Technical Report
Figure 4 for FunAudio-ASR Technical Report
Viaarxiv icon

Say More with Less: Variable-Frame-Rate Speech Tokenization via Adaptive Clustering and Implicit Duration Coding

Add code
Sep 04, 2025
Viaarxiv icon

Solving the Min-Max Multiple Traveling Salesmen Problem via Learning-Based Path Generation and Optimal Splitting

Add code
Aug 23, 2025
Viaarxiv icon

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Add code
Aug 12, 2025
Viaarxiv icon

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models

Add code
Aug 08, 2025
Viaarxiv icon

Token Communication in the Era of Large Models: An Information Bottleneck-Based Approach

Add code
Jul 02, 2025
Viaarxiv icon