Yiwen Guo

Breaking Contextual Inertia: Reinforcement Learning with Single-Turn Anchors for Stable Multi-Turn Interaction

Mar 05, 2026
Via arXiv

Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models

Feb 06, 2026
Via arXiv

SemanticAudio: Audio Generation and Editing in Semantic Space

Jan 29, 2026
Via arXiv

MSR-Codec: A Low-Bitrate Multi-Stream Residual Codec for High-Fidelity Speech Generation with Information Disentanglement

Sep 16, 2025
Via arXiv

Emotion Omni: Enabling Empathetic Speech Response Generation through Large Language Models

Aug 26, 2025
Via arXiv

Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge

Jul 23, 2025
Via arXiv

Identifying and Understanding Cross-Class Features in Adversarial Training

Jun 05, 2025
Via arXiv

Cultivating Game Sense for Yourself: Making VLMs Gaming Experts

Mar 27, 2025
Via arXiv

Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm

Mar 04, 2025
Via arXiv

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Feb 13, 2025
Via arXiv