Tingle Li

Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech

Dec 25, 2025

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Dec 14, 2025

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Aug 25, 2025

MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real

Jul 03, 2025

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Jun 04, 2025

Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

Mar 06, 2025

Audio Texture Manipulation by Exemplar-Based Analogy

Jan 21, 2025

Self-Supervised Audio-Visual Soundscape Stylization

Sep 22, 2024

Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection

Dec 20, 2023

Deep Speech Synthesis from MRI-Based Articulatory Representations

Jul 05, 2023