Picture for Yisi Liu

Yisi Liu

StyleStream: Real-Time Zero-Shot Voice Style Conversion

Add code
Feb 23, 2026
Viaarxiv icon

HuPER: A Human-Inspired Framework for Phonetic Perception

Add code
Feb 02, 2026
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

Prompt-and-Check: Using Large Language Models to Evaluate Communication Protocol Compliance in Simulation-Based Training

Add code
Aug 12, 2025
Figure 1 for Prompt-and-Check: Using Large Language Models to Evaluate Communication Protocol Compliance in Simulation-Based Training
Figure 2 for Prompt-and-Check: Using Large Language Models to Evaluate Communication Protocol Compliance in Simulation-Based Training
Figure 3 for Prompt-and-Check: Using Large Language Models to Evaluate Communication Protocol Compliance in Simulation-Based Training
Figure 4 for Prompt-and-Check: Using Large Language Models to Evaluate Communication Protocol Compliance in Simulation-Based Training
Viaarxiv icon

Enhancing Egocentric Object Detection in Static Environments using Graph-based Spatial Anomaly Detection and Correction

Add code
Aug 11, 2025
Viaarxiv icon

AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance

Add code
Jul 02, 2025
Figure 1 for AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance
Figure 2 for AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance
Figure 3 for AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance
Figure 4 for AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance
Viaarxiv icon

RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding

Add code
Jun 12, 2025
Figure 1 for RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding
Figure 2 for RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding
Figure 3 for RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding
Figure 4 for RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding
Viaarxiv icon

Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model

Add code
Oct 24, 2024
Figure 1 for Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model
Figure 2 for Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model
Figure 3 for Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model
Figure 4 for Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model
Viaarxiv icon

Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP

Add code
Sep 04, 2024
Figure 1 for Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP
Figure 2 for Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP
Figure 3 for Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP
Figure 4 for Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP
Viaarxiv icon

Taxonomic analysis of asteroids with artificial neural networks

Add code
Nov 18, 2023
Viaarxiv icon