speech


STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits

Dec 15, 2025

NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data

Dec 14, 2025

BUT Systems for WildSpoof Challenge: SASV in the Wild

Dec 14, 2025

InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation

Dec 14, 2025

JointAVBench: A Benchmark for Joint Audio-Visual Reasoning Evaluation

Dec 14, 2025

EEG-to-Voice Decoding of Spoken and Imagined Speech Using Non-Invasive EEG

Dec 14, 2025

F5-TTS-RO: Extending F5-TTS to Romanian TTS via Lightweight Input Adaptation

Dec 13, 2025

KeyframeFace: From Text to Expressive Facial Keyframes

Dec 12, 2025

All-in-One ASR: Unifying Encoder-Decoder Models of CTC, Attention, and Transducer in Dual-Mode ASR

Dec 12, 2025

Stronger Normalization-Free Transformers

Dec 11, 2025