Xiaohua Liao

CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization

Mar 17, 2026
Via arXiv