Picture for Liangbin Huang

Liangbin Huang

CineSRD: Leveraging Visual, Acoustic, and Linguistic Cues for Open-World Visual Media Speaker Diarization

Add code
Mar 17, 2026
Viaarxiv icon

From Utterance to Vividity: Training Expressive Subtitle Translation LLM via Adaptive Local Preference Optimization

Add code
Feb 01, 2026
Viaarxiv icon

Hermes the Polyglot: A Unified Framework to Enhance Expressiveness for Multimodal Interlingual Subtitling

Add code
Jan 31, 2026
Viaarxiv icon

Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization

Add code
Aug 12, 2025
Viaarxiv icon