Picture for Le Thien Phuc Nguyen

Le Thien Phuc Nguyen

Describe Anything Model for Visual Question Answering on Text-rich Images

Add code
Jul 16, 2025
Viaarxiv icon

UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios

Add code
May 28, 2025
Viaarxiv icon

LASER: Lip Landmark Assisted Speaker Detection for Robustness

Add code
Jan 21, 2025
Viaarxiv icon