Picture for Lee Onn Mak

Lee Onn Mak

QMAVIS: Long Video-Audio Understanding using Fusion of Large Multimodal Models

Add code
Jan 10, 2026
Viaarxiv icon

QCaption: Video Captioning and Q&A through Fusion of Large Multimodal Models

Add code
Jan 10, 2026
Viaarxiv icon