BinHong Yang

Scene Graph-guided SegCaptioning Transformer with Fine-grained Alignment for Controllable Video Segmentation and Captioning

Mar 21, 2026
Via arXiv