Picture for Zhuang Yu

Zhuang Yu

LEMON: How Well Do MLLMs Perform Temporal Multimodal Understanding on Instructional Videos?

Add code
Jan 27, 2026
Viaarxiv icon

Combining Self-attention and Dilation Convolutional for Semantic Segmentation of Coal Maceral Groups

Add code
Jun 15, 2025
Viaarxiv icon

Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation

Add code
Apr 25, 2025
Viaarxiv icon