Picture for Trung Bui

Trung Bui

FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation

Add code
Jul 09, 2025
Viaarxiv icon

Context-Informed Grounding Supervision

Add code
Jun 18, 2025
Viaarxiv icon

MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos

Add code
Jun 14, 2025
Viaarxiv icon

Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning

Add code
Jun 08, 2025
Viaarxiv icon

Understanding Generative AI Capabilities in Everyday Image Editing Tasks

Add code
May 22, 2025
Viaarxiv icon

YoChameleon: Personalized Vision and Language Generation

Add code
Apr 29, 2025
Viaarxiv icon

CORG: Generating Answers from Complex, Interrelated Contexts

Add code
Apr 25, 2025
Viaarxiv icon

NoLiMa: Long-Context Evaluation Beyond Literal Matching

Add code
Feb 07, 2025
Viaarxiv icon

Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage

Add code
Dec 24, 2024
Viaarxiv icon

GUI Agents: A Survey

Add code
Dec 18, 2024
Viaarxiv icon