Picture for Ming Liu

Ming Liu

Beijing Institute of Technology, China

CCFQA: A Benchmark for Cross-Lingual and Cross-Modal Speech and Text Factuality Evaluation

Add code
Aug 10, 2025
Viaarxiv icon

DuLoc: Life-Long Dual-Layer Localization in Changing and Dynamic Expansive Scenarios

Add code
Jul 31, 2025
Viaarxiv icon

Reasoning Multimodal Large Language Model: Data Contamination and Dynamic Evaluation

Add code
Jun 08, 2025
Viaarxiv icon

Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

Add code
May 25, 2025
Viaarxiv icon

Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving

Add code
May 23, 2025
Viaarxiv icon

Challenger: Affordable Adversarial Driving Video Generation

Add code
May 21, 2025
Viaarxiv icon

Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal Inconsistency

Add code
May 20, 2025
Viaarxiv icon

Is your multimodal large language model a good science tutor?

Add code
May 09, 2025
Viaarxiv icon

Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving

Add code
May 09, 2025
Viaarxiv icon

AnimateAnywhere: Rouse the Background in Human Image Animation

Add code
Apr 28, 2025
Viaarxiv icon