Picture for Ming Liu

Ming Liu

Beijing Institute of Technology, China

Reasoning Multimodal Large Language Model: Data Contamination and Dynamic Evaluation

Add code
Jun 08, 2025
Viaarxiv icon

Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

Add code
May 25, 2025
Viaarxiv icon

Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving

Add code
May 23, 2025
Viaarxiv icon

Challenger: Affordable Adversarial Driving Video Generation

Add code
May 21, 2025
Viaarxiv icon

Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal Inconsistency

Add code
May 20, 2025
Viaarxiv icon

Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving

Add code
May 09, 2025
Viaarxiv icon

Is your multimodal large language model a good science tutor?

Add code
May 09, 2025
Viaarxiv icon

AnimateAnywhere: Rouse the Background in Human Image Animation

Add code
Apr 28, 2025
Viaarxiv icon

Active Learning Methods for Efficient Data Utilization and Model Performance Enhancement

Add code
Apr 21, 2025
Viaarxiv icon

Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection

Add code
Apr 20, 2025
Viaarxiv icon