Picture for Jihao Gu

Jihao Gu

InquireMobile: Teaching VLM-based Mobile Agent to Request Human Assistance via Reinforcement Fine-Tuning

Add code
Aug 27, 2025
Viaarxiv icon

Motion Matters: Motion-guided Modulation Network for Skeleton-based Micro-Action Recognition

Add code
Jul 29, 2025
Viaarxiv icon

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models

Add code
Apr 25, 2025
Viaarxiv icon

GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning

Add code
Apr 17, 2025
Viaarxiv icon

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models

Add code
Mar 24, 2025
Viaarxiv icon

ChineseSimpleVQA -- "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models

Add code
Feb 19, 2025
Viaarxiv icon

Performance Analysis of Traditional VQA Models Under Limited Computational Resources

Add code
Feb 09, 2025
Viaarxiv icon

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation

Add code
Dec 19, 2024
Viaarxiv icon

2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision

Add code
Oct 25, 2024
Figure 1 for 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
Figure 2 for 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
Figure 3 for 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
Figure 4 for 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
Viaarxiv icon

SARA: Singular-Value Based Adaptive Low-Rank Adaption

Add code
Aug 06, 2024
Viaarxiv icon