Hongjin Lu

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Apr 10, 2025
Via arXiv

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Jan 25, 2024
Via arXiv