Weiye Xu

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Apr 21, 2025
Via arXiv

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Apr 15, 2025
Via arXiv

P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task

Sep 17, 2024
Via arXiv