Botian Shi

TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Apr 22, 2025

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Apr 15, 2025

RAKG: Document-level Retrieval Augmented Knowledge Graph Construction

Apr 14, 2025

OmniCaptioner: One Captioner to Rule Them All

Apr 09, 2025

Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning

Mar 17, 2025

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Mar 10, 2025

LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement

Feb 13, 2025

LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking

Jan 14, 2025

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

Jan 07, 2025

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

Dec 16, 2024