Le Tian

POINTS-Seeker: Towards Training a Multimodal Agentic Search Model from Scratch

Apr 15, 2026

POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs

Apr 13, 2026

VersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization

Feb 10, 2026

POINTS-GUI-G: GUI-Grounding Journey

Feb 06, 2026

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Feb 03, 2026

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Dec 11, 2024

POINTS: Improving Your Vision-language Model with Affordable Strategies

Sep 07, 2024

Rethinking Overlooked Aspects in Vision-Language Models

May 20, 2024