Picture for Wei Li

Wei Li

Tsinghua University, Beijing, China

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Add code
Sep 26, 2025
Viaarxiv icon

TISDiSS: A Training-Time and Inference-Time Scalable Framework for Discriminative Source Separation

Add code
Sep 19, 2025
Viaarxiv icon

Cross-Modal Deep Metric Learning for Time Series Anomaly Detection

Add code
Sep 16, 2025
Viaarxiv icon

Enhancing Physical Consistency in Lightweight World Models

Add code
Sep 15, 2025
Viaarxiv icon

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Add code
Sep 09, 2025
Figure 1 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Figure 2 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Figure 3 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Figure 4 for Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Viaarxiv icon

Multi-modal Uncertainty Robust Tree Cover Segmentation For High-Resolution Remote Sensing Images

Add code
Sep 05, 2025
Viaarxiv icon

CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification

Add code
Aug 28, 2025
Figure 1 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Figure 2 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Figure 3 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Figure 4 for CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Figure 1 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 2 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 3 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 4 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Viaarxiv icon

RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration

Add code
Aug 26, 2025
Viaarxiv icon

C-Flat++: Towards a More Efficient and Powerful Framework for Continual Learning

Add code
Aug 26, 2025
Viaarxiv icon