Picture for Run Shao

Run Shao

The Wittgensteinian Representation Hypothesis: Is Language the Attractor of Multimodal Convergence?

Add code
May 10, 2026
Viaarxiv icon

Don't Act Blindly: Robust GUI Automation via Action-Effect Verification and Self-Correction

Add code
Apr 07, 2026
Viaarxiv icon

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

Add code
Apr 07, 2026
Viaarxiv icon

Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering

Add code
Aug 21, 2025
Figure 1 for Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Figure 2 for Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Figure 3 for Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Figure 4 for Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering
Viaarxiv icon

Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding

Add code
Mar 27, 2024
Figure 1 for Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Figure 2 for Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Figure 3 for Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Figure 4 for Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding
Viaarxiv icon

AllSpark: a multimodal spatiotemporal general model

Add code
Dec 31, 2023
Figure 1 for AllSpark: a multimodal spatiotemporal general model
Figure 2 for AllSpark: a multimodal spatiotemporal general model
Figure 3 for AllSpark: a multimodal spatiotemporal general model
Figure 4 for AllSpark: a multimodal spatiotemporal general model
Viaarxiv icon