Picture for Rui Min

Rui Min

RemoteAgent: Bridging Vague Human Intents and Earth Observation with RL-based Agentic MLLMs

Add code
Apr 09, 2026
Viaarxiv icon

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Add code
Apr 05, 2026
Viaarxiv icon

Mitigating Safety Tax via Distribution-Grounded Refinement in Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Empowering Reliable Visual-Centric Instruction Following in MLLMs

Add code
Jan 06, 2026
Viaarxiv icon

EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce

Add code
Dec 11, 2025
Figure 1 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Figure 2 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Figure 3 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Figure 4 for EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce
Viaarxiv icon

Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking

Add code
Oct 30, 2025
Viaarxiv icon

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Add code
Sep 16, 2025
Figure 1 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 2 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 3 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 4 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Viaarxiv icon

Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering

Add code
May 22, 2025
Figure 1 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 2 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 3 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Figure 4 for Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering
Viaarxiv icon

Improving Your Model Ranking on Chatbot Arena by Vote Rigging

Add code
Jan 29, 2025
Figure 1 for Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Figure 2 for Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Figure 3 for Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Figure 4 for Improving Your Model Ranking on Chatbot Arena by Vote Rigging
Viaarxiv icon

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages

Add code
Jan 24, 2025
Figure 1 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 2 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 3 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 4 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Viaarxiv icon