Picture for Qiyao Sun

Qiyao Sun

MLLMs Get It Right, Then Get It Wrong: Tracing and Correcting Late-Layer Textual Bias

Add code
Jun 16, 2026
Viaarxiv icon

StemBind: When MLLMs Get Lost Between Rules and Instances in Abstract Visual Reasoning

Add code
May 29, 2026
Viaarxiv icon

Advantage Collapse in Group Relative Policy Optimization: Diagnosis and Mitigation

Add code
May 20, 2026
Viaarxiv icon

AiraXiv: An AI-Driven Open-Access Platform for Human and AI Scientists

Add code
May 20, 2026
Viaarxiv icon

ENC-Bench: A Benchmark for Evaluating Multimodal Large Language Models in Electronic Navigational Chart Understanding

Add code
Mar 24, 2026
Viaarxiv icon

Efficient Hallucination Detection: Adaptive Bayesian Estimation of Semantic Entropy with Guided Semantic Exploration

Add code
Mar 24, 2026
Viaarxiv icon

AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations

Add code
Feb 03, 2026
Viaarxiv icon

MCP-RiskCue: Can LLM Infer Risk Information From MCP Server System Logs?

Add code
Nov 12, 2025
Viaarxiv icon

DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

Add code
Sep 30, 2025
Figure 1 for DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively
Figure 2 for DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively
Figure 3 for DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively
Figure 4 for DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively
Viaarxiv icon

RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation

Add code
May 06, 2025
Viaarxiv icon