Yun Ma

SGR-Bench: Benchmarking Search Agents on State-Gated Retrieval

May 21, 2026
Via arXiv

Teaching AI Through Benchmark Construction: QuestBench as a Course-Based Practice for Accountable Knowledge Work

May 21, 2026
Via arXiv

MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis

May 20, 2026
Via arXiv

DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation

May 20, 2026
Via arXiv

Quant.npu: Enabling Efficient Mobile NPU Inference for on-device LLMs via Fully Static Quantization

May 19, 2026
Via arXiv

ViDR: Grounding Multimodal Deep Research Reports in Source Visual Evidence

May 13, 2026
Via arXiv

MEME: Modeling the Evolutionary Modes of Financial Markets

Feb 12, 2026
Via arXiv

AlphaPROBE: Alpha Mining via Principled Retrieval and On-graph biased evolution

Feb 12, 2026
Via arXiv

ERNIE 5.0 Technical Report

Feb 04, 2026
Via arXiv

WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU

Feb 03, 2026
Via arXiv