Picture for Mingxuan Yuan

Mingxuan Yuan

Unleashing Low-Bit Inference on Ascend NPUs: A Comprehensive Evaluation of HiFloat Formats

Add code
Feb 13, 2026
Viaarxiv icon

C-MOP: Integrating Momentum and Boundary-Aware Clustering for Enhanced Prompt Evolution

Add code
Feb 11, 2026
Viaarxiv icon

DLLM Agent: See Farther, Run Faster

Add code
Feb 07, 2026
Viaarxiv icon

ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control

Add code
Feb 04, 2026
Viaarxiv icon

Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis

Add code
Jan 29, 2026
Viaarxiv icon

Beyond Speedup -- Utilizing KV Cache for Sampling and Reasoning

Add code
Jan 28, 2026
Viaarxiv icon

RIFT: Repurposing Negative Samples via Reward-Informed Fine-Tuning

Add code
Jan 14, 2026
Viaarxiv icon

SwiftMem: Fast Agentic Memory via Query-aware Indexing

Add code
Jan 13, 2026
Viaarxiv icon

Revisiting Judge Decoding from First Principles via Training-Free Distributional Divergence

Add code
Jan 08, 2026
Viaarxiv icon

What Matters For Safety Alignment?

Add code
Jan 07, 2026
Viaarxiv icon