Picture for Yikai Zhang

Yikai Zhang

Xi'an Jiaotong University

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Add code
Apr 13, 2026
Viaarxiv icon

ORACLE-SWE: Quantifying the Contribution of Oracle Information Signals on SWE Agents

Add code
Apr 09, 2026
Viaarxiv icon

From AI Assistant to AI Scientist: Autonomous Discovery of LLM-RL Algorithms with LLM Agents

Add code
Mar 25, 2026
Viaarxiv icon

CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning

Add code
Mar 09, 2026
Viaarxiv icon

RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

Add code
Mar 05, 2026
Viaarxiv icon

MeCaMIL: Causality-Aware Multiple Instance Learning for Fair and Interpretable Whole Slide Image Diagnosis

Add code
Nov 14, 2025
Viaarxiv icon

Two-Stage Decoupling Framework for Variable-Length Glaucoma Prognosis

Add code
Sep 15, 2025
Viaarxiv icon

Rethinking Layer-wise Gaussian Noise Injection: Bridging Implicit Objectives and Privacy Budget Allocation

Add code
Sep 04, 2025
Figure 1 for Rethinking Layer-wise Gaussian Noise Injection: Bridging Implicit Objectives and Privacy Budget Allocation
Figure 2 for Rethinking Layer-wise Gaussian Noise Injection: Bridging Implicit Objectives and Privacy Budget Allocation
Figure 3 for Rethinking Layer-wise Gaussian Noise Injection: Bridging Implicit Objectives and Privacy Budget Allocation
Figure 4 for Rethinking Layer-wise Gaussian Noise Injection: Bridging Implicit Objectives and Privacy Budget Allocation
Viaarxiv icon

R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration

Add code
May 30, 2025
Viaarxiv icon

Can LLMs Learn to Map the World from Local Descriptions?

Add code
May 27, 2025
Viaarxiv icon