Picture for Kexin Huang

Kexin Huang

One-Way Policy Optimization for Self-Evolving LLMs

Add code
May 21, 2026
Viaarxiv icon

Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals

Add code
May 21, 2026
Viaarxiv icon

A Versatile AI Agent for Rare Disease Diagnosis and Risk Gene Prioritization

Add code
May 07, 2026
Viaarxiv icon

BenchGuard: Who Guards the Benchmarks? Automated Auditing of LLM Agent Benchmarks

Add code
Apr 27, 2026
Viaarxiv icon

MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions

Add code
Mar 30, 2026
Viaarxiv icon

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Add code
Mar 27, 2026
Viaarxiv icon

Incorporating contextual information into KGWAS for interpretable GWAS discovery

Add code
Mar 26, 2026
Viaarxiv icon

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Add code
Mar 26, 2026
Viaarxiv icon

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Add code
Mar 23, 2026
Viaarxiv icon

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Add code
Mar 23, 2026
Viaarxiv icon