Picture for Jing Huang

Jing Huang

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

Add code
Mar 19, 2026
Viaarxiv icon

The Unlearning Mirage: A Dynamic Framework for Evaluating LLM Unlearning

Add code
Mar 11, 2026
Viaarxiv icon

CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

Add code
Feb 23, 2026
Viaarxiv icon

HiFloat4 Format for Language Model Inference

Add code
Feb 13, 2026
Viaarxiv icon

TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution

Add code
Feb 10, 2026
Viaarxiv icon

Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance

Add code
Feb 05, 2026
Viaarxiv icon

Agentic Reward Modeling: Verifying GUI Agent via Online Proactive Interaction

Add code
Jan 31, 2026
Viaarxiv icon

UCPO: Uncertainty-Aware Policy Optimization

Add code
Jan 30, 2026
Viaarxiv icon

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Add code
Jan 29, 2026
Viaarxiv icon

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Add code
Jan 28, 2026
Viaarxiv icon