Jing Huang

HiFloat4 Format for Language Model Inference

Feb 13, 2026
Via arXiv

TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution

Feb 10, 2026
Via arXiv

Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance

Feb 05, 2026
Via arXiv

Agentic Reward Modeling: Verifying GUI Agent via Online Proactive Interaction

Jan 31, 2026
Via arXiv

UCPO: Uncertainty-Aware Policy Optimization

Jan 30, 2026
Via arXiv

OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models

Jan 29, 2026
Via arXiv

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Jan 28, 2026
Via arXiv

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Jan 26, 2026
Via arXiv

Diffusion Epistemic Uncertainty with Asymmetric Learning for Diffusion-Generated Image Detection

Jan 21, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv