Picture for Han Xiao

Han Xiao

PiCA: Pivot-Based Credit Assignment for Search Agentic Reinforcement Learning

Add code
May 10, 2026
Viaarxiv icon

SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning

Add code
Apr 24, 2026
Viaarxiv icon

Skill-SD: Skill-Conditioned Self-Distillation for Multi-turn LLM Agents

Add code
Apr 12, 2026
Viaarxiv icon

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

Add code
Mar 09, 2026
Viaarxiv icon

mlx-vis: GPU-Accelerated Dimensionality Reduction and Visualization on Apple Silicon

Add code
Mar 04, 2026
Viaarxiv icon

DHP: Efficient Scaling of MLLM Training with Dynamic Hybrid Parallelism

Add code
Feb 25, 2026
Viaarxiv icon

jina-embeddings-v5-text: Task-Targeted Embedding Distillation

Add code
Feb 17, 2026
Viaarxiv icon

Embedding Inversion via Conditional Masked Diffusion Language Models

Add code
Feb 12, 2026
Viaarxiv icon

UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents

Add code
Feb 05, 2026
Viaarxiv icon

AI-Native 6G Physical Layer with Cross-Module Optimization and Cooperative Control Agents

Add code
Jan 07, 2026
Viaarxiv icon