Picture for Zhi Wang

Zhi Wang

Moore Threads

Feature-Based Instance Neighbor Discovery: Advanced Stable Test-Time Adaptation in Dynamic World

Add code
Jun 07, 2025
Viaarxiv icon

Mixture-of-Experts Meets In-Context Reinforcement Learning

Add code
Jun 05, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Viaarxiv icon

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

$γ$-FedHT: Stepsize-Aware Hard-Threshold Gradient Compression in Federated Learning

Add code
May 18, 2025
Viaarxiv icon

High Quality Underwater Image Compression with Adaptive Correction and Codebook-based Augmentation

Add code
May 15, 2025
Viaarxiv icon

Towards Facial Image Compression with Consistency Preserving Diffusion Prior

Add code
May 09, 2025
Viaarxiv icon

Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision

Add code
Apr 22, 2025
Viaarxiv icon

Learning to Reason under Off-Policy Guidance

Add code
Apr 22, 2025
Viaarxiv icon

SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model

Add code
Apr 13, 2025
Viaarxiv icon