Weitong Zhang

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

May 19, 2026
Via arXiv

Wasserstein Equilibrium Decoding for Reliable Medical Visual Question Answering

May 18, 2026
Via arXiv

OGLS-SD: On-Policy Self-Distillation with Outcome-Guided Logit Steering for LLM Reasoning

May 12, 2026
Via arXiv

GeomHerd: A Forward-looking Herding Quantification via Ricci Flow Geometry on Agent Interactive Simulations

May 12, 2026
Via arXiv

Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation

Apr 15, 2026
Via arXiv

Provable and Practical In-Context Policy Optimization for Self-Improvement

Mar 02, 2026
Via arXiv

Your Self-Play Algorithm is Secretly an Adversarial Imitator: Understanding LLM Self-Play through the Lens of Imitation Learning

Feb 01, 2026
Via arXiv

Imitation from Observations with Trajectory-Level Generative Embeddings

Jan 01, 2026
Via arXiv

Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning

Oct 10, 2025
Via arXiv

Graph Conditioned Diffusion for Controllable Histopathology Image Generation

Oct 08, 2025
Via arXiv