Picture for Zhaoyang Wang

Zhaoyang Wang

Provable and Practical In-Context Policy Optimization for Self-Improvement

Add code
Mar 02, 2026
Viaarxiv icon

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Add code
Feb 25, 2026
Viaarxiv icon

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Add code
Feb 11, 2026
Viaarxiv icon

Reliable and Responsible Foundation Models: A Comprehensive Survey

Add code
Feb 04, 2026
Viaarxiv icon

Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction

Add code
Nov 14, 2025
Figure 1 for Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
Figure 2 for Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
Figure 3 for Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
Figure 4 for Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
Viaarxiv icon

Adapting Web Agents with Synthetic Supervision

Add code
Nov 08, 2025
Viaarxiv icon

GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks

Add code
Nov 08, 2025
Figure 1 for GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks
Figure 2 for GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks
Figure 3 for GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks
Figure 4 for GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks
Viaarxiv icon

FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution

Add code
Jun 13, 2025
Viaarxiv icon

EyeSim-VQA: A Free-Energy-Guided Eye Simulation Framework for Video Quality Assessment

Add code
Jun 13, 2025
Viaarxiv icon

Efficient Long CoT Reasoning in Small Language Models

Add code
May 24, 2025
Viaarxiv icon