Zijian Wang

Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes

Jan 26, 2026

PaperGuide: Making Small Language-Model Paper-Reading Agents More Efficient

Jan 19, 2026

ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving

Jan 08, 2026

Affordance Field Intervention: Enabling VLAs to Escape Memory Traps in Robotic Manipulation

Dec 08, 2025

Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

Nov 18, 2025

Controllable Flow Matching for Online Reinforcement Learning

Nov 10, 2025

Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs

Oct 31, 2025

ThoughtProbe: Classifier-Guided LLM Thought Space Exploration via Probing Representations

Oct 31, 2025

Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks

Oct 01, 2025

FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data

Sep 08, 2025