Picture for Lang Feng

Lang Feng

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Add code
Feb 26, 2026
Viaarxiv icon

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Add code
Feb 11, 2026
Viaarxiv icon

AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection

Add code
Feb 09, 2026
Viaarxiv icon

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Add code
Feb 09, 2026
Viaarxiv icon

AgentOCR: Reimagining Agent History via Optical Self-Compression

Add code
Jan 08, 2026
Viaarxiv icon

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Add code
Jan 04, 2026
Viaarxiv icon

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Add code
Jun 16, 2025
Viaarxiv icon

Group-in-Group Policy Optimization for LLM Agent Training

Add code
May 16, 2025
Viaarxiv icon

Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors

Add code
May 16, 2025
Viaarxiv icon

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Add code
May 01, 2025
Viaarxiv icon