Lang Feng

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Feb 26, 2026
Via arXiv

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Feb 11, 2026
Via arXiv

AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection

Feb 09, 2026
Via arXiv

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Feb 09, 2026
Via arXiv

AgentOCR: Reimagining Agent History via Optical Self-Compression

Jan 08, 2026
Via arXiv

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Jan 04, 2026
Via arXiv

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

Jun 16, 2025
Via arXiv

Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors

May 16, 2025
Via arXiv

Group-in-Group Policy Optimization for LLM Agent Training

May 16, 2025
Via arXiv

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

May 01, 2025
Via arXiv