Fei Fang

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

Mar 17, 2026

VAM: Verbalized Action Masking for Controllable Exploration in RL Post-Training -- A Chess Case Study

Feb 18, 2026

Large Language Models in Peer-Run Community Behavioral Health Services: Understanding Peer Specialists and Service Users' Perspectives on Opportunities, Risks, and Mitigation Strategies

Feb 09, 2026

Antidistillation Fingerprinting

Feb 03, 2026

RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue

Nov 19, 2025

Strategic Planning and Rationalizing on Trees Make LLMs Better Debaters

May 20, 2025

GenTorrent: Scaling Large Language Model Serving with An Overlay Network

Apr 30, 2025

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

Apr 18, 2025

REALM: A Dataset of Real-World LLM Use Cases

Mar 24, 2025

M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality

Mar 06, 2025