Hanhan Zhou

WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation

Oct 22, 2025
Via arXiv

When Facts Change: Probing LLMs on Evolving Knowledge with evolveQA

Oct 22, 2025
Via arXiv

RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Oct 21, 2024
Via arXiv

Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

Mar 22, 2024
Via arXiv

Real-time Network Intrusion Detection via Decision Transformers

Dec 17, 2023
Via arXiv

Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction

Oct 26, 2023
Via arXiv

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

Aug 28, 2023
Via arXiv

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

Feb 28, 2023
Via arXiv

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

Feb 11, 2023
Via arXiv

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

Jun 22, 2022
Via arXiv