Alert button
Picture for Hanhan Zhou

Hanhan Zhou

Alert button

Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

Add code
Bookmark button
Alert button
Mar 22, 2024
Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan

Viaarxiv icon

Real-time Network Intrusion Detection via Decision Transformers

Add code
Bookmark button
Alert button
Dec 17, 2023
Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan

Figure 1 for Real-time Network Intrusion Detection via Decision Transformers
Viaarxiv icon

Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction

Add code
Bookmark button
Alert button
Oct 26, 2023
Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

Viaarxiv icon

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 28, 2023
Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Figure 1 for Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Figure 2 for Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Figure 3 for Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Figure 4 for Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Viaarxiv icon

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

Add code
Bookmark button
Alert button
Feb 28, 2023
Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

Figure 1 for MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Figure 2 for MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Figure 3 for MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Figure 4 for MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Viaarxiv icon

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 11, 2023
Yongsheng Mei, Hanhan Zhou, Tian Lan

Figure 1 for ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Figure 2 for ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Figure 3 for ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Figure 4 for ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Viaarxiv icon

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 22, 2022
Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Figure 1 for PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning
Figure 2 for PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning
Figure 3 for PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning
Figure 4 for PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning
Viaarxiv icon

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning

Add code
Bookmark button
Alert button
Feb 09, 2022
Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

Figure 1 for On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning
Figure 2 for On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning
Figure 3 for On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning
Figure 4 for On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning
Viaarxiv icon