Picture for Zizhan Zheng

Zizhan Zheng

MemBoost: A Memory-Boosted Framework for Cost-Aware LLM Inference

Add code
Mar 27, 2026
Viaarxiv icon

From Classical to Quantum Reinforcement Learning and Its Applications in Quantum Control: A Beginner's Tutorial

Add code
Jan 13, 2026
Viaarxiv icon

Online Learning with Probing for Sequential User-Centric Selection

Add code
Jul 27, 2025
Figure 1 for Online Learning with Probing for Sequential User-Centric Selection
Figure 2 for Online Learning with Probing for Sequential User-Centric Selection
Figure 3 for Online Learning with Probing for Sequential User-Centric Selection
Viaarxiv icon

Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits

Add code
Jun 17, 2025
Figure 1 for Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits
Viaarxiv icon

Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks

Add code
Oct 22, 2024
Figure 1 for Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks
Figure 2 for Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks
Figure 3 for Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks
Figure 4 for Meta Stackelberg Game: Robust Federated Learning against Adaptive and Mixed Poisoning Attacks
Viaarxiv icon

Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations

Add code
Mar 06, 2024
Figure 1 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 2 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 3 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Figure 4 for Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations
Viaarxiv icon

Enhancing LLM Safety via Constrained Direct Preference Optimization

Add code
Mar 04, 2024
Figure 1 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 2 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 3 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 4 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Viaarxiv icon

A First Order Meta Stackelberg Method for Robust Federated Learning

Add code
Jul 16, 2023
Figure 1 for A First Order Meta Stackelberg Method for Robust Federated Learning
Figure 2 for A First Order Meta Stackelberg Method for Robust Federated Learning
Viaarxiv icon

Learning to Backdoor Federated Learning

Add code
Mar 06, 2023
Figure 1 for Learning to Backdoor Federated Learning
Figure 2 for Learning to Backdoor Federated Learning
Figure 3 for Learning to Backdoor Federated Learning
Figure 4 for Learning to Backdoor Federated Learning
Viaarxiv icon

Online Learning for Adaptive Probing and Scheduling in Dense WLANs

Add code
Dec 27, 2022
Viaarxiv icon