Shuai Li

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

Aug 19, 2023

Disentangled Counterfactual Reasoning for Unbiased Sequential Recommendation

Aug 05, 2023

Player-optimal Stable Regret for Bandit Learning in Matching Markets

Jul 20, 2023

InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding

Jun 08, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer

May 26, 2023

Adversarial Attacks on Online Learning to Rank with Click Feedback

May 26, 2023

Online Influence Maximization under Decreasing Cascade Model

May 19, 2023

Large-Scale Package Manipulation via Learned Metrics of Pick Success

May 17, 2023

The Closeness of In-Context Learning and Weight Shifting for Softmax Regression

Apr 26, 2023

Using Alternation Direction Method of Multipliers to Enhance robots Calibration Accuracy based on Multi-Planal Constraints

Apr 23, 2023