Alert button
Picture for Shuai Zhang

Shuai Zhang

Alert button

On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $ε$-Greedy Exploration

Add code
Bookmark button
Alert button
Oct 24, 2023
Shuai Zhang, Hongkang Li, Meng Wang, Miao Liu, Pin-Yu Chen, Songtao Lu, Sijia Liu, Keerthiram Murugesan, Subhajit Chaudhury

Viaarxiv icon

Offline Imitation Learning with Variational Counterfactual Reasoning

Add code
Bookmark button
Alert button
Oct 17, 2023
Bowei He, Zexu Sun, Jinxin Liu, Shuai Zhang, Xu Chen, Chen Ma

Figure 1 for Offline Imitation Learning with Variational Counterfactual Reasoning
Figure 2 for Offline Imitation Learning with Variational Counterfactual Reasoning
Figure 3 for Offline Imitation Learning with Variational Counterfactual Reasoning
Figure 4 for Offline Imitation Learning with Variational Counterfactual Reasoning
Viaarxiv icon

Lightweight In-Context Tuning for Multimodal Unified Models

Add code
Bookmark button
Alert button
Oct 08, 2023
Yixin Chen, Shuai Zhang, Boran Han, Jiaya Jia

Figure 1 for Lightweight In-Context Tuning for Multimodal Unified Models
Figure 2 for Lightweight In-Context Tuning for Multimodal Unified Models
Figure 3 for Lightweight In-Context Tuning for Multimodal Unified Models
Figure 4 for Lightweight In-Context Tuning for Multimodal Unified Models
Viaarxiv icon

Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation

Add code
Bookmark button
Alert button
Sep 19, 2023
Huachuan Qiu, Shuai Zhang, Hongliang He, Anqi Li, Zhenzhong Lan

Figure 1 for Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation
Figure 2 for Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation
Figure 3 for Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation
Figure 4 for Facilitating NSFW Text Detection in Open-Domain Dialogue Systems via Knowledge Distillation
Viaarxiv icon

InfeRE: Step-by-Step Regex Generation via Chain of Inference

Add code
Bookmark button
Alert button
Aug 08, 2023
Shuai Zhang, Xiaodong Gu, Yuting Chen, Beijun Shen

Figure 1 for InfeRE: Step-by-Step Regex Generation via Chain of Inference
Figure 2 for InfeRE: Step-by-Step Regex Generation via Chain of Inference
Figure 3 for InfeRE: Step-by-Step Regex Generation via Chain of Inference
Figure 4 for InfeRE: Step-by-Step Regex Generation via Chain of Inference
Viaarxiv icon

A Benchmark for Understanding Dialogue Safety in Mental Health Support

Add code
Bookmark button
Alert button
Jul 31, 2023
Huachuan Qiu, Tong Zhao, Anqi Li, Shuai Zhang, Hongliang He, Zhenzhong Lan

Figure 1 for A Benchmark for Understanding Dialogue Safety in Mental Health Support
Figure 2 for A Benchmark for Understanding Dialogue Safety in Mental Health Support
Figure 3 for A Benchmark for Understanding Dialogue Safety in Mental Health Support
Figure 4 for A Benchmark for Understanding Dialogue Safety in Mental Health Support
Viaarxiv icon

MAS: Towards Resource-Efficient Federated Multiple-Task Learning

Add code
Bookmark button
Alert button
Jul 21, 2023
Weiming Zhuang, Yonggang Wen, Lingjuan Lyu, Shuai Zhang

Figure 1 for MAS: Towards Resource-Efficient Federated Multiple-Task Learning
Figure 2 for MAS: Towards Resource-Efficient Federated Multiple-Task Learning
Figure 3 for MAS: Towards Resource-Efficient Federated Multiple-Task Learning
Figure 4 for MAS: Towards Resource-Efficient Federated Multiple-Task Learning
Viaarxiv icon

Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models

Add code
Bookmark button
Alert button
Jul 17, 2023
Huachuan Qiu, Shuai Zhang, Anqi Li, Hongliang He, Zhenzhong Lan

Figure 1 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Figure 2 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Figure 3 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Figure 4 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Viaarxiv icon

Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Add code
Bookmark button
Alert button
Jul 16, 2023
Sikai Bai, Shuaicheng Li, Weiming Zhuang, Jie Zhang, Song Guo, Kunlin Yang, Jun Hou, Shuai Zhang, Junyu Gao, Shuai Yi

Figure 1 for Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators
Figure 2 for Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators
Figure 3 for Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators
Figure 4 for Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators
Viaarxiv icon