Picture for Wenkai Yang

Wenkai Yang

Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization

Add code
Jun 17, 2024
Viaarxiv icon

Exploring Backdoor Vulnerabilities of Chat Models

Add code
Apr 03, 2024
Figure 1 for Exploring Backdoor Vulnerabilities of Chat Models
Figure 2 for Exploring Backdoor Vulnerabilities of Chat Models
Figure 3 for Exploring Backdoor Vulnerabilities of Chat Models
Figure 4 for Exploring Backdoor Vulnerabilities of Chat Models
Viaarxiv icon

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents

Add code
Feb 17, 2024
Figure 1 for Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
Figure 2 for Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
Figure 3 for Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
Figure 4 for Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents
Viaarxiv icon

Enabling Large Language Models to Learn from Rules

Add code
Nov 15, 2023
Figure 1 for Enabling Large Language Models to Learn from Rules
Figure 2 for Enabling Large Language Models to Learn from Rules
Figure 3 for Enabling Large Language Models to Learn from Rules
Figure 4 for Enabling Large Language Models to Learn from Rules
Viaarxiv icon

Two Stream Scene Understanding on Graph Embedding

Add code
Nov 12, 2023
Figure 1 for Two Stream Scene Understanding on Graph Embedding
Figure 2 for Two Stream Scene Understanding on Graph Embedding
Figure 3 for Two Stream Scene Understanding on Graph Embedding
Figure 4 for Two Stream Scene Understanding on Graph Embedding
Viaarxiv icon

Towards Codable Text Watermarking for Large Language Models

Add code
Jul 29, 2023
Figure 1 for Towards Codable Text Watermarking for Large Language Models
Figure 2 for Towards Codable Text Watermarking for Large Language Models
Figure 3 for Towards Codable Text Watermarking for Large Language Models
Figure 4 for Towards Codable Text Watermarking for Large Language Models
Viaarxiv icon

Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter

Add code
May 21, 2023
Figure 1 for Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter
Figure 2 for Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter
Figure 3 for Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter
Figure 4 for Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter
Viaarxiv icon

Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features

Add code
Jan 30, 2023
Figure 1 for Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
Figure 2 for Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
Figure 3 for Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
Figure 4 for Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
Viaarxiv icon

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning

Add code
Jan 26, 2023
Figure 1 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Figure 2 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Figure 3 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Figure 4 for Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning
Viaarxiv icon

When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning

Add code
Jan 25, 2023
Figure 1 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Figure 2 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Figure 3 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Figure 4 for When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning
Viaarxiv icon