Picture for Peiran Wang

Peiran Wang

From Hallucinations to Jailbreaks: Rethinking the Vulnerability of Large Foundation Models

Add code
May 30, 2025
Viaarxiv icon

Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs

Add code
Feb 19, 2025
Viaarxiv icon

What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis

Add code
Feb 19, 2025
Viaarxiv icon

Fair-MoE: Fairness-Oriented Mixture of Experts in Vision-Language Models

Add code
Feb 10, 2025
Viaarxiv icon

RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process

Add code
Oct 11, 2024
Figure 1 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Figure 2 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Figure 3 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Figure 4 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Viaarxiv icon

Training on Fake Labels: Mitigating Label Leakage in Split Learning via Secure Dimension Transformation

Add code
Oct 11, 2024
Viaarxiv icon

DistDD: Distributed Data Distillation Aggregation through Gradient Matching

Add code
Oct 11, 2024
Viaarxiv icon

FedCliP: Federated Learning with Client Pruning

Add code
Jan 17, 2023
Viaarxiv icon