RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming

Jun 04, 2025
Figures 1-4 for RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming

View paper on arXiv