Picture for Dongbai Li

Dongbai Li

RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability

Add code
Apr 14, 2025
Figure 1 for RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability
Figure 2 for RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability
Figure 3 for RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability
Figure 4 for RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability
Viaarxiv icon

The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination

Add code
Mar 20, 2025
Viaarxiv icon

Sample Weight Averaging for Stable Prediction

Add code
Feb 11, 2025
Viaarxiv icon