Picture for David Cho

David Cho

Inference-Time Code Selection via Symbolic Equivalence Partitioning

Add code
Apr 07, 2026
Viaarxiv icon

SARL: Label-Free Reinforcement Learning by Rewarding Reasoning Topology

Add code
Mar 30, 2026
Viaarxiv icon

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

Add code
Apr 03, 2025
Figure 1 for More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
Figure 2 for More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
Figure 3 for More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
Figure 4 for More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
Viaarxiv icon