Picture for Hosung Song

Hosung Song

DEER: A Comprehensive and Reliable Benchmark for Deep-Research Expert Reports

Add code
Dec 19, 2025
Viaarxiv icon

KL Penalty Control via Perturbation for Direct Preference Optimization

Add code
Feb 18, 2025
Figure 1 for KL Penalty Control via Perturbation for Direct Preference Optimization
Figure 2 for KL Penalty Control via Perturbation for Direct Preference Optimization
Figure 3 for KL Penalty Control via Perturbation for Direct Preference Optimization
Figure 4 for KL Penalty Control via Perturbation for Direct Preference Optimization
Viaarxiv icon

External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems

Add code
Sep 06, 2022
Figure 1 for External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems
Figure 2 for External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems
Figure 3 for External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems
Figure 4 for External Knowledge Selection with Weighted Negative Sampling in Knowledge-grounded Task-oriented Dialogue Systems
Viaarxiv icon