Picture for Jiwoo Hong

Jiwoo Hong

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Add code
May 17, 2025
Viaarxiv icon

On the Robustness of Reward Models for Language Model Alignment

Add code
May 12, 2025
Viaarxiv icon

Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning

Add code
Apr 04, 2025
Viaarxiv icon

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Add code
Feb 24, 2025
Viaarxiv icon

AlphaPO -- Reward shape matters for LLM alignment

Add code
Jan 07, 2025
Viaarxiv icon

Evaluating the Consistency of LLM Evaluators

Add code
Nov 30, 2024
Viaarxiv icon

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Add code
Oct 23, 2024
Figure 1 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Figure 2 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Figure 3 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Figure 4 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Viaarxiv icon

Stable Language Model Pre-training by Reducing Embedding Variability

Add code
Sep 12, 2024
Viaarxiv icon

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Add code
Jun 10, 2024
Viaarxiv icon

ORPO: Monolithic Preference Optimization without Reference Model

Add code
Mar 14, 2024
Viaarxiv icon