Picture for Sewoong Oh

Sewoong Oh

Are Robust LLM Fingerprints Adversarially Robust?

Add code
Sep 30, 2025
Viaarxiv icon

Sampling from Your Language Model One Byte at a Time

Add code
Jun 17, 2025
Viaarxiv icon

Spurious Rewards: Rethinking Training Signals in RLVR

Add code
Jun 12, 2025
Figure 1 for Spurious Rewards: Rethinking Training Signals in RLVR
Figure 2 for Spurious Rewards: Rethinking Training Signals in RLVR
Figure 3 for Spurious Rewards: Rethinking Training Signals in RLVR
Figure 4 for Spurious Rewards: Rethinking Training Signals in RLVR
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

Zeroth-Order Optimization Finds Flat Minima

Add code
Jun 05, 2025
Viaarxiv icon

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Add code
Jun 05, 2025
Viaarxiv icon

Foundation model for mass spectrometry proteomics

Add code
May 19, 2025
Viaarxiv icon

A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

Add code
Apr 28, 2025
Viaarxiv icon

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Add code
Mar 26, 2025
Viaarxiv icon

SuperBPE: Space Travel for Language Models

Add code
Mar 17, 2025
Figure 1 for SuperBPE: Space Travel for Language Models
Figure 2 for SuperBPE: Space Travel for Language Models
Figure 3 for SuperBPE: Space Travel for Language Models
Figure 4 for SuperBPE: Space Travel for Language Models
Viaarxiv icon