Seungone Kim

Let's Predict Sentence by Sentence

May 28, 2025
Via arXiv

Measuring Sycophancy of Language Models in Multi-turn Dialogues

May 28, 2025
Via arXiv

FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS

May 22, 2025
Via arXiv

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

May 21, 2025
Via arXiv

Reasoning Models Better Express Their Confidence

May 20, 2025
Via arXiv

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

May 15, 2025
Via arXiv

M-Prometheus: A Suite of Open Multilingual LLM Judges

Apr 07, 2025
Via arXiv

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Mar 25, 2025
Via arXiv

Bridging the Data Provenance Gap Across Text, Speech and Video

Dec 19, 2024
Via arXiv

LLM-AS-AN-INTERVIEWER: Beyond Static Testing Through Dynamic LLM Evaluation

Dec 10, 2024
Via arXiv