Siyao Peng

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)

Oct 14, 2025

Evaluation Should Not Ignore Variation: On the Impact of Reference Set Choice on Summarization Metrics

Jun 17, 2025

LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference

May 28, 2025

What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse

May 22, 2025

A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI

Dec 18, 2024

MultiClimate: Multimodal Stance Detection on Climate Change Videos

Sep 26, 2024

"Seeing the Big through the Small": Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?

Jun 25, 2024

CLIMATELI: Evaluating Entity Linking on Climate Change Data

Jun 24, 2024

SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolution

Mar 25, 2024

eRST: A Signaled Graph Theory of Discourse Relations and Organization

Mar 20, 2024