Haewoon Kwak

Understanding Moral Reasoning Trajectories in Large Language Models: Toward Probing-Based Explainability

Mar 16, 2026

LLMs Can Infer Political Alignment from Online Conversations

Mar 11, 2026

Vulnerability of LLMs' Belief Systems? LLMs Belief Resistance Check Through Strategic Persuasive Conversation Interventions

Jan 20, 2026

XChoice: Explainable Evaluation of AI-Human Alignment in LLM-based Constrained Choice Decision Making

Jan 16, 2026

Neural embedding of beliefs reveals the role of relative dissonance in human decision-making

Aug 13, 2024

Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity

Apr 02, 2024

ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales?

Mar 26, 2024

Benchmarking zero-shot stance detection with FlanT5-XXL: Insights from training data, prompting, and decoding strategies into its near-SoTA performance

Mar 01, 2024

Token-Ensemble Text Generation: On Attacking the Automatic AI-Generated Text Detection

Feb 17, 2024

Can we trust the evaluation on ChatGPT?

Mar 22, 2023