Picture for Hengli Li

Hengli Li

Discrete Markov Bridge

Add code
May 26, 2025
Viaarxiv icon

Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision

Add code
May 21, 2025
Viaarxiv icon

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Add code
May 19, 2025
Viaarxiv icon

Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders

Add code
Dec 31, 2024
Viaarxiv icon

How to Synthesize Text Data without Model Collapse?

Add code
Dec 19, 2024
Figure 1 for How to Synthesize Text Data without Model Collapse?
Figure 2 for How to Synthesize Text Data without Model Collapse?
Figure 3 for How to Synthesize Text Data without Model Collapse?
Figure 4 for How to Synthesize Text Data without Model Collapse?
Viaarxiv icon

DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning

Add code
Jun 19, 2023
Figure 1 for DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Figure 2 for DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Figure 3 for DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Figure 4 for DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Viaarxiv icon