Dian Yu

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Jun 30, 2024
Via arXiv

LiteSearch: Efficacious Tree Search for LLM

Jun 29, 2024
Via arXiv

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Jun 28, 2024
Via arXiv

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Jun 17, 2024
Via arXiv

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

May 29, 2024
Via arXiv

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Apr 18, 2024
Via arXiv

Conceptual and Unbiased Reasoning in Language Models

Mar 30, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

Retrieval Augmented End-to-End Spoken Dialog Models

Feb 02, 2024
Via arXiv

Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

Jan 17, 2024
Via arXiv