Xuanjing Huang

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

Oct 13, 2024
Via arXiv

AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models

Oct 10, 2024
Via arXiv

Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

Sep 25, 2024
Via arXiv

Empirical Insights on Fine-Tuning Large Language Models for Question-Answering

Sep 24, 2024
Via arXiv

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels

Sep 04, 2024
Via arXiv

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

Aug 27, 2024
Via arXiv

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities

Jul 31, 2024
Via arXiv

Identity-Driven Hierarchical Role-Playing Agents

Jul 28, 2024
Via arXiv

Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Jul 24, 2024
Via arXiv

Case2Code: Learning Inductive Reasoning with Synthetic Data

Jul 17, 2024
Via arXiv