Picture for Xuanjing Huang

Xuanjing Huang

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

Add code
Aug 27, 2024
Viaarxiv icon

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities

Add code
Jul 31, 2024
Figure 1 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Figure 2 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Figure 3 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Figure 4 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Viaarxiv icon

Identity-Driven Hierarchical Role-Playing Agents

Add code
Jul 28, 2024
Viaarxiv icon

Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Add code
Jul 24, 2024
Figure 1 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Figure 2 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Figure 3 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Figure 4 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Viaarxiv icon

Case2Code: Learning Inductive Reasoning with Synthetic Data

Add code
Jul 17, 2024
Figure 1 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Figure 2 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Figure 3 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Figure 4 for Case2Code: Learning Inductive Reasoning with Synthetic Data
Viaarxiv icon

Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks

Add code
Jul 13, 2024
Figure 1 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Figure 2 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Figure 3 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Figure 4 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Viaarxiv icon

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

Add code
Jul 08, 2024
Figure 1 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Figure 2 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Figure 3 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Figure 4 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Viaarxiv icon

HAF-RM: A Hybrid Alignment Framework for Reward Model Training

Add code
Jul 04, 2024
Viaarxiv icon

Searching for Best Practices in Retrieval-Augmented Generation

Add code
Jul 01, 2024
Figure 1 for Searching for Best Practices in Retrieval-Augmented Generation
Figure 2 for Searching for Best Practices in Retrieval-Augmented Generation
Figure 3 for Searching for Best Practices in Retrieval-Augmented Generation
Figure 4 for Searching for Best Practices in Retrieval-Augmented Generation
Viaarxiv icon

Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement

Add code
Jul 01, 2024
Figure 1 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Figure 2 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Figure 3 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Figure 4 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Viaarxiv icon