Picture for Xuanjing Huang

Xuanjing Huang

Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks

Add code
Jul 13, 2024
Figure 1 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Figure 2 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Figure 3 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Figure 4 for Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Viaarxiv icon

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

Add code
Jul 08, 2024
Figure 1 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Figure 2 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Figure 3 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Figure 4 for What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Viaarxiv icon

HAF-RM: A Hybrid Alignment Framework for Reward Model Training

Add code
Jul 04, 2024
Viaarxiv icon

Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement

Add code
Jul 01, 2024
Figure 1 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Figure 2 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Figure 3 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Figure 4 for Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
Viaarxiv icon

Searching for Best Practices in Retrieval-Augmented Generation

Add code
Jul 01, 2024
Figure 1 for Searching for Best Practices in Retrieval-Augmented Generation
Figure 2 for Searching for Best Practices in Retrieval-Augmented Generation
Figure 3 for Searching for Best Practices in Retrieval-Augmented Generation
Figure 4 for Searching for Best Practices in Retrieval-Augmented Generation
Viaarxiv icon

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement

Add code
Jun 29, 2024
Figure 1 for LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
Figure 2 for LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
Figure 3 for LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
Figure 4 for LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement
Viaarxiv icon

SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance

Add code
Jun 26, 2024
Figure 1 for SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance
Figure 2 for SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance
Figure 3 for SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance
Figure 4 for SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance
Viaarxiv icon

Scaling Laws for Fact Memorization of Large Language Models

Add code
Jun 22, 2024
Viaarxiv icon

Cross-Modality Safety Alignment

Add code
Jun 21, 2024
Viaarxiv icon

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation

Add code
Jun 20, 2024
Figure 1 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Figure 2 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Figure 3 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Figure 4 for Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
Viaarxiv icon