Picture for Xuanjing Huang

Xuanjing Huang

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Add code
Oct 20, 2024
Figure 1 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Figure 2 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Figure 3 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Figure 4 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Viaarxiv icon

Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs

Add code
Oct 15, 2024
Figure 1 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 2 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 3 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Figure 4 for Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
Viaarxiv icon

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

Add code
Oct 13, 2024
Figure 1 for RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Figure 2 for RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Figure 3 for RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Figure 4 for RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
Viaarxiv icon

AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models

Add code
Oct 10, 2024
Figure 1 for AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models
Figure 2 for AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models
Figure 3 for AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models
Figure 4 for AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models
Viaarxiv icon

Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

Add code
Sep 25, 2024
Figure 1 for Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing
Figure 2 for Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing
Figure 3 for Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing
Figure 4 for Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing
Viaarxiv icon

Empirical Insights on Fine-Tuning Large Language Models for Question-Answering

Add code
Sep 24, 2024
Figure 1 for Empirical Insights on Fine-Tuning Large Language Models for Question-Answering
Figure 2 for Empirical Insights on Fine-Tuning Large Language Models for Question-Answering
Figure 3 for Empirical Insights on Fine-Tuning Large Language Models for Question-Answering
Figure 4 for Empirical Insights on Fine-Tuning Large Language Models for Question-Answering
Viaarxiv icon

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels

Add code
Sep 04, 2024
Figure 1 for DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels
Figure 2 for DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels
Figure 3 for DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels
Figure 4 for DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels
Viaarxiv icon

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

Add code
Aug 27, 2024
Viaarxiv icon

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities

Add code
Jul 31, 2024
Figure 1 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Figure 2 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Figure 3 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Figure 4 for TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Viaarxiv icon

Identity-Driven Hierarchical Role-Playing Agents

Add code
Jul 28, 2024
Viaarxiv icon