Picture for Xuanjing Huang

Xuanjing Huang

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Add code
Nov 25, 2024
Figure 1 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 2 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 3 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 4 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Viaarxiv icon

PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation

Add code
Nov 21, 2024
Figure 1 for PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation
Figure 2 for PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation
Figure 3 for PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation
Figure 4 for PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation
Viaarxiv icon

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Add code
Nov 11, 2024
Figure 1 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 2 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 3 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 4 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Viaarxiv icon

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Add code
Nov 01, 2024
Viaarxiv icon

Multi-Programming Language Sandbox for LLMs

Add code
Oct 30, 2024
Figure 1 for Multi-Programming Language Sandbox for LLMs
Figure 2 for Multi-Programming Language Sandbox for LLMs
Figure 3 for Multi-Programming Language Sandbox for LLMs
Figure 4 for Multi-Programming Language Sandbox for LLMs
Viaarxiv icon

ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Add code
Oct 28, 2024
Figure 1 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents
Figure 2 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents
Figure 3 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents
Figure 4 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents
Viaarxiv icon

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Add code
Oct 27, 2024
Figure 1 for Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
Figure 2 for Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
Figure 3 for Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
Figure 4 for Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
Viaarxiv icon

AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Add code
Oct 25, 2024
Figure 1 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
Figure 2 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
Figure 3 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
Figure 4 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
Viaarxiv icon

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Add code
Oct 24, 2024
Figure 1 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 2 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 3 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Figure 4 for Distill Visual Chart Reasoning Ability from LLMs to MLLMs
Viaarxiv icon

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Add code
Oct 20, 2024
Figure 1 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Figure 2 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Figure 3 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Figure 4 for Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
Viaarxiv icon