Picture for Xuanjing Huang

Xuanjing Huang

Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric

Add code
Feb 25, 2025
Figure 1 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Figure 2 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Figure 3 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Figure 4 for Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
Viaarxiv icon

Thus Spake Long-Context Large Language Model

Add code
Feb 24, 2025
Viaarxiv icon

AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models

Add code
Feb 24, 2025
Figure 1 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Figure 2 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Figure 3 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Figure 4 for AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models
Viaarxiv icon

How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation

Add code
Feb 20, 2025
Viaarxiv icon

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Add code
Feb 13, 2025
Viaarxiv icon

Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction

Add code
Feb 08, 2025
Figure 1 for Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction
Figure 2 for Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction
Figure 3 for Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction
Figure 4 for Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction
Viaarxiv icon

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training

Add code
Feb 06, 2025
Viaarxiv icon

Toward Relative Positional Encoding in Spiking Transformers

Add code
Jan 28, 2025
Figure 1 for Toward Relative Positional Encoding in Spiking Transformers
Figure 2 for Toward Relative Positional Encoding in Spiking Transformers
Figure 3 for Toward Relative Positional Encoding in Spiking Transformers
Figure 4 for Toward Relative Positional Encoding in Spiking Transformers
Viaarxiv icon

Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Add code
Jan 26, 2025
Figure 1 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Figure 2 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Figure 3 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Figure 4 for Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
Viaarxiv icon

Dendritic Localized Learning: Toward Biologically Plausible Algorithm

Add code
Jan 17, 2025
Figure 1 for Dendritic Localized Learning: Toward Biologically Plausible Algorithm
Figure 2 for Dendritic Localized Learning: Toward Biologically Plausible Algorithm
Figure 3 for Dendritic Localized Learning: Toward Biologically Plausible Algorithm
Figure 4 for Dendritic Localized Learning: Toward Biologically Plausible Algorithm
Viaarxiv icon