Picture for Li Du

Li Du

School of Electronic Science and Engineering, Nanjing University

PAT: Pruning-Aware Tuning for Large Language Models

Add code
Aug 27, 2024
Viaarxiv icon

Causal-Guided Active Learning for Debiasing Large Language Models

Add code
Aug 23, 2024
Viaarxiv icon

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning

Add code
Aug 21, 2024
Figure 1 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Figure 2 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Figure 3 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Figure 4 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Viaarxiv icon

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Add code
Aug 13, 2024
Viaarxiv icon

SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Add code
Jul 05, 2024
Figure 1 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 2 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 3 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 4 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Viaarxiv icon

SFC: Achieve Accurate Fast Convolution under Low-precision Arithmetic

Add code
Jul 03, 2024
Viaarxiv icon

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation

Add code
May 26, 2024
Figure 1 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Figure 2 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Figure 3 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Figure 4 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Viaarxiv icon

Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges

Add code
May 17, 2024
Figure 1 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 2 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 3 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 4 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Viaarxiv icon

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning

Add code
Apr 13, 2024
Viaarxiv icon

Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation

Add code
Apr 03, 2024
Viaarxiv icon