Picture for Damai Dai

Damai Dai

PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization

Add code
Feb 25, 2024
Figure 1 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Figure 2 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Figure 3 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Figure 4 for PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Viaarxiv icon

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Add code
Jan 11, 2024
Figure 1 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Figure 2 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Figure 3 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Figure 4 for DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Viaarxiv icon

Language Models Understand Numbers, at Least Partially

Add code
Jan 08, 2024
Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Jan 05, 2024
Figure 1 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 2 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 3 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 4 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Viaarxiv icon

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Add code
Dec 28, 2023
Figure 1 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 2 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 3 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 4 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Viaarxiv icon

Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning

Add code
Oct 12, 2023
Viaarxiv icon

Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion

Add code
May 25, 2023
Figure 1 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 2 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 3 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Figure 4 for Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
Viaarxiv icon

Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization

Add code
May 24, 2023
Figure 1 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 2 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 3 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Figure 4 for Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization
Viaarxiv icon

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Add code
May 23, 2023
Figure 1 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Figure 2 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Figure 3 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Figure 4 for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Viaarxiv icon

A Survey for In-context Learning

Add code
Dec 31, 2022
Figure 1 for A Survey for In-context Learning
Figure 2 for A Survey for In-context Learning
Figure 3 for A Survey for In-context Learning
Figure 4 for A Survey for In-context Learning
Viaarxiv icon