Picture for Yao Lu

Yao Lu

Reassessing Layer Pruning in LLMs: New Insights and Methods

Add code
Nov 23, 2024
Figure 1 for Reassessing Layer Pruning in LLMs: New Insights and Methods
Figure 2 for Reassessing Layer Pruning in LLMs: New Insights and Methods
Figure 3 for Reassessing Layer Pruning in LLMs: New Insights and Methods
Figure 4 for Reassessing Layer Pruning in LLMs: New Insights and Methods
Viaarxiv icon

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

Add code
Nov 19, 2024
Figure 1 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 2 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 3 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 4 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Viaarxiv icon

RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively

Add code
Nov 15, 2024
Figure 1 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Figure 2 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Figure 3 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Figure 4 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Viaarxiv icon

Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language

Add code
Oct 31, 2024
Figure 1 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 2 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 3 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Figure 4 for Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Viaarxiv icon

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Add code
Oct 25, 2024
Figure 1 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 2 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 3 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Figure 4 for COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Viaarxiv icon

ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification

Add code
Oct 15, 2024
Figure 1 for ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification
Figure 2 for ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification
Figure 3 for ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification
Figure 4 for ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification
Viaarxiv icon

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Add code
Oct 15, 2024
Figure 1 for SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Figure 2 for SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Figure 3 for SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Figure 4 for SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Viaarxiv icon

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Add code
Oct 14, 2024
Figure 1 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 2 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 3 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 4 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Viaarxiv icon

Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models

Add code
Oct 14, 2024
Figure 1 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Figure 2 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Figure 3 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Figure 4 for Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
Viaarxiv icon

SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models

Add code
Oct 14, 2024
Figure 1 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Figure 2 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Figure 3 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Figure 4 for SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Viaarxiv icon