Yao Lu

NVILA: Efficient Frontier Visual Language Models

Dec 05, 2024
Via arXiv

Instruct or Interact? Exploring and Eliciting LLMs' Capability in Code Snippet Adaptation Through Prompt Engineering

Nov 23, 2024
Via arXiv

Reassessing Layer Pruning in LLMs: New Insights and Methods

Nov 23, 2024
Via arXiv

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

Nov 19, 2024
Via arXiv

RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively

Nov 15, 2024
Via arXiv

Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language

Oct 31, 2024
Via arXiv

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Oct 25, 2024
Via arXiv

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Oct 15, 2024
Via arXiv

ECGN: A Cluster-Aware Approach to Graph Neural Networks for Imbalanced Classification

Oct 15, 2024
Via arXiv

SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models

Oct 14, 2024
Via arXiv