Picture for Rameswar Panda

Rameswar Panda

Richard

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Add code
May 07, 2024
Figure 1 for Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Figure 2 for Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Figure 3 for Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Figure 4 for Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Viaarxiv icon

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Add code
Apr 08, 2024
Figure 1 for Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
Figure 2 for Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
Figure 3 for Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
Figure 4 for Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
Viaarxiv icon

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization

Add code
Apr 04, 2024
Viaarxiv icon

Scattered Mixture-of-Experts Implementation

Add code
Mar 13, 2024
Viaarxiv icon

API Pack: A Massive Multilingual Dataset for API Call Generation

Add code
Feb 16, 2024
Viaarxiv icon

Data Engineering for Scaling Language Models to 128K Context

Add code
Feb 15, 2024
Viaarxiv icon

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

Add code
Feb 04, 2024
Viaarxiv icon

Gated Linear Attention Transformers with Hardware-Efficient Training

Add code
Dec 24, 2023
Figure 1 for Gated Linear Attention Transformers with Hardware-Efficient Training
Figure 2 for Gated Linear Attention Transformers with Hardware-Efficient Training
Figure 3 for Gated Linear Attention Transformers with Hardware-Efficient Training
Figure 4 for Gated Linear Attention Transformers with Hardware-Efficient Training
Viaarxiv icon

Learning Human Action Recognition Representations Without Real Humans

Add code
Nov 10, 2023
Viaarxiv icon

LangNav: Language as a Perceptual Representation for Navigation

Add code
Oct 11, 2023
Figure 1 for LangNav: Language as a Perceptual Representation for Navigation
Figure 2 for LangNav: Language as a Perceptual Representation for Navigation
Figure 3 for LangNav: Language as a Perceptual Representation for Navigation
Figure 4 for LangNav: Language as a Perceptual Representation for Navigation
Viaarxiv icon