Picture for Hany Hassan Awadalla

Hany Hassan Awadalla

Microsoft Redmond

Dissecting In-Context Learning of Translations in GPTs

Add code
Oct 24, 2023
Viaarxiv icon

Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness

Add code
Oct 03, 2023
Viaarxiv icon

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

Add code
Sep 20, 2023
Viaarxiv icon

Task-Based MoE for Multitask Multilingual Machine Translation

Add code
Sep 11, 2023
Figure 1 for Task-Based MoE for Multitask Multilingual Machine Translation
Figure 2 for Task-Based MoE for Multitask Multilingual Machine Translation
Figure 3 for Task-Based MoE for Multitask Multilingual Machine Translation
Figure 4 for Task-Based MoE for Multitask Multilingual Machine Translation
Viaarxiv icon

FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs

Add code
Aug 16, 2023
Figure 1 for FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Figure 2 for FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Figure 3 for FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Figure 4 for FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Viaarxiv icon

Do GPTs Produce Less Literal Translations?

Add code
Jun 06, 2023
Figure 1 for Do GPTs Produce Less Literal Translations?
Figure 2 for Do GPTs Produce Less Literal Translations?
Figure 3 for Do GPTs Produce Less Literal Translations?
Figure 4 for Do GPTs Produce Less Literal Translations?
Viaarxiv icon

ResiDual: Transformer with Dual Residual Connections

Add code
Apr 28, 2023
Figure 1 for ResiDual: Transformer with Dual Residual Connections
Figure 2 for ResiDual: Transformer with Dual Residual Connections
Figure 3 for ResiDual: Transformer with Dual Residual Connections
Figure 4 for ResiDual: Transformer with Dual Residual Connections
Viaarxiv icon

How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation

Add code
Feb 18, 2023
Figure 1 for How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation
Figure 2 for How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation
Figure 3 for How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation
Figure 4 for How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation
Viaarxiv icon

Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production

Add code
Nov 18, 2022
Figure 1 for Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production
Figure 2 for Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production
Figure 3 for Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production
Figure 4 for Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production
Viaarxiv icon

Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization

Add code
Aug 21, 2022
Figure 1 for Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Figure 2 for Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Figure 3 for Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Figure 4 for Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Viaarxiv icon