Xiang Kong

Large Language Model-guided Document Selection

Jun 07, 2024
Via arXiv

Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

May 23, 2024
Via arXiv

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Mar 22, 2024
Via arXiv

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Feb 19, 2024
Via arXiv

Mega: Moving Average Equipped Gated Attention

Sep 26, 2022
Via arXiv

Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders

Jun 05, 2022
Via arXiv

BLT: Bidirectional Layout Transformer for Controllable Layout Generation

Dec 09, 2021
Via arXiv

Luna: Linear Unified Nested Attention

Jun 03, 2021
Via arXiv

Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade

Dec 31, 2020
Via arXiv

Incorporating a Local Translation Mechanism into Non-autoregressive Translation

Nov 12, 2020
Via arXiv