Xuezhe Ma

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Apr 12, 2024
Via arXiv

Evaluating Large Language Models on Controlled Generation Tasks

Oct 23, 2023
Via arXiv

LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Oct 05, 2023
Via arXiv

MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways

Oct 04, 2023
Via arXiv

RECAP: Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation

Jun 12, 2023
Via arXiv

Challenges in Context-Aware Neural Machine Translation

May 23, 2023
Via arXiv

Look-back Decoding for Open-Ended Text Generation

May 22, 2023
Via arXiv

LIMA: Less Is More for Alignment

May 18, 2023
Via arXiv

Better May Not Be Fairer: Can Data Augmentation Mitigate Subgroup Degradation?

Dec 16, 2022
Via arXiv

On Human Visual Contrast Sensitivity and Machine Vision Robustness: A Comparative Study

Dec 16, 2022
Via arXiv