Konrad Staniszewski

Analysing The Impact of Sequence Composition on Language Model Pre-Training

Feb 21, 2024
Via arXiv

Structured Packing in LLM Training Improves Long Context Utilization

Jan 02, 2024
Via arXiv

Focused Transformer: Contrastive Training for Context Scaling

Jul 06, 2023
Via arXiv