Picture for Walid Ahmed

Walid Ahmed

Distributed Hybrid Parallelism for Large Language Models: Comparative Study and System Design Guide

Add code
Feb 09, 2026
Viaarxiv icon

EPAS: Efficient Training with Progressive Activation Sharing

Add code
Jan 27, 2026
Viaarxiv icon

FLOP-Efficient Training: Early Stopping Based on Test-Time Compute Awareness

Add code
Jan 04, 2026
Viaarxiv icon

Bayesian Mixture of Experts For Large Language Models

Add code
Nov 12, 2025
Viaarxiv icon

Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection

Add code
May 26, 2025
Viaarxiv icon

Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks

Add code
May 26, 2025
Figure 1 for Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks
Figure 2 for Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks
Figure 3 for Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks
Figure 4 for Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks
Viaarxiv icon

ECHO-LLaMA: Efficient Caching for High-Performance LLaMA Training

Add code
May 22, 2025
Viaarxiv icon

Accelerating the Low-Rank Decomposed Models

Add code
Jul 24, 2024
Figure 1 for Accelerating the Low-Rank Decomposed Models
Figure 2 for Accelerating the Low-Rank Decomposed Models
Figure 3 for Accelerating the Low-Rank Decomposed Models
Figure 4 for Accelerating the Low-Rank Decomposed Models
Viaarxiv icon

Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?

Add code
Jul 23, 2024
Figure 1 for Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Figure 2 for Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Figure 3 for Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Figure 4 for Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Viaarxiv icon

SkipViT: Speeding Up Vision Transformers with a Token-Level Skip Connection

Add code
Jan 27, 2024
Viaarxiv icon