Picture for Kaiyue Yang

Kaiyue Yang

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Add code
May 12, 2025
Viaarxiv icon

MMViT: Multiscale Multiview Vision Transformers

Add code
Apr 28, 2023
Figure 1 for MMViT: Multiscale Multiview Vision Transformers
Figure 2 for MMViT: Multiscale Multiview Vision Transformers
Figure 3 for MMViT: Multiscale Multiview Vision Transformers
Figure 4 for MMViT: Multiscale Multiview Vision Transformers
Viaarxiv icon