Picture for Rohan Shravan

Rohan Shravan

Reversible Foundations: Training a 120B Sparse MoE through State-Preserving Scaling

Add code
Jun 05, 2026
Viaarxiv icon

Kronecker Embeddings: Byte-Level Structured Token Representations for Parameter-Efficient Language Models

Add code
May 28, 2026
Viaarxiv icon

BrahmicTokenizer-131K: An Indic-Capable Drop-In Replacement for o200k_base

Add code
May 28, 2026
Viaarxiv icon