Picture for Vasudev Shyam

Vasudev Shyam

Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior

Add code
May 06, 2026
Viaarxiv icon

Symmetry Breaking in Transformers for Efficient and Interpretable Training

Add code
Jan 29, 2026
Viaarxiv icon

The Zamba2 Suite: Technical Report

Add code
Nov 22, 2024
Figure 1 for The Zamba2 Suite: Technical Report
Figure 2 for The Zamba2 Suite: Technical Report
Figure 3 for The Zamba2 Suite: Technical Report
Figure 4 for The Zamba2 Suite: Technical Report
Viaarxiv icon

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Add code
Aug 09, 2024
Figure 1 for Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
Figure 2 for Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
Figure 3 for Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
Figure 4 for Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
Viaarxiv icon