Tianyu Ruan

Muon Learns More Robust and Transferable Features than Adam

Jun 08, 2026
Via arXiv

Towards understanding how attention mechanism works in deep learning

Dec 24, 2024
Via arXiv