Picture for Songlin Yang

Songlin Yang

Kimi Linear: An Expressive, Efficient Attention Architecture

Add code
Oct 30, 2025
Viaarxiv icon

Instant Preference Alignment for Text-to-Image Diffusion Models

Add code
Aug 25, 2025
Viaarxiv icon

Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation

Add code
Jun 24, 2025
Viaarxiv icon

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Add code
Jun 05, 2025
Viaarxiv icon

Log-Linear Attention

Add code
Jun 05, 2025
Figure 1 for Log-Linear Attention
Figure 2 for Log-Linear Attention
Figure 3 for Log-Linear Attention
Figure 4 for Log-Linear Attention
Viaarxiv icon

Test-Time Training Done Right

Add code
May 29, 2025
Viaarxiv icon

PaTH Attention: Position Encoding via Accumulating Householder Transformations

Add code
May 22, 2025
Viaarxiv icon

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Add code
May 10, 2025
Viaarxiv icon

Inductive Spatio-Temporal Kriging with Physics-Guided Increment Training Strategy for Air Quality Inference

Add code
Mar 12, 2025
Viaarxiv icon

Textured 3D Regenerative Morphing with 3D Diffusion Prior

Add code
Feb 20, 2025
Viaarxiv icon