Picture for Jianlin Su

Jianlin Su

Kimi Linear: An Expressive, Efficient Attention Architecture

Add code
Oct 30, 2025
Viaarxiv icon

Kimi K2: Open Agentic Intelligence

Add code
Jul 28, 2025
Viaarxiv icon

Kimi-VL Technical Report

Add code
Apr 10, 2025
Figure 1 for Kimi-VL Technical Report
Figure 2 for Kimi-VL Technical Report
Figure 3 for Kimi-VL Technical Report
Figure 4 for Kimi-VL Technical Report
Viaarxiv icon

Muon is Scalable for LLM Training

Add code
Feb 24, 2025
Viaarxiv icon

MoBA: Mixture of Block Attention for Long-Context LLMs

Add code
Feb 18, 2025
Figure 1 for MoBA: Mixture of Block Attention for Long-Context LLMs
Figure 2 for MoBA: Mixture of Block Attention for Long-Context LLMs
Figure 3 for MoBA: Mixture of Block Attention for Long-Context LLMs
Figure 4 for MoBA: Mixture of Block Attention for Long-Context LLMs
Viaarxiv icon

DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space

Add code
Dec 19, 2024
Viaarxiv icon

Naive Bayes-based Context Extension for Large Language Models

Add code
Mar 26, 2024
Figure 1 for Naive Bayes-based Context Extension for Large Language Models
Figure 2 for Naive Bayes-based Context Extension for Large Language Models
Figure 3 for Naive Bayes-based Context Extension for Large Language Models
Figure 4 for Naive Bayes-based Context Extension for Large Language Models
Viaarxiv icon

VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

Add code
Mar 01, 2024
Viaarxiv icon

Elucidating the Exposure Bias in Diffusion Models

Add code
Sep 12, 2023
Viaarxiv icon

Rank-Aware Negative Training for Semi-Supervised Text Classification

Add code
Jun 13, 2023
Viaarxiv icon