Yuxin Wu

Kimi K2: Open Agentic Intelligence

Jul 28, 2025
Via arXiv

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

May 19, 2025
Via arXiv

Kimi-Audio Technical Report

Apr 25, 2025
Via arXiv

Kimi-VL Technical Report

Apr 10, 2025
Via arXiv

Muon is Scalable for LLM Training

Feb 24, 2025
Via arXiv

MoBA: Mixture of Block Attention for Long-Context LLMs

Feb 18, 2025
Via arXiv

A Plug-and-Play Bregman ADMM Module for Inferring Event Branches in Temporal Point Processes

Jan 08, 2025
Via arXiv

Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity

Nov 15, 2024
Via arXiv

AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis

Nov 04, 2024
Via arXiv

FlamePINN-1D: Physics-informed neural networks to solve forward and inverse problems of 1D laminar flames

Jun 07, 2024
Via arXiv