Kimi K2


KLong: Training LLM Agent for Extremely Long-horizon Tasks

Feb 19, 2026

Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments

Feb 12, 2026

EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering

Feb 05, 2026

Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning

Feb 02, 2026

Semantic Compression of LLM Instructions via Symbolic Metalanguages

Jan 12, 2026

Coding in a Bubble? Evaluating LLMs in Resolving Context Adaptation Bugs During Code Adaptation

Jan 10, 2026

MiMo-V2-Flash Technical Report

Jan 08, 2026

Beyond Accuracy: A Geometric Stability Analysis of Large Language Models in Chess Evaluation

Dec 17, 2025

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Nov 09, 2025

ROSBag MCP Server: Analyzing Robot Data with LLMs for Agentic Embodied AI Applications

Nov 05, 2025