Picture for Kai Lv

Kai Lv

Mousse: Rectifying the Geometry of Muon with Curvature-Aware Preconditioning

Add code
Mar 10, 2026
Viaarxiv icon

DRFormer: A Dual-Regularized Bidirectional Transformer for Person Re-identification

Add code
Feb 01, 2026
Viaarxiv icon

Explicit Multi-head Attention for Inter-head Interaction in Large Language Models

Add code
Jan 27, 2026
Viaarxiv icon

CritiQ: Mining Data Quality Criteria from Human Preferences

Add code
Feb 26, 2025
Viaarxiv icon

FastMCTS: A Simple Sampling Strategy for Data Synthesis

Add code
Feb 17, 2025
Figure 1 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Figure 2 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Figure 3 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Figure 4 for FastMCTS: A Simple Sampling Strategy for Data Synthesis
Viaarxiv icon

CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness

Add code
Jan 09, 2025
Figure 1 for CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness
Figure 2 for CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness
Figure 3 for CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness
Figure 4 for CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness
Viaarxiv icon

Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space

Add code
Aug 14, 2024
Figure 1 for Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
Figure 2 for Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
Figure 3 for Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
Figure 4 for Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
Viaarxiv icon

Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope

Add code
Jul 21, 2024
Figure 1 for Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope
Figure 2 for Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope
Figure 3 for Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope
Figure 4 for Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

LongWanjuan: Towards Systematic Measurement for Long Text Quality

Add code
Feb 22, 2024
Viaarxiv icon