Picture for Hongshen Xu

Hongshen Xu

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon

DiSRouter: Distributed Self-Routing for LLM Selections

Add code
Oct 22, 2025
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

Alignment for Efficient Tool Calling of Large Language Models

Add code
Mar 09, 2025
Figure 1 for Alignment for Efficient Tool Calling of Large Language Models
Figure 2 for Alignment for Efficient Tool Calling of Large Language Models
Figure 3 for Alignment for Efficient Tool Calling of Large Language Models
Figure 4 for Alignment for Efficient Tool Calling of Large Language Models
Viaarxiv icon

Delusions of Large Language Models

Add code
Mar 09, 2025
Viaarxiv icon

Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling

Add code
Mar 04, 2025
Viaarxiv icon

Reducing Tool Hallucination via Reliability Alignment

Add code
Dec 05, 2024
Figure 1 for Reducing Tool Hallucination via Reliability Alignment
Figure 2 for Reducing Tool Hallucination via Reliability Alignment
Figure 3 for Reducing Tool Hallucination via Reliability Alignment
Figure 4 for Reducing Tool Hallucination via Reliability Alignment
Viaarxiv icon

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Add code
Dec 03, 2024
Figure 1 for Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity
Figure 2 for Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity
Figure 3 for Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity
Figure 4 for Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity
Viaarxiv icon

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Add code
Jul 15, 2024
Figure 1 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 2 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 3 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 4 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Viaarxiv icon