Picture for Shijie Cao

Shijie Cao

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon

Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

Add code
Aug 09, 2025
Viaarxiv icon

Data Efficacy for Language Model Training

Add code
Jun 26, 2025
Viaarxiv icon

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Figure 1 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 2 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 3 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 4 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache

Add code
Mar 24, 2025
Viaarxiv icon

ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM

Add code
Mar 17, 2025
Figure 1 for ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM
Figure 2 for ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM
Figure 3 for ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM
Figure 4 for ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM
Viaarxiv icon

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

Add code
Feb 17, 2025
Viaarxiv icon

Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach

Add code
Nov 28, 2024
Figure 1 for Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach
Figure 2 for Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach
Figure 3 for Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach
Figure 4 for Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach
Viaarxiv icon