Picture for Bowen Shen

Bowen Shen

DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts

Add code
Jun 11, 2025
Viaarxiv icon

TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization

Add code
May 26, 2025
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations

Add code
Jul 08, 2024
Figure 1 for Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
Figure 2 for Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
Figure 3 for Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
Figure 4 for Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
Viaarxiv icon

Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning

Add code
Jun 06, 2024
Viaarxiv icon

SMAT: A Self-Reinforcing Framework for Simultaneous Mapping and Tracking in Unbounded Urban Environments

Add code
Apr 27, 2023
Viaarxiv icon

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

Add code
Oct 27, 2022
Viaarxiv icon

DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments

Add code
Jun 30, 2022
Figure 1 for DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments
Figure 2 for DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments
Figure 3 for DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments
Figure 4 for DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments
Viaarxiv icon

Deep Learning for Highly Accelerated Diffusion Tensor Imaging

Add code
Feb 03, 2020
Figure 1 for Deep Learning for Highly Accelerated Diffusion Tensor Imaging
Figure 2 for Deep Learning for Highly Accelerated Diffusion Tensor Imaging
Figure 3 for Deep Learning for Highly Accelerated Diffusion Tensor Imaging
Figure 4 for Deep Learning for Highly Accelerated Diffusion Tensor Imaging
Viaarxiv icon