Network Pruning


Network pruning is a popular approach to reduce a heavy network to obtain a lightweight form by removing redundancy in the heavy network. In this approach, a complex over-parameterized network is first trained, then pruned based on some criteria, and finally fine-tuned to achieve comparable performance with reduced parameters.

Scalable Interconnect Learning in Boolean Networks

Add code
Jul 03, 2025
Viaarxiv icon

FlowSpec: Continuous Pipelined Speculative Decoding for Efficient Distributed LLM Inference

Add code
Jul 03, 2025
Viaarxiv icon

High-Layer Attention Pruning with Rescaling

Add code
Jul 02, 2025
Viaarxiv icon

Intrinsic and Extrinsic Organized Attention: Softmax Invariance and Network Sparsity

Add code
Jun 18, 2025
Viaarxiv icon

Training Neural Networks by Optimizing Neuron Positions

Add code
Jun 16, 2025
Viaarxiv icon

LOP: Learning Optimal Pruning for Efficient On-Demand MLLMs Scaling

Add code
Jun 15, 2025
Viaarxiv icon

Structure-Aware Automatic Channel Pruning by Searching with Graph Embedding

Add code
Jun 13, 2025
Viaarxiv icon

Deep Learning Model Acceleration and Optimization Strategies for Real-Time Recommendation Systems

Add code
Jun 13, 2025
Viaarxiv icon

Machine Unlearning for Robust DNNs: Attribution-Guided Partitioning and Neuron Pruning in Noisy Environments

Add code
Jun 13, 2025
Viaarxiv icon

Hyperpruning: Efficient Search through Pruned Variants of Recurrent Neural Networks Leveraging Lyapunov Spectrum

Add code
Jun 09, 2025
Viaarxiv icon