Picture for Jan-Jan Wu

Jan-Jan Wu

SAP: Syntactic Attention Pruning for Transformer-based Language Models

Add code
Dec 22, 2025
Viaarxiv icon

GPU Memory Usage Optimization for Backward Propagation in Deep Network Training

Add code
Feb 18, 2025
Viaarxiv icon