Alert button
Picture for Li Lyna Zhang

Li Lyna Zhang

Alert button

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Feb 21, 2024
Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, Mao Yang

Viaarxiv icon

Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning

Dec 26, 2023
Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Mao Yang

Viaarxiv icon

Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models

Oct 11, 2023
Song Guo, Jiahang Xu, Li Lyna Zhang, Mao Yang

Viaarxiv icon

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference

Jun 26, 2023
Junyan Li, Li Lyna Zhang, Jiahang Xu, Yujing Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang, Ting Cao, Hao Sun, Weiwei Deng, Qi Zhang, Mao Yang

Figure 1 for Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference
Figure 2 for Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference
Figure 3 for Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference
Figure 4 for Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference
Viaarxiv icon

Accurate and Structured Pruning for Efficient Automatic Speech Recognition

May 31, 2023
Huiqiang Jiang, Li Lyna Zhang, Yuang Li, Yu Wu, Shijie Cao, Ting Cao, Yuqing Yang, Jinyu Li, Mao Yang, Lili Qiu

Figure 1 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Figure 2 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Figure 3 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Figure 4 for Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Viaarxiv icon

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices

Mar 21, 2023
Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang

Figure 1 for ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
Figure 2 for ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
Figure 3 for ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
Figure 4 for ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
Viaarxiv icon

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference

Mar 15, 2023
Li Lyna Zhang, Xudong Wang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang

Figure 1 for SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
Figure 2 for SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
Figure 3 for SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
Figure 4 for SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
Viaarxiv icon

LUT-NN: Towards Unified Neural Network Inference by Table Lookup

Feb 07, 2023
Xiaohu Tang, Yang Wang, Ting Cao, Li Lyna Zhang, Qi Chen, Deng Cai, Yunxin Liu, Mao Yang

Figure 1 for LUT-NN: Towards Unified Neural Network Inference by Table Lookup
Figure 2 for LUT-NN: Towards Unified Neural Network Inference by Table Lookup
Figure 3 for LUT-NN: Towards Unified Neural Network Inference by Table Lookup
Figure 4 for LUT-NN: Towards Unified Neural Network Inference by Table Lookup
Viaarxiv icon

Boosting Mobile CNN Inference through Semantic Memory

Dec 05, 2021
Yun Li, Chen Zhang, Shihao Han, Li Lyna Zhang, Baoqun Yin, Yunxin Liu, Mengwei Xu

Figure 1 for Boosting Mobile CNN Inference through Semantic Memory
Figure 2 for Boosting Mobile CNN Inference through Semantic Memory
Figure 3 for Boosting Mobile CNN Inference through Semantic Memory
Figure 4 for Boosting Mobile CNN Inference through Semantic Memory
Viaarxiv icon

AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing

Aug 06, 2021
Yuge Zhang, Chenqian Yan, Quanlu Zhang, Li Lyna Zhang, Yaming Yang, Xiaotian Gao, Yuqing Yang

Figure 1 for AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing
Figure 2 for AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing
Figure 3 for AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing
Figure 4 for AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing
Viaarxiv icon