Liangzhen Lai

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Feb 22, 2024
Zechun Liu, Changsheng Zhao, Forrest Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra

Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition

Feb 20, 2024
Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

Sep 21, 2023
Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Ernie Chang, Yangyang Shi, Vikas Chandra

SDRM3: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

Dec 07, 2022
Seah Kim, Hyoukjun Kwon, Jinook Song, Jihyuck Jo, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra

XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse

Nov 16, 2022
Hyoukjun Kwon, Krishnakumar Nair, Jamin Seo, Jason Yik, Debabrata Mohapatra, Dongyuan Zhan, Jinook Song, Peter Capak, Peizhao Zhang, Peter Vajda, Colby Banbury, Mark Mazumder, Liangzhen Lai, Ashish Sirasao, Tushar Krishna, Harshit Khaitan, Vikas Chandra, Vijay Janapa Reddi

Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation

Nov 23, 2021
Jiaqi Gu, Hyoukjun Kwon, Dilin Wang, Wei Ye, Meng Li, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra, David Z. Pan

Low-Rank+Sparse Tensor Compression for Neural Networks

Nov 02, 2021
Cole Hawkins, Haichuan Yang, Meng Li, Liangzhen Lai, Vikas Chandra

HRViT: Multi-Scale High-Resolution Vision Transformer

Nov 01, 2021
Jiaqi Gu, Hyoukjun Kwon, Dilin Wang, Wei Ye, Meng Li, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra, David Z. Pan

Improving Efficiency in Neural Network Accelerator Using Operands Hamming Distance optimization

Feb 13, 2020
Meng Li, Yilei Li, Pierce Chuang, Liangzhen Lai, Vikas Chandra
