Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models



Xiaofan Zhang , Zongwei Zhou , Deming Chen , Yu Emma Wang


   Access Paper or Ask Questions

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts



Nan Du , Yanping Huang , Andrew M. Dai , Simon Tong , Dmitry Lepikhin , Yuanzhong Xu , Maxim Krikun , Yanqi Zhou , Adams Wei Yu , Orhan Firat , Barret Zoph , Liam Fedus , Maarten Bosma , Zongwei Zhou , Tao Wang , Yu Emma Wang , Kellie Webster , Marie Pellat , Kevin Robinson , Kathy Meier-Hellstern , Toju Duke , Lucas Dixon , Kun Zhang , Quoc V Le , Yonghui Wu , Zhifeng Chen , Claire Cui


   Access Paper or Ask Questions

Exploring the limits of Concurrency in ML Training on Google TPUs



Sameer Kumar , James Bradbury , Cliff Young , Yu Emma Wang , Anselm Levskaya , Blake Hechtman , Dehao Chen , HyoukJoong Lee , Mehmet Deveci , Naveen Kumar , Pankaj Kanwar , Shibo Wang , Skye Wanderman-Milne , Steve Lacy , Tao Wang , Tayo Oguntebi , Yazhou Zu , Yuanzhong Xu , Andy Swing


   Access Paper or Ask Questions

Exploiting Parallelism Opportunities with Deep Learning Frameworks



Yu Emma Wang , Carole-Jean Wu , Xiaodong Wang , Kim Hazelwood , David Brooks


   Access Paper or Ask Questions

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning



Yu Emma Wang , Gu-Yeon Wei , David Brooks


   Access Paper or Ask Questions