Alert button
Picture for Irina Belousova

Irina Belousova

Alert button

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Add code
Bookmark button
Alert button
Feb 16, 2024
Nikhil Bhendawade, Irina Belousova, Qichen Fu, Henry Mason, Mohammad Rastegari, Mahyar Najibi

Viaarxiv icon

Intelligent Assistant Language Understanding On Device

Add code
Bookmark button
Alert button
Aug 07, 2023
Cecilia Aas, Hisham Abdelsalam, Irina Belousova, Shruti Bhargava, Jianpeng Cheng, Robert Daland, Joris Driesen, Federico Flego, Tristan Guigue, Anders Johannsen, Partha Lal, Jiarui Lu, Joel Ruben Antony Moniz, Nathan Perkins, Dhivya Piraviperumal, Stephen Pulman, Diarmuid Ó Séaghdha, David Q. Sun, John Torr, Marco Del Vecchio, Jay Wacker, Jason D. Williams, Hong Yu

Figure 1 for Intelligent Assistant Language Understanding On Device
Figure 2 for Intelligent Assistant Language Understanding On Device
Figure 3 for Intelligent Assistant Language Understanding On Device
Figure 4 for Intelligent Assistant Language Understanding On Device
Viaarxiv icon