Keisuke Kamahori

TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval

Feb 28, 2025
Via arXiv

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Feb 27, 2025
Via arXiv

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models

Feb 10, 2024
Via arXiv