Ravi Netravali

Princeton University

Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

Aug 09, 2025
Via arXiv

Legilimens: Performant Video Analytics on the System-on-Chip Edge

Apr 29, 2025
Via arXiv

Guillotine: Hypervisors for Isolating Malicious AIs

Apr 22, 2025
Via arXiv

SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning

Apr 10, 2025
Via arXiv

RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation

Dec 13, 2024
Via arXiv

Marconi: Prefix Caching for the Era of Hybrid LLMs

Nov 28, 2024
Via arXiv

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

Dec 08, 2023
Via arXiv

MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations

Apr 04, 2023
Via arXiv

Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection

Jun 07, 2022
Via arXiv

Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs

Apr 26, 2022
Via arXiv