Ion Stoica

RouteLLM: Learning to Route LLMs with Preference Data

Jun 26, 2024 (via arXiv)

Optimizing Speculative Decoding for Serving Large Language Models Using Goodput

Jun 20, 2024 (via arXiv)

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Jun 17, 2024 (via arXiv)

OR-Bench: An Over-Refusal Benchmark for Large Language Models

May 31, 2024 (via arXiv)

Crafting Interpretable Embeddings by Asking LLMs Questions

May 26, 2024 (via arXiv)

Stylus: Automatic Adapter Selection for Diffusion Models

Apr 29, 2024 (via arXiv)

Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity

Apr 22, 2024 (via arXiv)

GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

Apr 10, 2024 (via arXiv)

Trustless Audits without Revealing Data or Models

Apr 06, 2024 (via arXiv)

RAFT: Adapting Language Model to Domain Specific RAG

Mar 15, 2024 (via arXiv)