Abstract: LLM providers typically offer multiple LLM tiers that vary in performance and price. As NLP tasks become more complex and modularized, selecting the suitable LLM tier for each subtask is a key challenge in balancing cost and performance. To address this problem, we introduce the LLM Automatic Transmission (LLM-AT) framework, which automatically selects LLM tiers without training. LLM-AT consists of a Starter, a Generator, and a Judge. The Starter selects the initial LLM tier expected to solve the given question, the Generator produces a response using the LLM of the selected tier, and the Judge evaluates the validity of the response. If the response is invalid, LLM-AT iteratively upgrades to a higher-tier model, generates a new response, and re-evaluates until a valid response is obtained. Additionally, we propose an accuracy estimator, which enables suitable initial-tier selection without training. Given an input question, the accuracy estimator estimates the expected accuracy of each LLM tier by computing the valid response rate across the top-k most similar queries in past inference records. Experiments demonstrate that LLM-AT achieves superior performance while reducing costs, making it a practical solution for real-world applications.
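As a concrete illustration of the pipeline described above, here is a minimal Python sketch of the Starter/Generator/Judge loop and the accuracy estimator. The tier names, the token-overlap similarity (a stand-in for whatever query similarity the framework actually uses), the 0.8 threshold, and the `call_llm` / `judge_is_valid` stubs are all illustrative assumptions, not the paper's implementation.

```python
# Hypothetical sketch of LLM-AT; not the authors' code.
TIERS = ["tier-1-cheap", "tier-2", "tier-3", "tier-4-flagship"]  # cheap -> expensive (assumed)

def similarity(q1: str, q2: str) -> float:
    """Stand-in for the framework's query similarity (e.g., embedding cosine):
    here, simple Jaccard overlap of lowercased tokens."""
    a, b = set(q1.lower().split()), set(q2.lower().split())
    return len(a & b) / len(a | b) if a | b else 0.0

def estimate_accuracy(question: str, tier: str, records: list[dict], k: int = 5) -> float:
    """Accuracy estimator: valid-response rate of `tier` over the top-k past
    queries most similar to `question` in the inference records."""
    past = [r for r in records if r["tier"] == tier]
    top_k = sorted(past, key=lambda r: similarity(question, r["question"]), reverse=True)[:k]
    if not top_k:
        return 0.0  # no history for this tier yet; treat it as untrusted (sketch choice)
    return sum(r["valid"] for r in top_k) / len(top_k)

def call_llm(tier: str, question: str) -> str:
    # Placeholder Generator: replace with a real provider call at the given tier.
    return f"[{tier}] answer to: {question}"

def judge_is_valid(question: str, response: str) -> bool:
    # Placeholder Judge: replace with an LLM-based validity check.
    return True

def llm_at(question: str, records: list[dict], threshold: float = 0.8) -> str:
    # Starter: cheapest tier whose estimated accuracy clears the threshold;
    # fall back to the highest tier if none does.
    start = next((t for t in TIERS
                  if estimate_accuracy(question, t, records) >= threshold), TIERS[-1])
    # Generator + Judge: escalate tier by tier until a response is judged valid.
    response = ""
    for tier in TIERS[TIERS.index(start):]:
        response = call_llm(tier, question)
        if judge_is_valid(question, response):
            return response
    return response  # highest tier's answer if nothing was judged valid

if __name__ == "__main__":
    records = [
        {"question": "what is 2 + 2", "tier": "tier-1-cheap", "valid": True},
        {"question": "prove fermat's last theorem", "tier": "tier-1-cheap", "valid": False},
    ]
    print(llm_at("what is 3 + 3", records))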
Abstract: End-to-end automatic speech recognition (E2E ASR) systems have significantly improved speech recognition through training on extensive datasets. Despite these advancements, they still struggle to accurately recognize domain-specific words, such as proper nouns and technical terms. To address this problem, we propose a method that utilizes the state-of-the-art Whisper model without modifying its architecture, preserving its generalization performance while enabling it to leverage descriptions effectively. Moreover, we propose two additional training techniques to improve domain-specific ASR: decoder fine-tuning and context perturbation. We also propose a method that uses a Large Language Model (LLM) to generate descriptions from simple metadata when descriptions are unavailable. Our experiments demonstrate that the proposed methods notably enhance domain-specific ASR accuracy on real-life datasets, with LLM-generated descriptions outperforming human-crafted ones in effectiveness.
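One plausible way to realize "leveraging descriptions without modifying the architecture" is to feed the description through Whisper's existing `initial_prompt` decoding option, as the sketch below does with the openai-whisper package. Whether the paper conditions on descriptions in exactly this way is an assumption, as are the audio path and the example description text.

```python
# Hedged illustration, not the paper's code: bias a stock Whisper checkpoint
# toward domain-specific terms by passing a description as the decoder prompt.
import whisper

model = whisper.load_model("small")  # any stock Whisper checkpoint

# The description could be human-written, or generated by an LLM from simple
# metadata (e.g., talk title and venue), as the abstract describes.
description = (
    "A talk about end-to-end ASR covering Whisper, decoder fine-tuning, "
    "and proper nouns such as LibriSpeech and Conformer."
)

result = model.transcribe("talk.wav", initial_prompt=description)  # placeholder path
print(result["text"])
```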