Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tilman Hartwig

Comparing energy consumption and accuracy in text classification inference

Aug 19, 2025

Johannes Zschache, Tilman Hartwig

Abstract:The increasing deployment of large language models (LLMs) in natural language processing (NLP) tasks raises concerns about energy efficiency and sustainability. While prior research has largely focused on energy consumption during model training, the inference phase has received comparatively less attention. This study systematically evaluates the trade-offs between model accuracy and energy consumption in text classification inference across various model architectures and hardware configurations. Our empirical analysis shows that the best-performing model in terms of accuracy can also be energy-efficient, while larger LLMs tend to consume significantly more energy with lower classification accuracy. We observe substantial variability in inference energy consumption ($<$mWh to $>$kWh), influenced by model type, model size, and hardware specifications. Additionally, we find a strong correlation between inference energy consumption and model runtime, indicating that execution time can serve as a practical proxy for energy usage in settings where direct measurement is not feasible. These findings have implications for sustainable AI development, providing actionable insights for researchers, industry practitioners, and policymakers seeking to balance performance and resource efficiency in NLP applications.

* Key results in Figure 1, submitted to Nature Communications, 25 pages

Via

Access Paper or Ask Questions

Neural Networks Fail to Learn Periodic Functions and How to Fix It

Jun 15, 2020

Liu Ziyin, Tilman Hartwig, Masahito Ueda

Figure 1 for Neural Networks Fail to Learn Periodic Functions and How to Fix It

Figure 2 for Neural Networks Fail to Learn Periodic Functions and How to Fix It

Figure 3 for Neural Networks Fail to Learn Periodic Functions and How to Fix It

Figure 4 for Neural Networks Fail to Learn Periodic Functions and How to Fix It

Abstract:Previous literature offers limited clues on how to learn a periodic function using modern neural networks. We start with a study of the extrapolation properties of neural networks; we prove and demonstrate experimentally that the standard activations functions, such as ReLU, tanh, sigmoid, along with their variants, all fail to learn to extrapolate simple periodic functions. We hypothesize that this is due to their lack of a "periodic" inductive bias. As a fix of this problem, we propose a new activation, namely, $x + \sin^2(x)$, which achieves the desired periodic inductive bias to learn a periodic function while maintaining a favorable optimization property of the ReLU-based activations. Experimentally, we apply the proposed method to temperature and financial data prediction.

Via

Access Paper or Ask Questions