Oya Deniz Beyan

Fraunhofer Institute for Applied Information Technology FIT, University Hospital of Cologne

EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning

May 05, 2025

Lingxiao Kong, Cong Yang, Susanne Neufang, Oya Deniz Beyan, Zeyd Boukhers

Abstract: Recent advances in reinforcement learning (RL) for large language model (LLM) fine-tuning show promise in addressing multi-objective tasks but still face significant challenges, including complex objective balancing, low training efficiency, poor scalability, and limited explainability. Leveraging ensemble learning principles, we introduce an Ensemble Multi-Objective RL (EMORL) framework that fine-tunes multiple models with individual objectives and optimizes their aggregation after training to improve efficiency and flexibility. Our method is the first to aggregate the last hidden states of individual models, incorporating contextual information from multiple objectives. This approach is supported by a hierarchical grid search algorithm that identifies optimal weighted combinations. We evaluate EMORL on counselor reflection generation tasks, using text-scoring LLMs to evaluate the generations and provide rewards during RL fine-tuning. Through comprehensive experiments on the PAIR and Psych8k datasets, we demonstrate the advantages of EMORL over existing baselines: significantly lower and more stable training consumption ($17,529\pm 1,650$ data points and $6,573\pm 147.43$ seconds), improved scalability and explainability, and comparable performance across multiple objectives.
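The abstract describes two mechanisms: a weighted aggregation of the last hidden states of several single-objective models, and a hierarchical grid search over the weights. The following is only a minimal sketch of that idea, not the authors' implementation. The names `aggregate`, `hierarchical_grid_search`, and `score_fn`, the weight range [0, 1], the coarse-to-fine refinement scheme, and the toy scoring function are all assumptions made for illustration; in a real setting the score would presumably come from evaluating generated text against each objective.

```python
import itertools
import numpy as np

def aggregate(hidden_states, weights):
    """Weighted sum of per-model last hidden states (each of shape (seq_len, dim))."""
    return sum(w * h for w, h in zip(weights, hidden_states))

def hierarchical_grid_search(score_fn, n_models, levels=3, points=5):
    """Coarse-to-fine grid search over weights in [0, 1]^n_models (assumed scheme)."""
    low = np.zeros(n_models)
    high = np.ones(n_models)
    best_w, best_s = None, -np.inf
    for _ in range(levels):
        # Build a regular grid inside the current search box.
        axes = [np.linspace(l, h, points) for l, h in zip(low, high)]
        for cand in itertools.product(*axes):
            w = np.array(cand)
            s = score_fn(w)
            if s > best_s:
                best_w, best_s = w, s
        # Shrink the box to one grid step around the best point found so far.
        step = (high - low) / (points - 1)
        low = np.clip(best_w - step, 0.0, 1.0)
        high = np.clip(best_w + step, 0.0, 1.0)
    return best_w, best_s

# Toy usage: the "ideal" weights are a hypothetical target, not a result from the paper.
target = np.array([0.3, 0.7, 0.5])
best_w, best_s = hierarchical_grid_search(
    lambda w: -np.sum((w - target) ** 2), n_models=3
)
print(best_w, best_s)
```

Each level evaluates `points ** n_models` candidates, so refining a small box repeatedly is cheaper than a single fine grid over the whole range, which is the usual motivation for a hierarchical search.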

* 13 pages, 9 figures, submitted to SIGDIAL 2025 conference

Recurrent Deep Embedding Networks for Genotype Clustering and Ethnicity Prediction

May 30, 2018

Md. Rezaul Karim, Michael Cochez, Oya Deniz Beyan, Achille Zappa, Ratnesh Sahay, Stefan Decker, Dietrich Rebholz-Schuhmann

Abstract: Understanding variations in genome sequences helps us identify people who are predisposed to common diseases, solve rare diseases, and find the population group of an individual within a larger population. Although classical machine learning techniques allow researchers to identify groups (i.e., clusters) of related variables, the accuracy and effectiveness of these methods diminish for large, high-dimensional datasets such as the whole human genome. In contrast, deep neural network architectures (the core of deep learning) can better exploit large-scale datasets to build complex models. In this paper, we use the K-means clustering approach for scalable genomic data analysis, aiming to cluster genotypic variants at the population scale. We then train a deep belief network (DBN) to predict geographic ethnicity. We use genotype data from the 1000 Genomes Project, which covers the results of genome sequencing for 2504 individuals from 26 different ethnic origins and comprises 84 million variants. Our experimental results, with a focus on accuracy and scalability, show the effectiveness and superiority of our approach compared to the state of the art.
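As a rough illustration of the K-means step mentioned in the abstract, the sketch below clusters a tiny hand-made genotype matrix. It is not the paper's pipeline: the encoding (rows are individuals, columns are variants, values are alternate-allele counts 0, 1, or 2), the farthest-first initialization, and the data itself are assumptions chosen to keep the example small and deterministic.

```python
import numpy as np

def farthest_first_init(X, k):
    """Deterministic init: start at X[0], then repeatedly add the farthest point."""
    centers = [X[0]]
    for _ in range(k - 1):
        d = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d.argmax()])
    return np.array(centers, dtype=float)

def kmeans(X, k, n_iter=50):
    """Plain Lloyd's algorithm with squared Euclidean distance."""
    centers = farthest_first_init(X, k)
    for _ in range(n_iter):
        # Assign every individual to its nearest center.
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Recompute centers; keep the old center if a cluster is empty.
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers

# Hypothetical genotypes: 6 individuals x 4 variants (alternate-allele counts).
X = np.array([
    [0, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
    [2, 2, 1, 2],
    [2, 1, 2, 2],
    [2, 2, 2, 2],
])
labels, centers = kmeans(X, k=2)
print(labels)
```

At the scale described in the abstract (2504 individuals and 84 million variants), the dense distance computation used here would not be practical; a distributed or mini-batch variant would be needed.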

* This article is based on a workshop paper discussed at the ESWC workshop on Semantic Web Solutions for Large-scale Biomedical Data Analytics (SeWeBMeDA), in Slovenia, May 28-29, 2017. The authors would like to thank the anonymous reviewers for their useful comments which helped us to further extend and improve the draft. Pages: 23
