Branislav Kveton

Learning to Reason in LLMs by Expectation Maximization

Dec 23, 2025

Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning

Jun 08, 2025

LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles

Jun 06, 2025

A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

May 20, 2025

FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain

May 20, 2025

RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces

Apr 29, 2025

Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models

Mar 20, 2025

Active Learning for Direct Preference Optimization

Mar 03, 2025

An Efficient Plugin Method for Metric Optimization of Black-Box Models

Mar 03, 2025

From Selection to Generation: A Survey of LLM-based Active Learning

Feb 17, 2025