Picture for Hyokun Yun

Hyokun Yun

Aligning Large Language Models with Implicit Preferences from User-Generated Content

Add code
Jun 04, 2025
Viaarxiv icon

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users

Add code
Apr 14, 2025
Viaarxiv icon

Evolutionary Contrastive Distillation for Language Model Alignment

Add code
Oct 10, 2024
Figure 1 for Evolutionary Contrastive Distillation for Language Model Alignment
Figure 2 for Evolutionary Contrastive Distillation for Language Model Alignment
Figure 3 for Evolutionary Contrastive Distillation for Language Model Alignment
Figure 4 for Evolutionary Contrastive Distillation for Language Model Alignment
Viaarxiv icon

Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment

Add code
Jul 08, 2024
Figure 1 for Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment
Figure 2 for Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment
Figure 3 for Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment
Figure 4 for Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment
Viaarxiv icon

Robust Multi-Task Learning with Excess Risks

Add code
Feb 14, 2024
Viaarxiv icon

Threshold-aware Learning to Generate Feasible Solutions for Mixed Integer Programs

Add code
Aug 01, 2023
Viaarxiv icon

MICO: Selective Search with Mutual Information Co-training

Add code
Sep 09, 2022
Figure 1 for MICO: Selective Search with Mutual Information Co-training
Figure 2 for MICO: Selective Search with Mutual Information Co-training
Figure 3 for MICO: Selective Search with Mutual Information Co-training
Figure 4 for MICO: Selective Search with Mutual Information Co-training
Viaarxiv icon

A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone

Add code
Dec 31, 2021
Figure 1 for A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone
Figure 2 for A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone
Figure 3 for A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone
Figure 4 for A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone
Viaarxiv icon

Tiering as a Stochastic Submodular Optimization Problem

Add code
May 16, 2020
Figure 1 for Tiering as a Stochastic Submodular Optimization Problem
Figure 2 for Tiering as a Stochastic Submodular Optimization Problem
Figure 3 for Tiering as a Stochastic Submodular Optimization Problem
Figure 4 for Tiering as a Stochastic Submodular Optimization Problem
Viaarxiv icon