Picture for Daniel Wontae Nam

Daniel Wontae Nam

Binary Classifier Optimization for Large Language Model Alignment

Add code
Apr 06, 2024
Viaarxiv icon

Hexa: Self-Improving for Knowledge-Grounded Dialogue System

Add code
Oct 22, 2023
Viaarxiv icon

Effortless Integration of Memory Management into Open-Domain Conversation Systems

Add code
May 23, 2023
Figure 1 for Effortless Integration of Memory Management into Open-Domain Conversation Systems
Figure 2 for Effortless Integration of Memory Management into Open-Domain Conversation Systems
Figure 3 for Effortless Integration of Memory Management into Open-Domain Conversation Systems
Viaarxiv icon

LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

Add code
Oct 11, 2022
Figure 1 for LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
Figure 2 for LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
Figure 3 for LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
Figure 4 for LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
Viaarxiv icon

GMAC: A Distributional Perspective on Actor-Critic Framework

Add code
May 24, 2021
Figure 1 for GMAC: A Distributional Perspective on Actor-Critic Framework
Figure 2 for GMAC: A Distributional Perspective on Actor-Critic Framework
Figure 3 for GMAC: A Distributional Perspective on Actor-Critic Framework
Figure 4 for GMAC: A Distributional Perspective on Actor-Critic Framework
Viaarxiv icon