Daejin Jo

SGPO: Self-Generated Preference Optimization based on Self-Improver

Jul 27, 2025

Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM

May 26, 2025

TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback

Jul 23, 2024

Hexa: Self-Improving for Knowledge-Grounded Dialogue System

Oct 22, 2023

Effortless Integration of Memory Management into Open-Domain Conversation Systems

May 23, 2023

MAGVLT: Masked Generative Vision-and-Language Transformer

Mar 21, 2023

LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward

Oct 11, 2022

Selective Token Generation for Few-shot Natural Language Generation

Sep 17, 2022

Insights From the NeurIPS 2021 NetHack Challenge

Mar 22, 2022