Picture for Taehyeon Kim

Taehyeon Kim

Multi-Drafter Speculative Decoding with Alignment Feedback

Add code
Apr 07, 2026
Viaarxiv icon

MERIT Feedback Elicits Better Bargaining in LLM Negotiators

Add code
Feb 12, 2026
Viaarxiv icon

LLM Agents for Bargaining with Utility-based Feedback

Add code
May 29, 2025
Viaarxiv icon

AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners

Add code
May 22, 2025
Viaarxiv icon

Guiding Reasoning in Small Language Models with LLM Assistance

Add code
Apr 14, 2025
Figure 1 for Guiding Reasoning in Small Language Models with LLM Assistance
Figure 2 for Guiding Reasoning in Small Language Models with LLM Assistance
Figure 3 for Guiding Reasoning in Small Language Models with LLM Assistance
Figure 4 for Guiding Reasoning in Small Language Models with LLM Assistance
Viaarxiv icon

MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation

Add code
Mar 18, 2025
Viaarxiv icon

$C^2$: Scalable Auto-Feedback for LLM-based Chart Generation

Add code
Oct 24, 2024
Figure 1 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Figure 2 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Figure 3 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Figure 4 for $C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Viaarxiv icon

REBEL: Rule-based and Experience-enhanced Learning with LLMs for Initial Task Allocation in Multi-Human Multi-Robot Teams

Add code
Sep 24, 2024
Figure 1 for REBEL: Rule-based and Experience-enhanced Learning with LLMs for Initial Task Allocation in Multi-Human Multi-Robot Teams
Figure 2 for REBEL: Rule-based and Experience-enhanced Learning with LLMs for Initial Task Allocation in Multi-Human Multi-Robot Teams
Figure 3 for REBEL: Rule-based and Experience-enhanced Learning with LLMs for Initial Task Allocation in Multi-Human Multi-Robot Teams
Figure 4 for REBEL: Rule-based and Experience-enhanced Learning with LLMs for Initial Task Allocation in Multi-Human Multi-Robot Teams
Viaarxiv icon

PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning with Multimodal Transformers

Add code
Sep 20, 2024
Figure 1 for PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning with Multimodal Transformers
Figure 2 for PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning with Multimodal Transformers
Figure 3 for PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning with Multimodal Transformers
Figure 4 for PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning with Multimodal Transformers
Viaarxiv icon

Adaptive Task Allocation in Multi-Human Multi-Robot Teams under Team Heterogeneity and Dynamic Information Uncertainty

Add code
Sep 20, 2024
Figure 1 for Adaptive Task Allocation in Multi-Human Multi-Robot Teams under Team Heterogeneity and Dynamic Information Uncertainty
Figure 2 for Adaptive Task Allocation in Multi-Human Multi-Robot Teams under Team Heterogeneity and Dynamic Information Uncertainty
Figure 3 for Adaptive Task Allocation in Multi-Human Multi-Robot Teams under Team Heterogeneity and Dynamic Information Uncertainty
Figure 4 for Adaptive Task Allocation in Multi-Human Multi-Robot Teams under Team Heterogeneity and Dynamic Information Uncertainty
Viaarxiv icon