Alert button
Picture for Wonseok Jeon

Wonseok Jeon

Alert button

On Speculative Decoding for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Apr 13, 2024
Mukul Gagrani, Raghavv Goel, Wonseok Jeon, Junyoung Park, Mingu Lee, Christopher Lott

Viaarxiv icon

Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs

Add code
Bookmark button
Alert button
Mar 08, 2024
Raghavv Goel, Mukul Gagrani, Wonseok Jeon, Junyoung Park, Mingu Lee, Christopher Lott

Figure 1 for Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs
Figure 2 for Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs
Figure 3 for Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs
Figure 4 for Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs
Viaarxiv icon

Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement

Add code
Bookmark button
Alert button
Mar 05, 2024
Wonseok Jeon, Mukul Gagrani, Raghavv Goel, Junyoung Park, Mingu Lee, Christopher Lott

Viaarxiv icon

Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions

Add code
Bookmark button
Alert button
Oct 25, 2022
Haanvid Lee, Jongmin Lee, Yunseon Choi, Wonseok Jeon, Byung-Jun Lee, Yung-Kyun Noh, Kee-Eung Kim

Figure 1 for Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Figure 2 for Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Figure 3 for Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Figure 4 for Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Viaarxiv icon

Neural Topological Ordering for Computation Graphs

Add code
Bookmark button
Alert button
Jul 13, 2022
Mukul Gagrani, Corrado Rainone, Yang Yang, Harris Teague, Wonseok Jeon, Herke Van Hoof, Weiliang Will Zeng, Piero Zappi, Christopher Lott, Roberto Bondesan

Figure 1 for Neural Topological Ordering for Computation Graphs
Figure 2 for Neural Topological Ordering for Computation Graphs
Figure 3 for Neural Topological Ordering for Computation Graphs
Figure 4 for Neural Topological Ordering for Computation Graphs
Viaarxiv icon

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Add code
Bookmark button
Alert button
Jun 21, 2021
Jongmin Lee, Wonseok Jeon, Byung-Jun Lee, Joelle Pineau, Kee-Eung Kim

Figure 1 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 2 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 3 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 4 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Viaarxiv icon

Regularized Inverse Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 07, 2020
Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Figure 1 for Regularized Inverse Reinforcement Learning
Figure 2 for Regularized Inverse Reinforcement Learning
Figure 3 for Regularized Inverse Reinforcement Learning
Figure 4 for Regularized Inverse Reinforcement Learning
Viaarxiv icon

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

Add code
Bookmark button
Alert button
Jun 23, 2020
Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

Figure 1 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 2 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 3 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 4 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Viaarxiv icon

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

Add code
Bookmark button
Alert button
Feb 24, 2020
Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau

Figure 1 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Figure 2 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Figure 3 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Figure 4 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Viaarxiv icon