Alert button
Picture for Nikos Karampatziakis

Nikos Karampatziakis

Alert button

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Add code
Bookmark button
Alert button
Oct 23, 2023
Yixiao Li, Yifan Yu, Chen Liang, Pengcheng He, Nikos Karampatziakis, Weizhu Chen, Tuo Zhao

Figure 1 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 2 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 3 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 4 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Viaarxiv icon

Meet in the Middle: A New Pre-training Paradigm

Add code
Bookmark button
Alert button
Mar 13, 2023
Anh Nguyen, Nikos Karampatziakis, Weizhu Chen

Figure 1 for Meet in the Middle: A New Pre-training Paradigm
Figure 2 for Meet in the Middle: A New Pre-training Paradigm
Figure 3 for Meet in the Middle: A New Pre-training Paradigm
Figure 4 for Meet in the Middle: A New Pre-training Paradigm
Viaarxiv icon

Anytime-valid off-policy inference for contextual bandits

Add code
Bookmark button
Alert button
Oct 19, 2022
Ian Waudby-Smith, Lili Wu, Aaditya Ramdas, Nikos Karampatziakis, Paul Mineiro

Figure 1 for Anytime-valid off-policy inference for contextual bandits
Figure 2 for Anytime-valid off-policy inference for contextual bandits
Figure 3 for Anytime-valid off-policy inference for contextual bandits
Figure 4 for Anytime-valid off-policy inference for contextual bandits
Viaarxiv icon

Contextual Bandit Applications in Customer Support Bot

Add code
Bookmark button
Alert button
Dec 06, 2021
Sandra Sajeev, Jade Huang, Nikos Karampatziakis, Matthew Hall, Sebastian Kochman, Weizhu Chen

Figure 1 for Contextual Bandit Applications in Customer Support Bot
Figure 2 for Contextual Bandit Applications in Customer Support Bot
Figure 3 for Contextual Bandit Applications in Customer Support Bot
Figure 4 for Contextual Bandit Applications in Customer Support Bot
Viaarxiv icon

Off-policy Confidence Sequences

Add code
Bookmark button
Alert button
Feb 18, 2021
Nikos Karampatziakis, Paul Mineiro, Aaditya Ramdas

Figure 1 for Off-policy Confidence Sequences
Figure 2 for Off-policy Confidence Sequences
Figure 3 for Off-policy Confidence Sequences
Figure 4 for Off-policy Confidence Sequences
Viaarxiv icon

Empirical Likelihood for Contextual Bandits

Add code
Bookmark button
Alert button
Jun 21, 2019
Nikos Karampatziakis, John Langford, Paul Mineiro

Figure 1 for Empirical Likelihood for Contextual Bandits
Figure 2 for Empirical Likelihood for Contextual Bandits
Figure 3 for Empirical Likelihood for Contextual Bandits
Figure 4 for Empirical Likelihood for Contextual Bandits
Viaarxiv icon

Lessons from Real-World Reinforcement Learning in a Customer Support Bot

Add code
Bookmark button
Alert button
May 06, 2019
Nikos Karampatziakis, Sebastian Kochman, Jade Huang, Paul Mineiro, Kathy Osborne, Weizhu Chen

Figure 1 for Lessons from Real-World Reinforcement Learning in a Customer Support Bot
Figure 2 for Lessons from Real-World Reinforcement Learning in a Customer Support Bot
Figure 3 for Lessons from Real-World Reinforcement Learning in a Customer Support Bot
Figure 4 for Lessons from Real-World Reinforcement Learning in a Customer Support Bot
Viaarxiv icon