Alert button
Picture for Ivan Kobyzev

Ivan Kobyzev

Alert button

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Suyuchen Wang, Ivan Kobyzev, Peng Lu, Mehdi Rezagholizadeh, Bang Liu

Figure 1 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Figure 2 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Figure 3 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Figure 4 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Viaarxiv icon

Hyperparameter Optimization for Large Language Model Instruction-Tuning

Add code
Bookmark button
Alert button
Dec 01, 2023
Christophe Tribes, Sacha Benarroch-Lelong, Peng Lu, Ivan Kobyzev

Viaarxiv icon

Attribute Controlled Dialogue Prompting

Add code
Bookmark button
Alert button
Jul 11, 2023
Runcheng Liu, Ahmad Rashid, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart

Figure 1 for Attribute Controlled Dialogue Prompting
Figure 2 for Attribute Controlled Dialogue Prompting
Figure 3 for Attribute Controlled Dialogue Prompting
Figure 4 for Attribute Controlled Dialogue Prompting
Viaarxiv icon

LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization

Add code
Bookmark button
Alert button
May 08, 2023
Peng Lu, Ahmad Rashid, Ivan Kobyzev, Mehdi Rezagholizadeh, Philippe Langlais

Figure 1 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Figure 2 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Figure 3 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Figure 4 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Viaarxiv icon

Mathematical Challenges in Deep Learning

Add code
Bookmark button
Alert button
Mar 24, 2023
Vahid Partovi Nia, Guojun Zhang, Ivan Kobyzev, Michael R. Metel, Xinlin Li, Ke Sun, Sobhan Hemati, Masoud Asgharian, Linglong Kong, Wulong Liu, Boxing Chen

Figure 1 for Mathematical Challenges in Deep Learning
Figure 2 for Mathematical Challenges in Deep Learning
Figure 3 for Mathematical Challenges in Deep Learning
Figure 4 for Mathematical Challenges in Deep Learning
Viaarxiv icon

KronA: Parameter Efficient Tuning with Kronecker Adapter

Add code
Bookmark button
Alert button
Dec 20, 2022
Ali Edalati, Marzieh Tahaei, Ivan Kobyzev, Vahid Partovi Nia, James J. Clark, Mehdi Rezagholizadeh

Figure 1 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Figure 2 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Figure 3 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Figure 4 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Viaarxiv icon

Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging

Add code
Bookmark button
Alert button
Dec 16, 2022
Peng Lu, Ivan Kobyzev, Mehdi Rezagholizadeh, Ahmad Rashid, Ali Ghodsi, Philippe Langlais

Figure 1 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Figure 2 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Figure 3 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Figure 4 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Viaarxiv icon

Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization

Add code
Bookmark button
Alert button
Dec 12, 2022
Aref Jafari, Ivan Kobyzev, Mehdi Rezagholizadeh, Pascal Poupart, Ali Ghodsi

Figure 1 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Figure 2 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Figure 3 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Figure 4 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Viaarxiv icon

DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation

Add code
Bookmark button
Alert button
Oct 14, 2022
Mojtaba Valipour, Mehdi Rezagholizadeh, Ivan Kobyzev, Ali Ghodsi

Figure 1 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Figure 2 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Figure 3 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Figure 4 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Viaarxiv icon