Picture for Ivan Kobyzev

Ivan Kobyzev

OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection

Add code
Jun 04, 2024
Viaarxiv icon

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Add code
Feb 29, 2024
Figure 1 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Figure 2 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Figure 3 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Figure 4 for Resonance RoPE: Improving Context Length Generalization of Large Language Models
Viaarxiv icon

Hyperparameter Optimization for Large Language Model Instruction-Tuning

Add code
Dec 01, 2023
Viaarxiv icon

Attribute Controlled Dialogue Prompting

Add code
Jul 11, 2023
Figure 1 for Attribute Controlled Dialogue Prompting
Figure 2 for Attribute Controlled Dialogue Prompting
Figure 3 for Attribute Controlled Dialogue Prompting
Figure 4 for Attribute Controlled Dialogue Prompting
Viaarxiv icon

LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization

Add code
May 08, 2023
Figure 1 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Figure 2 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Figure 3 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Figure 4 for LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Viaarxiv icon

Mathematical Challenges in Deep Learning

Add code
Mar 24, 2023
Figure 1 for Mathematical Challenges in Deep Learning
Figure 2 for Mathematical Challenges in Deep Learning
Figure 3 for Mathematical Challenges in Deep Learning
Figure 4 for Mathematical Challenges in Deep Learning
Viaarxiv icon

KronA: Parameter Efficient Tuning with Kronecker Adapter

Add code
Dec 20, 2022
Figure 1 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Figure 2 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Figure 3 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Figure 4 for KronA: Parameter Efficient Tuning with Kronecker Adapter
Viaarxiv icon

Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging

Add code
Dec 16, 2022
Figure 1 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Figure 2 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Figure 3 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Figure 4 for Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Viaarxiv icon

Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization

Add code
Dec 12, 2022
Figure 1 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Figure 2 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Figure 3 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Figure 4 for Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
Viaarxiv icon

DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation

Add code
Oct 14, 2022
Figure 1 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Figure 2 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Figure 3 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Figure 4 for DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Viaarxiv icon