Picture for Dzmitry Bahdanau

Dzmitry Bahdanau

BRIDGE: Predicting Human Task Completion Time From Model Performance

Add code
Feb 06, 2026
Viaarxiv icon

How to Get Your LLM to Generate Challenging Problems for Evaluation

Add code
Feb 20, 2025
Figure 1 for How to Get Your LLM to Generate Challenging Problems for Evaluation
Figure 2 for How to Get Your LLM to Generate Challenging Problems for Evaluation
Figure 3 for How to Get Your LLM to Generate Challenging Problems for Evaluation
Figure 4 for How to Get Your LLM to Generate Challenging Problems for Evaluation
Viaarxiv icon

TapeAgents: a Holistic Framework for Agent Development and Optimization

Add code
Dec 11, 2024
Figure 1 for TapeAgents: a Holistic Framework for Agent Development and Optimization
Figure 2 for TapeAgents: a Holistic Framework for Agent Development and Optimization
Figure 3 for TapeAgents: a Holistic Framework for Agent Development and Optimization
Figure 4 for TapeAgents: a Holistic Framework for Agent Development and Optimization
Viaarxiv icon

NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator

Add code
Oct 03, 2024
Figure 1 for NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator
Figure 2 for NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator
Figure 3 for NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator
Figure 4 for NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator
Viaarxiv icon

LLMs can learn self-restraint through iterative self-reflection

Add code
May 15, 2024
Figure 1 for LLMs can learn self-restraint through iterative self-reflection
Figure 2 for LLMs can learn self-restraint through iterative self-reflection
Figure 3 for LLMs can learn self-restraint through iterative self-reflection
Figure 4 for LLMs can learn self-restraint through iterative self-reflection
Viaarxiv icon

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Add code
Apr 09, 2024
Figure 1 for LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Figure 2 for LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Figure 3 for LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Figure 4 for LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Viaarxiv icon

Evaluating In-Context Learning of Libraries for Code Generation

Add code
Nov 16, 2023
Viaarxiv icon

PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation

Add code
Oct 22, 2023
Figure 1 for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Figure 2 for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Figure 3 for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Figure 4 for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Viaarxiv icon

MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations

Add code
Oct 18, 2023
Figure 1 for MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Figure 2 for MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Figure 3 for MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Figure 4 for MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Viaarxiv icon

In-Context Learning for Text Classification with Many Labels

Add code
Sep 19, 2023
Figure 1 for In-Context Learning for Text Classification with Many Labels
Figure 2 for In-Context Learning for Text Classification with Many Labels
Figure 3 for In-Context Learning for Text Classification with Many Labels
Figure 4 for In-Context Learning for Text Classification with Many Labels
Viaarxiv icon