Alan Akbik

Pre-Training Curriculum for Multi-Token Prediction in Language Models

May 28, 2025

Evaluating Design Decisions for Dual Encoder-based Entity Disambiguation

May 16, 2025

Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models

Apr 19, 2025

MastermindEval: A Simple But Scalable Reasoning Benchmark

Mar 11, 2025

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models

Dec 20, 2024

Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data

Dec 13, 2024

Don't Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation LLM

Nov 22, 2024

Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning

Oct 19, 2024

TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks

Sep 09, 2024

LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models

Aug 28, 2024