Picture for Benjamin Feuer

Benjamin Feuer

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

Towards Large Reasoning Models for Agriculture

Add code
May 25, 2025
Viaarxiv icon

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Add code
Jan 30, 2025
Viaarxiv icon

Hidden in the Noise: Two-Stage Robust Watermarking for Images

Add code
Dec 05, 2024
Viaarxiv icon

SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification

Add code
Oct 07, 2024
Figure 1 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Figure 2 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Figure 3 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Figure 4 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Viaarxiv icon

Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

Add code
Jun 25, 2024
Figure 1 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Figure 2 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Figure 3 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Figure 4 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Viaarxiv icon

TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

Add code
Feb 17, 2024
Figure 1 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Figure 2 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Figure 3 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Figure 4 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Viaarxiv icon

Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks

Add code
Nov 17, 2023
Viaarxiv icon

Exploring Dataset-Scale Indicators of Data Quality

Add code
Nov 07, 2023
Figure 1 for Exploring Dataset-Scale Indicators of Data Quality
Figure 2 for Exploring Dataset-Scale Indicators of Data Quality
Figure 3 for Exploring Dataset-Scale Indicators of Data Quality
Figure 4 for Exploring Dataset-Scale Indicators of Data Quality
Viaarxiv icon

ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models

Add code
Nov 06, 2023
Figure 1 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Figure 2 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Figure 3 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Figure 4 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Viaarxiv icon