Picture for Ned Letcher

Ned Letcher

The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models

Add code
Nov 18, 2025
Viaarxiv icon