Picture for Sahger Lad

Sahger Lad

The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models

Add code
Nov 18, 2025
Viaarxiv icon