Picture for Prasanna Pendse

Prasanna Pendse

The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models

Add code
Nov 18, 2025
Figure 1 for The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models
Figure 2 for The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models
Figure 3 for The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models
Figure 4 for The Tokenization Bottleneck: How Vocabulary Extension Improves Chemistry Representation Learning in Pretrained Language Models
Viaarxiv icon

Towards Transparent AI Grading: Semantic Entropy as a Signal for Human-AI Disagreement

Add code
Aug 06, 2025
Viaarxiv icon