He Cao

Augmenting Molecular Language Models with Local $n$-gram Memory

Jun 10, 2026 (via arXiv)

Improving Cross-Format Robustness in Language Models with Multi-Format Training

Jun 10, 2026 (via arXiv)

From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models

Jun 03, 2026 (via arXiv)

FORGE: Fragment-Oriented Ranking and Generation for Context-Aware Molecular Optimization

May 11, 2026 (via arXiv)

ProteinOPD: Towards Effective and Efficient Preference Alignment for Protein Design

May 11, 2026 (via arXiv)

Mozi: Governed Autonomy for Drug Discovery LLM Agents

Mar 04, 2026 (via arXiv)

Agentic reinforcement learning empowers next-generation chemical language models for molecular design and synthesis

Jan 25, 2026 (via arXiv)

From Static Structures to Ensembles: Studying and Harnessing Protein Structure Tokenization

Nov 13, 2025 (via arXiv)

Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations

May 27, 2025 (via arXiv)

Rethinking Text-based Protein Understanding: Retrieval or LLM?

May 26, 2025 (via arXiv)