Picture for Bhavya Kailkhura

Bhavya Kailkhura

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Add code
Dec 26, 2025
Viaarxiv icon

Forecasting Fails: Unveiling Evasion Attacks in Weather Prediction Models

Add code
Dec 09, 2025
Viaarxiv icon

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Add code
Nov 10, 2025
Viaarxiv icon

Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models

Add code
Sep 30, 2025
Viaarxiv icon

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Add code
Jun 05, 2025
Viaarxiv icon

BOOM: Benchmarking Out-Of-distribution Molecular Property Predictions of Machine Learning Models

Add code
May 03, 2025
Viaarxiv icon

AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security

Add code
Apr 29, 2025
Viaarxiv icon

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Add code
Apr 22, 2025
Viaarxiv icon

LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks

Add code
Apr 16, 2025
Viaarxiv icon

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Add code
Apr 02, 2025
Viaarxiv icon