Bhavya Kailkhura

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning

Apr 15, 2026

Improving Robustness In Sparse Autoencoders via Masked Regularization

Apr 07, 2026

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Dec 26, 2025

Forecasting Fails: Unveiling Evasion Attacks in Weather Prediction Models

Dec 09, 2025

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Nov 10, 2025

Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models

Sep 30, 2025

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Jun 05, 2025

BOOM: Benchmarking Out-Of-distribution Molecular Property Predictions of Machine Learning Models

May 03, 2025

AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security

Apr 29, 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025