Picture for Karthikeyan Shanmugam

Karthikeyan Shanmugam

Regret minimization in Linear Bandits with offline data via extended D-optimal exploration

Add code
Aug 13, 2025
Viaarxiv icon

Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo

Add code
Aug 11, 2025
Viaarxiv icon

Path-specific effects for pulse-oximetry guided decisions in critical care

Add code
Jun 14, 2025
Viaarxiv icon

Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms

Add code
Apr 30, 2025
Viaarxiv icon

Representation Learning Preserving Ignorability and Covariate Matching for Treatment Effects

Add code
Apr 29, 2025
Viaarxiv icon

Risk-sensitive Bandits: Arm Mixture Optimality and Regret-efficient Algorithms

Add code
Mar 11, 2025
Viaarxiv icon

Online Bidding under RoS Constraints without Knowing the Value

Add code
Mar 05, 2025
Viaarxiv icon

Interleaved Gibbs Diffusion for Constrained Generation

Add code
Feb 19, 2025
Viaarxiv icon

Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?

Add code
Dec 04, 2024
Figure 1 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Figure 2 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Figure 3 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Figure 4 for Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?
Viaarxiv icon

Time-Reversal Provides Unsupervised Feedback to LLMs

Add code
Dec 03, 2024
Figure 1 for Time-Reversal Provides Unsupervised Feedback to LLMs
Figure 2 for Time-Reversal Provides Unsupervised Feedback to LLMs
Figure 3 for Time-Reversal Provides Unsupervised Feedback to LLMs
Figure 4 for Time-Reversal Provides Unsupervised Feedback to LLMs
Viaarxiv icon