Picture for Bhaskar Ramasubramanian

Bhaskar Ramasubramanian

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Add code
May 29, 2025
Viaarxiv icon

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

Add code
May 27, 2025
Viaarxiv icon

Temporal Sampling for Forgotten Reasoning in LLMs

Add code
May 26, 2025
Viaarxiv icon

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Add code
May 20, 2025
Viaarxiv icon

Small Models Struggle to Learn from Strong Reasoners

Add code
Feb 17, 2025
Figure 1 for Small Models Struggle to Learn from Strong Reasoners
Figure 2 for Small Models Struggle to Learn from Strong Reasoners
Figure 3 for Small Models Struggle to Learn from Strong Reasoners
Figure 4 for Small Models Struggle to Learn from Strong Reasoners
Viaarxiv icon

A Method for Fast Autonomy Transfer in Reinforcement Learning

Add code
Jul 29, 2024
Figure 1 for A Method for Fast Autonomy Transfer in Reinforcement Learning
Figure 2 for A Method for Fast Autonomy Transfer in Reinforcement Learning
Figure 3 for A Method for Fast Autonomy Transfer in Reinforcement Learning
Figure 4 for A Method for Fast Autonomy Transfer in Reinforcement Learning
Viaarxiv icon

CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Add code
Feb 22, 2024
Figure 1 for ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
Figure 2 for ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
Figure 3 for ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
Figure 4 for ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
Viaarxiv icon

Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors

Add code
Feb 12, 2024
Figure 1 for Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors
Figure 2 for Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors
Figure 3 for Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors
Figure 4 for Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors
Viaarxiv icon

Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization

Add code
Feb 02, 2024
Figure 1 for Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
Figure 2 for Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
Figure 3 for Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
Figure 4 for Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
Viaarxiv icon