Bhaskar Ramasubramanian

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

May 29, 2025
Via arXiv

SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge

May 27, 2025
Via arXiv

Temporal Sampling for Forgotten Reasoning in LLMs

May 26, 2025
Via arXiv

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

May 20, 2025
Via arXiv

Small Models Struggle to Learn from Strong Reasoners

Feb 17, 2025
Via arXiv

A Method for Fast Autonomy Transfer in Reinforcement Learning

Jul 29, 2024
Via arXiv

CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

Jun 18, 2024
Via arXiv

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Feb 22, 2024
Via arXiv

Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors

Feb 12, 2024
Via arXiv

Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization

Feb 02, 2024
Via arXiv