Picture for Archiki Prasad

Archiki Prasad

Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems

Add code
Apr 06, 2026
Viaarxiv icon

Effective Reasoning Chains Reduce Intrinsic Dimensionality

Add code
Feb 09, 2026
Viaarxiv icon

Conflict-Resolving and Sharpness-Aware Minimization for Generalized Knowledge Editing with Multiple Updates

Add code
Feb 03, 2026
Viaarxiv icon

Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression

Add code
Oct 02, 2025
Viaarxiv icon

GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs

Add code
Jul 24, 2025
Viaarxiv icon

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Add code
Apr 14, 2025
Viaarxiv icon

Multi-Attribute Steering of Language Models via Targeted Intervention

Add code
Feb 18, 2025
Viaarxiv icon

Learning to Generate Unit Tests for Automated Debugging

Add code
Feb 03, 2025
Figure 1 for Learning to Generate Unit Tests for Automated Debugging
Figure 2 for Learning to Generate Unit Tests for Automated Debugging
Figure 3 for Learning to Generate Unit Tests for Automated Debugging
Figure 4 for Learning to Generate Unit Tests for Automated Debugging
Viaarxiv icon

Self-Consistency Preference Optimization

Add code
Nov 06, 2024
Figure 1 for Self-Consistency Preference Optimization
Figure 2 for Self-Consistency Preference Optimization
Figure 3 for Self-Consistency Preference Optimization
Figure 4 for Self-Consistency Preference Optimization
Viaarxiv icon

LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits

Add code
Oct 02, 2024
Viaarxiv icon