Picture for Amrit Singh Bedi

Amrit Singh Bedi

Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts

Add code
Oct 06, 2025
Figure 1 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 2 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 3 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Figure 4 for Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Viaarxiv icon

MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models

Add code
Oct 02, 2025
Viaarxiv icon

Leveraging Pre-Trained Visual Models for AI-Generated Video Detection

Add code
Jul 17, 2025
Viaarxiv icon

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Add code
Jun 04, 2025
Viaarxiv icon

Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time

Add code
May 29, 2025
Viaarxiv icon

Sample Complexity of Diffusion Model Training Without Empirical Risk Minimizer Access

Add code
May 23, 2025
Viaarxiv icon

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Add code
Apr 02, 2025
Viaarxiv icon

Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm

Add code
Mar 24, 2025
Figure 1 for Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm
Figure 2 for Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm
Figure 3 for Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm
Figure 4 for Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm
Viaarxiv icon

BalancedDPO: Adaptive Multi-Metric Alignment

Add code
Mar 16, 2025
Viaarxiv icon

Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment

Add code
Jan 07, 2025
Figure 1 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Figure 2 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Figure 3 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Figure 4 for Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
Viaarxiv icon