Bhuwan Dhingra

RVPO: Risk-Sensitive Alignment via Variance Regularization

May 07, 2026
Via arXiv

Document-as-Image Representations Fall Short for Scientific Retrieval

Apr 20, 2026
Via arXiv

Coding Agents are Effective Long-Context Processors

Mar 20, 2026
Via arXiv

InData: Towards Secure Multi-Step, Tool-Based Data Analysis

Nov 14, 2025
Via arXiv

Staircase Streaming for Low-Latency Multi-Agent Inference

Oct 06, 2025
Via arXiv

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

May 30, 2025
Via arXiv

Interleaved Reasoning for Large Language Models via Reinforcement Learning

May 26, 2025
Via arXiv

Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining

May 16, 2025
Via arXiv

Atomic Consistency Preference Optimization for Long-Form Question Answering

May 14, 2025
Via arXiv

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

May 05, 2025
Via arXiv