Picture for Bhuwan Dhingra

Bhuwan Dhingra

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

Add code
May 30, 2025
Viaarxiv icon

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

Breaking the Batch Barrier (B3) of Contrastive Learning via Smart Batch Mining

Add code
May 16, 2025
Viaarxiv icon

Atomic Consistency Preference Optimization for Long-Form Question Answering

Add code
May 14, 2025
Viaarxiv icon

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

Add code
May 05, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon

Fuzzy Speculative Decoding for a Tunable Accuracy-Runtime Tradeoff

Add code
Feb 28, 2025
Viaarxiv icon

Knowing When to Stop: Dynamic Context Cutoff for Large Language Models

Add code
Feb 03, 2025
Figure 1 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 2 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 3 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 4 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Viaarxiv icon

MatViX: Multimodal Information Extraction from Visually Rich Articles

Add code
Oct 27, 2024
Viaarxiv icon

Enhancing Large Language Models' Situated Faithfulness to External Contexts

Add code
Oct 18, 2024
Viaarxiv icon