Picture for Priyanka Nigam

Priyanka Nigam

Allen

Translate-R1: Cost-Aware Translation Tool Use via Reinforcement Learning

Add code
Jun 05, 2026
Viaarxiv icon

Unlocking Latent Value: Taxonomy-Guided Recovery of High-Performing Data from Low-Tier Web Corpora

Add code
Jun 05, 2026
Viaarxiv icon

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards

Add code
Jun 02, 2026
Viaarxiv icon

Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards

Add code
Mar 25, 2026
Viaarxiv icon

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

Add code
Feb 27, 2026
Viaarxiv icon

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

Add code
Jan 30, 2026
Viaarxiv icon

Aligning Large Language Models with Implicit Preferences from User-Generated Content

Add code
Jun 04, 2025
Figure 1 for Aligning Large Language Models with Implicit Preferences from User-Generated Content
Figure 2 for Aligning Large Language Models with Implicit Preferences from User-Generated Content
Figure 3 for Aligning Large Language Models with Implicit Preferences from User-Generated Content
Figure 4 for Aligning Large Language Models with Implicit Preferences from User-Generated Content
Viaarxiv icon

Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training

Add code
Feb 10, 2025
Figure 1 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 2 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 3 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Figure 4 for Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
Viaarxiv icon

Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models

Add code
Oct 28, 2024
Figure 1 for Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models
Figure 2 for Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models
Figure 3 for Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models
Figure 4 for Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models
Viaarxiv icon

Evolutionary Contrastive Distillation for Language Model Alignment

Add code
Oct 10, 2024
Figure 1 for Evolutionary Contrastive Distillation for Language Model Alignment
Figure 2 for Evolutionary Contrastive Distillation for Language Model Alignment
Figure 3 for Evolutionary Contrastive Distillation for Language Model Alignment
Figure 4 for Evolutionary Contrastive Distillation for Language Model Alignment
Viaarxiv icon