Picture for Archit Sharma

Archit Sharma

RoboSubtaskNet: Temporal Sub-task Segmentation for Human-to-Robot Skill Transfer in Real-World Environments

Add code
Feb 11, 2026
Viaarxiv icon

Instruct2Act: From Human Instruction to Actions Sequencing and Execution via Robot Action Network for Robotic Manipulation

Add code
Feb 10, 2026
Viaarxiv icon

FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users

Add code
Feb 26, 2025
Viaarxiv icon

Test-Time Alignment via Hypothesis Reweighting

Add code
Dec 11, 2024
Viaarxiv icon

Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone

Add code
Dec 09, 2024
Viaarxiv icon

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Add code
Oct 31, 2024
Viaarxiv icon

Towards Data-Centric RLHF: Simple Metrics for Preference Dataset Comparison

Add code
Sep 15, 2024
Viaarxiv icon

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

Add code
Apr 23, 2024
Figure 1 for Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Figure 2 for Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Figure 3 for Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Figure 4 for Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Viaarxiv icon

Stream of Search : Learning to Search in Language

Add code
Apr 01, 2024
Viaarxiv icon

Yell At Your Robot: Improving On-the-Fly from Language Corrections

Add code
Mar 19, 2024
Viaarxiv icon