Picture for Pulkit Agrawal

Pulkit Agrawal

DEXOP: A Device for Robotic Transfer of Dexterous Human Manipulation

Add code
Sep 04, 2025
Viaarxiv icon

RL's Razor: Why Online Reinforcement Learning Forgets Less

Add code
Sep 04, 2025
Viaarxiv icon

DexWrist: A Robotic Wrist for Constrained and Dynamic Manipulation

Add code
Jul 01, 2025
Viaarxiv icon

Self-Adapting Language Models

Add code
Jun 12, 2025
Viaarxiv icon

FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning

Add code
Apr 30, 2025
Viaarxiv icon

Language Model Personalization via Reward Factorization

Add code
Mar 08, 2025
Viaarxiv icon

General Reasoning Requires Learning to Reason from the Get-go

Add code
Feb 26, 2025
Viaarxiv icon

Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning

Add code
Feb 17, 2025
Figure 1 for Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning
Figure 2 for Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning
Figure 3 for Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning
Figure 4 for Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning
Viaarxiv icon

Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules

Add code
Feb 09, 2025
Viaarxiv icon

Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning

Add code
Dec 17, 2024
Viaarxiv icon