Picture for Ruslan Salakhutdinov

Ruslan Salakhutdinov

Shammie

Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation

Add code
Jun 09, 2025
Viaarxiv icon

Can Large Reasoning Models Self-Train?

Add code
May 27, 2025
Viaarxiv icon

AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents

Add code
Mar 12, 2025
Viaarxiv icon

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Add code
Mar 10, 2025
Viaarxiv icon

FACTR: Force-Attending Curriculum Training for Contact-Rich Policy Learning

Add code
Feb 24, 2025
Viaarxiv icon

Training a Generally Curious Agent

Add code
Feb 24, 2025
Viaarxiv icon

Towards Internet-Scale Training For Agents

Add code
Feb 10, 2025
Figure 1 for Towards Internet-Scale Training For Agents
Figure 2 for Towards Internet-Scale Training For Agents
Figure 3 for Towards Internet-Scale Training For Agents
Figure 4 for Towards Internet-Scale Training For Agents
Viaarxiv icon

Self-Regulation and Requesting Interventions

Add code
Feb 07, 2025
Viaarxiv icon

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Figure 1 for The BrowserGym Ecosystem for Web Agent Research
Figure 2 for The BrowserGym Ecosystem for Web Agent Research
Figure 3 for The BrowserGym Ecosystem for Web Agent Research
Figure 4 for The BrowserGym Ecosystem for Web Agent Research
Viaarxiv icon

Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion

Add code
Nov 30, 2024
Figure 1 for Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion
Figure 2 for Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion
Figure 3 for Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion
Figure 4 for Blind Inverse Problem Solving Made Easy by Text-to-Image Latent Diffusion
Viaarxiv icon