Marin Vlastelica

Quagmires in SFT-RL Post-Training: When High SFT Scores Mislead and What to Use Instead

Oct 02, 2025
Via arXiv

Divide, Discover, Deploy: Factorized Skill Learning with Symmetry and Style Priors

Aug 27, 2025
Via arXiv

Provable Maximum Entropy Manifold Exploration via Diffusion Models

Jun 18, 2025
Via arXiv

Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints

Jan 08, 2025
Via arXiv

Causal Action Influence Aware Counterfactual Data Augmentation

May 29, 2024
Via arXiv

Learning Diverse Skills for Local Navigation under Multi-constraint Optimality

Oct 03, 2023
Via arXiv

Diffusion Generative Inverse Design

Sep 18, 2023
Via arXiv

Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning

Sep 11, 2023
Via arXiv

Diverse Offline Imitation via Fenchel Duality

Jul 21, 2023
Via arXiv

Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

Jul 19, 2023
Via arXiv