Picture for Amir Zadeh

Amir Zadeh

Ehsan

Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling

Add code
Mar 04, 2026
Viaarxiv icon

Iterative Refinement Improves Compositional Image Generation

Add code
Jan 21, 2026
Viaarxiv icon

NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards

Add code
Nov 18, 2025
Figure 1 for NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
Figure 2 for NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
Figure 3 for NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
Figure 4 for NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
Viaarxiv icon

OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!

Add code
Sep 30, 2025
Figure 1 for OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!
Figure 2 for OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!
Figure 3 for OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!
Figure 4 for OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!
Viaarxiv icon

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Add code
Aug 11, 2025
Viaarxiv icon

Lessons from Training Grounded LLMs with Verifiable Rewards

Add code
Jun 18, 2025
Figure 1 for Lessons from Training Grounded LLMs with Verifiable Rewards
Figure 2 for Lessons from Training Grounded LLMs with Verifiable Rewards
Figure 3 for Lessons from Training Grounded LLMs with Verifiable Rewards
Figure 4 for Lessons from Training Grounded LLMs with Verifiable Rewards
Viaarxiv icon

Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision

Add code
May 26, 2025
Viaarxiv icon

VeriFastScore: Speeding up long-form factuality evaluation

Add code
May 22, 2025
Viaarxiv icon

BLEUBERI: BLEU is a surprisingly effective reward for instruction following

Add code
May 16, 2025
Figure 1 for BLEUBERI: BLEU is a surprisingly effective reward for instruction following
Figure 2 for BLEUBERI: BLEU is a surprisingly effective reward for instruction following
Figure 3 for BLEUBERI: BLEU is a surprisingly effective reward for instruction following
Figure 4 for BLEUBERI: BLEU is a surprisingly effective reward for instruction following
Viaarxiv icon

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Add code
Apr 28, 2025
Figure 1 for NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
Figure 2 for NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
Figure 3 for NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
Figure 4 for NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
Viaarxiv icon