Picture for Amir Zadeh

Amir Zadeh

Ehsan

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Add code
Aug 11, 2025
Viaarxiv icon

Lessons from Training Grounded LLMs with Verifiable Rewards

Add code
Jun 18, 2025
Viaarxiv icon

Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision

Add code
May 26, 2025
Viaarxiv icon

VeriFastScore: Speeding up long-form factuality evaluation

Add code
May 22, 2025
Viaarxiv icon

BLEUBERI: BLEU is a surprisingly effective reward for instruction following

Add code
May 16, 2025
Viaarxiv icon

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Add code
Apr 28, 2025
Viaarxiv icon

PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference

Add code
Mar 30, 2025
Viaarxiv icon

Hi5: 2D Hand Pose Estimation with Zero Human Annotation

Add code
Jun 05, 2024
Figure 1 for Hi5: 2D Hand Pose Estimation with Zero Human Annotation
Figure 2 for Hi5: 2D Hand Pose Estimation with Zero Human Annotation
Figure 3 for Hi5: 2D Hand Pose Estimation with Zero Human Annotation
Figure 4 for Hi5: 2D Hand Pose Estimation with Zero Human Annotation
Viaarxiv icon

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding

Add code
Mar 02, 2023
Viaarxiv icon

Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions

Add code
Sep 07, 2022
Figure 1 for Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Figure 2 for Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Figure 3 for Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Figure 4 for Foundations and Recent Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Viaarxiv icon