Picture for Nimrod Shabtay

Nimrod Shabtay

CLIMP: Contrastive Language-Image Mamba Pretraining

Add code
Jan 11, 2026
Viaarxiv icon

Spoken question answering for visual queries

Add code
May 29, 2025
Viaarxiv icon

Teaching VLMs to Localize Specific Objects from In-context Examples

Add code
Nov 20, 2024
Figure 1 for Teaching VLMs to Localize Specific Objects from In-context Examples
Figure 2 for Teaching VLMs to Localize Specific Objects from In-context Examples
Figure 3 for Teaching VLMs to Localize Specific Objects from In-context Examples
Figure 4 for Teaching VLMs to Localize Specific Objects from In-context Examples
Viaarxiv icon

Continuous Speech Synthesis using per-token Latent Diffusion

Add code
Oct 21, 2024
Figure 1 for Continuous Speech Synthesis using per-token Latent Diffusion
Figure 2 for Continuous Speech Synthesis using per-token Latent Diffusion
Figure 3 for Continuous Speech Synthesis using per-token Latent Diffusion
Figure 4 for Continuous Speech Synthesis using per-token Latent Diffusion
Viaarxiv icon

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Add code
Oct 15, 2024
Figure 1 for LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Figure 2 for LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Figure 3 for LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Figure 4 for LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Viaarxiv icon

Deep Phase Coded Image Prior

Add code
Apr 05, 2024
Figure 1 for Deep Phase Coded Image Prior
Figure 2 for Deep Phase Coded Image Prior
Figure 3 for Deep Phase Coded Image Prior
Figure 4 for Deep Phase Coded Image Prior
Viaarxiv icon

PIP: Positional-encoding Image Prior

Add code
Nov 25, 2022
Figure 1 for PIP: Positional-encoding Image Prior
Figure 2 for PIP: Positional-encoding Image Prior
Figure 3 for PIP: Positional-encoding Image Prior
Figure 4 for PIP: Positional-encoding Image Prior
Viaarxiv icon