Picture for Brais Martinez

Brais Martinez

Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration

Add code
Mar 27, 2026
Viaarxiv icon

No Hard Negatives Required: Concept Centric Learning Leads to Compositionality without Degrading Zero-shot Capabilities of Contrastive Models

Add code
Mar 26, 2026
Viaarxiv icon

More Images, More Problems? A Controlled Analysis of VLM Failure Modes

Add code
Jan 12, 2026
Viaarxiv icon

Multi-scale Image Super Resolution with a Single Auto-Regressive Model

Add code
Jun 05, 2025
Viaarxiv icon

Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning

Add code
Dec 09, 2024
Figure 1 for Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
Figure 2 for Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
Figure 3 for Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
Figure 4 for Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
Viaarxiv icon

Discriminative Fine-tuning of LVLMs

Add code
Dec 05, 2024
Figure 1 for Discriminative Fine-tuning of LVLMs
Figure 2 for Discriminative Fine-tuning of LVLMs
Figure 3 for Discriminative Fine-tuning of LVLMs
Figure 4 for Discriminative Fine-tuning of LVLMs
Viaarxiv icon

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Add code
Nov 27, 2024
Figure 1 for FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Figure 2 for FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Figure 3 for FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Figure 4 for FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Viaarxiv icon

A Bayesian Approach to Data Point Selection

Add code
Nov 06, 2024
Viaarxiv icon

MobileQuant: Mobile-friendly Quantization for On-device Language Models

Add code
Aug 25, 2024
Figure 1 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Figure 2 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Figure 3 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Figure 4 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Viaarxiv icon

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Add code
Aug 19, 2024
Figure 1 for CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Figure 2 for CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Figure 3 for CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Figure 4 for CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Viaarxiv icon