Picture for Stephan Alaniz

Stephan Alaniz

FINER: MLLMs Hallucinate under Fine-grained Negative Queries

Add code
Mar 18, 2026
Viaarxiv icon

Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs

Add code
Oct 01, 2025
Figure 1 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Figure 2 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Figure 3 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Figure 4 for Training-free Uncertainty Guidance for Complex Visual Tasks with MLLMs
Viaarxiv icon

SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions

Add code
Jul 31, 2025
Figure 1 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Figure 2 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Figure 3 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Figure 4 for SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions
Viaarxiv icon

Feasibility with Language Models for Open-World Compositional Zero-Shot Learning

Add code
May 16, 2025
Viaarxiv icon

Concept-Guided Interpretability via Neural Chunking

Add code
May 16, 2025
Figure 1 for Concept-Guided Interpretability via Neural Chunking
Figure 2 for Concept-Guided Interpretability via Neural Chunking
Figure 3 for Concept-Guided Interpretability via Neural Chunking
Figure 4 for Concept-Guided Interpretability via Neural Chunking
Viaarxiv icon

LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance

Add code
May 16, 2025
Viaarxiv icon

A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models

Add code
Mar 30, 2025
Viaarxiv icon

Discovering Chunks in Neural Embeddings for Interpretability

Add code
Feb 03, 2025
Figure 1 for Discovering Chunks in Neural Embeddings for Interpretability
Figure 2 for Discovering Chunks in Neural Embeddings for Interpretability
Figure 3 for Discovering Chunks in Neural Embeddings for Interpretability
Figure 4 for Discovering Chunks in Neural Embeddings for Interpretability
Viaarxiv icon

FLAIR: VLM with Fine-grained Language-informed Image Representations

Add code
Dec 04, 2024
Figure 1 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Figure 2 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Figure 3 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Figure 4 for FLAIR: VLM with Fine-grained Language-informed Image Representations
Viaarxiv icon

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training

Add code
Dec 02, 2024
Figure 1 for COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Figure 2 for COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Figure 3 for COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Figure 4 for COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
Viaarxiv icon