Picture for Lorenzo Baraldi

Lorenzo Baraldi

Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection

Add code
Sep 16, 2024
Figure 1 for Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection
Viaarxiv icon

Fluent and Accurate Image Captioning with a Self-Trained Reward Model

Add code
Aug 29, 2024
Figure 1 for Fluent and Accurate Image Captioning with a Self-Trained Reward Model
Figure 2 for Fluent and Accurate Image Captioning with a Self-Trained Reward Model
Figure 3 for Fluent and Accurate Image Captioning with a Self-Trained Reward Model
Figure 4 for Fluent and Accurate Image Captioning with a Self-Trained Reward Model
Viaarxiv icon

Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization

Add code
Aug 26, 2024
Viaarxiv icon

UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation

Add code
Aug 08, 2024
Viaarxiv icon

Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

Add code
Jul 29, 2024
Viaarxiv icon

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Add code
Jul 29, 2024
Viaarxiv icon

Towards Retrieval-Augmented Architectures for Image Captioning

Add code
May 21, 2024
Figure 1 for Towards Retrieval-Augmented Architectures for Image Captioning
Figure 2 for Towards Retrieval-Augmented Architectures for Image Captioning
Figure 3 for Towards Retrieval-Augmented Architectures for Image Captioning
Figure 4 for Towards Retrieval-Augmented Architectures for Image Captioning
Viaarxiv icon

Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

Add code
Apr 23, 2024
Figure 1 for Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Figure 2 for Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Figure 3 for Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Figure 4 for Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Viaarxiv icon

AIGeN: An Adversarial Approach for Instruction Generation in VLN

Add code
Apr 15, 2024
Figure 1 for AIGeN: An Adversarial Approach for Instruction Generation in VLN
Figure 2 for AIGeN: An Adversarial Approach for Instruction Generation in VLN
Figure 3 for AIGeN: An Adversarial Approach for Instruction Generation in VLN
Figure 4 for AIGeN: An Adversarial Approach for Instruction Generation in VLN
Viaarxiv icon

Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation

Add code
Apr 09, 2024
Viaarxiv icon