Picture for Ismini Lourentzou

Ismini Lourentzou

HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation

Add code
Jun 26, 2025
Viaarxiv icon

Open World Scene Graph Generation using Vision Language Models

Add code
Jun 09, 2025
Figure 1 for Open World Scene Graph Generation using Vision Language Models
Figure 2 for Open World Scene Graph Generation using Vision Language Models
Figure 3 for Open World Scene Graph Generation using Vision Language Models
Figure 4 for Open World Scene Graph Generation using Vision Language Models
Viaarxiv icon

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Add code
Jun 08, 2025
Viaarxiv icon

Uncertainty in Action: Confidence Elicitation in Embodied Agents

Add code
Mar 13, 2025
Viaarxiv icon

FAIR: Facilitating Artificial Intelligence Resilience in Manufacturing Industrial Internet

Add code
Mar 03, 2025
Figure 1 for FAIR: Facilitating Artificial Intelligence Resilience in Manufacturing Industrial Internet
Figure 2 for FAIR: Facilitating Artificial Intelligence Resilience in Manufacturing Industrial Internet
Figure 3 for FAIR: Facilitating Artificial Intelligence Resilience in Manufacturing Industrial Internet
Figure 4 for FAIR: Facilitating Artificial Intelligence Resilience in Manufacturing Industrial Internet
Viaarxiv icon

CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models

Add code
Dec 26, 2024
Viaarxiv icon

PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation

Add code
Dec 19, 2024
Figure 1 for PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation
Figure 2 for PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation
Figure 3 for PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation
Figure 4 for PRIMA: Multi-Image Vision-Language Models for Reasoning Segmentation
Viaarxiv icon

Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Add code
Dec 12, 2024
Figure 1 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG
Figure 2 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG
Figure 3 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG
Figure 4 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG
Viaarxiv icon

uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures

Add code
Mar 14, 2024
Viaarxiv icon

Commonsense for Zero-Shot Natural Language Video Localization

Add code
Dec 29, 2023
Figure 1 for Commonsense for Zero-Shot Natural Language Video Localization
Figure 2 for Commonsense for Zero-Shot Natural Language Video Localization
Figure 3 for Commonsense for Zero-Shot Natural Language Video Localization
Figure 4 for Commonsense for Zero-Shot Natural Language Video Localization
Viaarxiv icon