Stevan Rudinac

VL-KGE: Vision-Language Models Meet Knowledge Graph Embeddings

Mar 02, 2026

SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation

Feb 12, 2026

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding

Aug 28, 2025

ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding

May 09, 2025

The CASTLE 2024 Dataset: Advancing the Art of Multimodal Understanding

Mar 21, 2025

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training

Dec 27, 2024

Non-Progressive Influence Maximization in Dynamic Social Networks

Dec 10, 2024

Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models

Nov 08, 2024

Set2Seq Transformer: Learning Permutation Aware Set Representations of Artistic Sequences

Aug 06, 2024

A Novel Evaluation Framework for Image2Text Generation

Aug 03, 2024