Picture for Ivan Vulić

Ivan Vulić

RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding

Add code
May 20, 2025
Viaarxiv icon

Visual Planning: Let's Think Only with Images

Add code
May 16, 2025
Viaarxiv icon

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Add code
Apr 21, 2025
Viaarxiv icon

Cross-Tokenizer Distillation via Approximate Likelihood Matching

Add code
Mar 27, 2025
Viaarxiv icon

Training Plug-n-Play Knowledge Modules with Deep Context Distillation

Add code
Mar 11, 2025
Viaarxiv icon

Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies

Add code
Feb 04, 2025
Viaarxiv icon

Imagine while Reasoning in Space: Multimodal Visualization-of-Thought

Add code
Jan 13, 2025
Figure 1 for Imagine while Reasoning in Space: Multimodal Visualization-of-Thought
Figure 2 for Imagine while Reasoning in Space: Multimodal Visualization-of-Thought
Figure 3 for Imagine while Reasoning in Space: Multimodal Visualization-of-Thought
Figure 4 for Imagine while Reasoning in Space: Multimodal Visualization-of-Thought
Viaarxiv icon

Language Fusion for Parameter-Efficient Cross-lingual Transfer

Add code
Jan 12, 2025
Viaarxiv icon

Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding

Add code
Jan 10, 2025
Viaarxiv icon

Retrofitting (Large) Language Models with Dynamic Tokenization

Add code
Nov 27, 2024
Viaarxiv icon