Picture for Vassilis Katsouros

Vassilis Katsouros

When Slots Compete: Slot Merging in Object-Centric Learning

Add code
Mar 11, 2026
Viaarxiv icon

Pay (Cross) Attention to the Melody: Curriculum Masking for Single-Encoder Melodic Harmonization

Add code
Jan 22, 2026
Viaarxiv icon

Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search

Add code
Dec 09, 2025
Viaarxiv icon

Incorporating Structure and Chord Constraints in Symbolic Transformer-based Melodic Harmonization

Add code
Dec 08, 2025
Viaarxiv icon

VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion

Add code
Sep 19, 2025
Figure 1 for VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion
Figure 2 for VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion
Figure 3 for VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion
Figure 4 for VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion
Viaarxiv icon

MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions

Add code
Jun 11, 2025
Viaarxiv icon

Krikri: Advancing Open Large Language Models for Greek

Add code
May 19, 2025
Figure 1 for Krikri: Advancing Open Large Language Models for Greek
Figure 2 for Krikri: Advancing Open Large Language Models for Greek
Figure 3 for Krikri: Advancing Open Large Language Models for Greek
Figure 4 for Krikri: Advancing Open Large Language Models for Greek
Viaarxiv icon

DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis

Add code
Apr 15, 2025
Viaarxiv icon

Meltemi: The first open Large Language Model for Greek

Add code
Jul 30, 2024
Figure 1 for Meltemi: The first open Large Language Model for Greek
Figure 2 for Meltemi: The first open Large Language Model for Greek
Figure 3 for Meltemi: The first open Large Language Model for Greek
Figure 4 for Meltemi: The first open Large Language Model for Greek
Viaarxiv icon

The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data

Add code
Jun 21, 2024
Figure 1 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Figure 2 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Figure 3 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Figure 4 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Viaarxiv icon