Picture for Vassilis Katsouros

Vassilis Katsouros

Pay (Cross) Attention to the Melody: Curriculum Masking for Single-Encoder Melodic Harmonization

Add code
Jan 22, 2026
Viaarxiv icon

Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search

Add code
Dec 09, 2025
Viaarxiv icon

Incorporating Structure and Chord Constraints in Symbolic Transformer-based Melodic Harmonization

Add code
Dec 08, 2025
Viaarxiv icon

VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion

Add code
Sep 19, 2025
Viaarxiv icon

MEDUSA: A Multimodal Deep Fusion Multi-Stage Training Framework for Speech Emotion Recognition in Naturalistic Conditions

Add code
Jun 11, 2025
Viaarxiv icon

Krikri: Advancing Open Large Language Models for Greek

Add code
May 19, 2025
Figure 1 for Krikri: Advancing Open Large Language Models for Greek
Figure 2 for Krikri: Advancing Open Large Language Models for Greek
Figure 3 for Krikri: Advancing Open Large Language Models for Greek
Figure 4 for Krikri: Advancing Open Large Language Models for Greek
Viaarxiv icon

DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis

Add code
Apr 15, 2025
Viaarxiv icon

Meltemi: The first open Large Language Model for Greek

Add code
Jul 30, 2024
Figure 1 for Meltemi: The first open Large Language Model for Greek
Figure 2 for Meltemi: The first open Large Language Model for Greek
Figure 3 for Meltemi: The first open Large Language Model for Greek
Figure 4 for Meltemi: The first open Large Language Model for Greek
Viaarxiv icon

The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data

Add code
Jun 21, 2024
Figure 1 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Figure 2 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Figure 3 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Figure 4 for The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
Viaarxiv icon

Weakly-supervised Automated Audio Captioning via text only training

Add code
Sep 21, 2023
Figure 1 for Weakly-supervised Automated Audio Captioning via text only training
Figure 2 for Weakly-supervised Automated Audio Captioning via text only training
Figure 3 for Weakly-supervised Automated Audio Captioning via text only training
Figure 4 for Weakly-supervised Automated Audio Captioning via text only training
Viaarxiv icon