Sergio Escalera

UB

A Hyperbolic Perspective on Hierarchical Structure in Object-Centric Scene Representations

Mar 14, 2026
Via arXiv

MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data

Mar 09, 2026

Beyond Caption-Based Queries for Video Moment Retrieval

Mar 02, 2026

AdaSpot: Spend Resolution Where It Matters for Precise Event Spotting

Feb 25, 2026

Enhancing Personality Recognition by Comparing the Predictive Power of Traits, Facets, and Nuances

Feb 05, 2026

SOVABench: A Vehicle Surveillance Action Retrieval Benchmark for Multimodal Large Language Models

Jan 08, 2026

PrismVAU: Prompt-Refined Inference System for Multimodal Video Anomaly Understanding

Jan 07, 2026

Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models

Dec 22, 2025

RVLF: A Reinforcing Vision-Language Framework for Gloss-Free Sign Language Translation

Dec 08, 2025

SoccerNet 2025 Challenges Results

Aug 26, 2025