Visual Relationship Detection Dataset


StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation

May 15, 2025

FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units

May 13, 2025

RDD: Robust Feature Detector and Descriptor using Deformable Transformer

May 12, 2025

METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection

May 10, 2025

FBRT-YOLO: Faster and Better for Real-Time Aerial Image Detection

Apr 29, 2025

Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization

Apr 18, 2025

MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos

Apr 15, 2025

LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection

Apr 17, 2025

Accurate Tracking of Arabidopsis Root Cortex Cell Nuclei in 3D Time-Lapse Microscopy Images Based on Genetic Algorithm

Apr 17, 2025

Generalized Visual Relation Detection with Diffusion Models

Apr 16, 2025