Picture for Frederik Diederichs

Frederik Diederichs

Vision-language Models for Driver Monitoring Systems: A Driver Activity Description Dataset

Add code
Jun 01, 2026
Viaarxiv icon

Multi-modal Video Representation Alignment for Robust Self-supervised Driver Distraction Detection

Add code
Jun 01, 2026
Viaarxiv icon

FlowNar: Scalable Streaming Narration for Long-Form Videos

Add code
May 30, 2026
Viaarxiv icon

QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024

Add code
Jul 04, 2024
Figure 1 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Figure 2 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Figure 3 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Figure 4 for QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Viaarxiv icon