Picture for Dominick Reilly

Dominick Reilly

World Action Models Enable Continual Imitation Learning with Recurrent Generative Replays

Add code
Jun 25, 2026
Viaarxiv icon

TimeProVe: Propose, then Verify for Efficient Long Video Temporal Reasoning in Activities of Daily Living

Add code
Jun 18, 2026
Viaarxiv icon

UNIEGO: Proxies as Mediators for Unified Egocentric Video Representation Learning

Add code
Jun 18, 2026
Viaarxiv icon

UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models

Add code
Feb 23, 2026
Viaarxiv icon

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living

Add code
Feb 05, 2025
Figure 1 for SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
Figure 2 for SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
Figure 3 for SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
Figure 4 for SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
Viaarxiv icon

From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities

Add code
Jan 10, 2025
Figure 1 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Figure 2 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Figure 3 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Figure 4 for From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities
Viaarxiv icon

Introducing Gating and Context into Temporal Action Detection

Add code
Sep 06, 2024
Viaarxiv icon

Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Add code
Jun 27, 2024
Figure 1 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Figure 2 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Figure 3 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Figure 4 for Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Viaarxiv icon

LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

Add code
Jun 13, 2024
Viaarxiv icon

Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living

Add code
Nov 30, 2023
Viaarxiv icon