Picture for Santhosh Kumar Ramakrishnan

Santhosh Kumar Ramakrishnan

Do multimodal models imagine electric sheep?

Add code
May 10, 2026
Viaarxiv icon

Does Spatial Cognition Emerge in Frontier Models?

Add code
Oct 09, 2024
Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Add code
Nov 30, 2023
Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos

Add code
Jul 17, 2023
Figure 1 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Figure 2 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Figure 3 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Figure 4 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Viaarxiv icon

SpotEM: Efficient Video Search for Episodic Memory

Add code
Jun 28, 2023
Figure 1 for SpotEM: Efficient Video Search for Episodic Memory
Figure 2 for SpotEM: Efficient Video Search for Episodic Memory
Figure 3 for SpotEM: Efficient Video Search for Episodic Memory
Figure 4 for SpotEM: Efficient Video Search for Episodic Memory
Viaarxiv icon

Single-Stage Visual Query Localization in Egocentric Videos

Add code
Jun 15, 2023
Figure 1 for Single-Stage Visual Query Localization in Egocentric Videos
Figure 2 for Single-Stage Visual Query Localization in Egocentric Videos
Figure 3 for Single-Stage Visual Query Localization in Egocentric Videos
Figure 4 for Single-Stage Visual Query Localization in Egocentric Videos
Viaarxiv icon

A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

Add code
Jan 18, 2023
Figure 1 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 2 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 3 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 4 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Viaarxiv icon

NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory

Add code
Jan 02, 2023
Figure 1 for NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
Figure 2 for NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
Figure 3 for NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
Figure 4 for NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
Viaarxiv icon

Habitat-Matterport 3D Semantics Dataset

Add code
Oct 11, 2022
Figure 1 for Habitat-Matterport 3D Semantics Dataset
Figure 2 for Habitat-Matterport 3D Semantics Dataset
Figure 3 for Habitat-Matterport 3D Semantics Dataset
Figure 4 for Habitat-Matterport 3D Semantics Dataset
Viaarxiv icon

Egocentric scene context for human-centric environment understanding from video

Add code
Jul 22, 2022
Figure 1 for Egocentric scene context for human-centric environment understanding from video
Figure 2 for Egocentric scene context for human-centric environment understanding from video
Figure 3 for Egocentric scene context for human-centric environment understanding from video
Figure 4 for Egocentric scene context for human-centric environment understanding from video
Viaarxiv icon