Tushar Nagarajan

VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning

Oct 04, 2024

Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos

Sep 30, 2024

AMEGO: Active Memory from long EGOcentric videos

Sep 17, 2024

Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning

Aug 07, 2024

User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance

Aug 04, 2024

ExpertAF: Expert Actionable Feedback from Video

Aug 01, 2024

Step Differences in Instructional Video

Apr 24, 2024

Video ReCap: Recursive Captioning of Hour-Long Videos

Feb 28, 2024

Detours for Navigating Instructional Videos

Jan 03, 2024

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023