Nakul Agarwal

Pose-Aware Weakly-Supervised Action Segmentation

Apr 08, 2025

ACE: Action Concept Enhancement of Video-Language Models in Procedural Videos

Nov 23, 2024

IIT Bombay Racing Driverless: Autonomous Driving Stack for Formula Student AI

Aug 12, 2024

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

Jul 19, 2024

Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

May 30, 2024

Multi-Objective Recommendation via Multivariate Policy Learning

May 03, 2024

Disentangled Neural Relational Inference for Interpretable Motion Prediction

Jan 07, 2024

Vamos: Versatile Action Models for Video Understanding

Nov 22, 2023

Object-centric Video Representation for Long-term Action Anticipation

Oct 31, 2023

Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning

Sep 12, 2023