Alert button
Picture for Kumar Ashutosh

Kumar Ashutosh

Alert button

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Add code
Bookmark button
Alert button
Apr 08, 2024
Changan Chen, Kumar Ashutosh, Rohit Girdhar, David Harwath, Kristen Grauman

Viaarxiv icon

Detours for Navigating Instructional Videos

Add code
Bookmark button
Alert button
Jan 03, 2024
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan, Kristen Grauman

Viaarxiv icon

Learning Object State Changes in Videos: An Open-World Perspective

Add code
Bookmark button
Alert button
Dec 19, 2023
Zihui Xue, Kumar Ashutosh, Kristen Grauman

Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Add code
Bookmark button
Alert button
Nov 30, 2023
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Video-Mined Task Graphs for Keystep Recognition in Instructional Videos

Add code
Bookmark button
Alert button
Jul 17, 2023
Kumar Ashutosh, Santhosh Kumar Ramakrishnan, Triantafyllos Afouras, Kristen Grauman

Figure 1 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Figure 2 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Figure 3 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Figure 4 for Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Viaarxiv icon

HierVL: Learning Hierarchical Video-Language Embeddings

Add code
Bookmark button
Alert button
Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Figure 1 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 2 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 3 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 4 for HierVL: Learning Hierarchical Video-Language Embeddings
Viaarxiv icon

What You Say Is What You Show: Visual Narration Detection in Instructional Videos

Add code
Bookmark button
Alert button
Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Figure 1 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Figure 2 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Figure 3 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Figure 4 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Viaarxiv icon

RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging

Add code
Bookmark button
Alert button
Oct 15, 2022
Ajay Jaiswal, Kumar Ashutosh, Justin F Rousseau, Yifan Peng, Zhangyang Wang, Ying Ding

Figure 1 for RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging
Figure 2 for RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging
Figure 3 for RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging
Figure 4 for RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging
Viaarxiv icon

3D-NVS: A 3D Supervision Approach for Next View Selection

Add code
Bookmark button
Alert button
Dec 03, 2020
Kumar Ashutosh, Saurabh Kumar, Subhasis Chaudhuri

Figure 1 for 3D-NVS: A 3D Supervision Approach for Next View Selection
Figure 2 for 3D-NVS: A 3D Supervision Approach for Next View Selection
Figure 3 for 3D-NVS: A 3D Supervision Approach for Next View Selection
Figure 4 for 3D-NVS: A 3D Supervision Approach for Next View Selection
Viaarxiv icon

Lower Bounds for Policy Iteration on Multi-action MDPs

Add code
Bookmark button
Alert button
Sep 16, 2020
Kumar Ashutosh, Sarthak Consul, Bhishma Dedhia, Parthasarathi Khirwadkar, Sahil Shah, Shivaram Kalyanakrishnan

Figure 1 for Lower Bounds for Policy Iteration on Multi-action MDPs
Figure 2 for Lower Bounds for Policy Iteration on Multi-action MDPs
Figure 3 for Lower Bounds for Policy Iteration on Multi-action MDPs
Viaarxiv icon