Alert button
Picture for Zihui Xue

Zihui Xue

Alert button

Detours for Navigating Instructional Videos

Jan 03, 2024
Kumar Ashutosh, Zihui Xue, Tushar Nagarajan, Kristen Grauman

Viaarxiv icon

Learning Object State Changes in Videos: An Open-World Perspective

Dec 19, 2023
Zihui Xue, Kumar Ashutosh, Kristen Grauman

Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment

Jun 08, 2023
Zihui Xue, Kristen Grauman

Figure 1 for Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Figure 2 for Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Figure 3 for Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Figure 4 for Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Viaarxiv icon

Egocentric Video Task Translation @ Ego4D Challenge 2022

Feb 03, 2023
Zihui Xue, Yale Song, Kristen Grauman, Lorenzo Torresani

Figure 1 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Figure 2 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Figure 3 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Figure 4 for Egocentric Video Task Translation @ Ego4D Challenge 2022
Viaarxiv icon

Egocentric Video Task Translation

Dec 13, 2022
Zihui Xue, Yale Song, Kristen Grauman, Lorenzo Torresani

Figure 1 for Egocentric Video Task Translation
Figure 2 for Egocentric Video Task Translation
Figure 3 for Egocentric Video Task Translation
Figure 4 for Egocentric Video Task Translation
Viaarxiv icon

The Modality Focusing Hypothesis: On the Blink of Multimodal Knowledge Distillation

Jun 13, 2022
Zihui Xue, Zhengqi Gao, Sucheng Ren, Hang Zhao

Figure 1 for The Modality Focusing Hypothesis: On the Blink of Multimodal Knowledge Distillation
Figure 2 for The Modality Focusing Hypothesis: On the Blink of Multimodal Knowledge Distillation
Figure 3 for The Modality Focusing Hypothesis: On the Blink of Multimodal Knowledge Distillation
Figure 4 for The Modality Focusing Hypothesis: On the Blink of Multimodal Knowledge Distillation
Viaarxiv icon

Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization

Apr 05, 2022
Zhengqi Gao, Sucheng Ren, Zihui Xue, Siting Li, Hang Zhao

Figure 1 for Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization
Figure 2 for Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization
Figure 3 for Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization
Figure 4 for Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization
Viaarxiv icon

Dynamic Multimodal Fusion

Mar 31, 2022
Zihui Xue, Radu Marculescu

Figure 1 for Dynamic Multimodal Fusion
Figure 2 for Dynamic Multimodal Fusion
Figure 3 for Dynamic Multimodal Fusion
Figure 4 for Dynamic Multimodal Fusion
Viaarxiv icon

SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning

Feb 16, 2022
Zihui Xue, Yuedong Yang, Mengtian Yang, Radu Marculescu

Figure 1 for SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning
Figure 2 for SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning
Figure 3 for SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning
Figure 4 for SUGAR: Efficient Subgraph-level Training via Resource-aware Graph Partitioning
Viaarxiv icon