Alert button
Picture for Huiyu Wang

Huiyu Wang

Alert button

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Diffusion Models as Masked Autoencoders

Apr 06, 2023
Chen Wei, Karttikeya Mangalam, Po-Yao Huang, Yanghao Li, Haoqi Fan, Hu Xu, Huiyu Wang, Cihang Xie, Alan Yuille, Christoph Feichtenhofer

Figure 1 for Diffusion Models as Masked Autoencoders
Figure 2 for Diffusion Models as Masked Autoencoders
Figure 3 for Diffusion Models as Masked Autoencoders
Figure 4 for Diffusion Models as Masked Autoencoders
Viaarxiv icon

Ego-Only: Egocentric Action Detection without Exocentric Pretraining

Jan 03, 2023
Huiyu Wang, Mitesh Kumar Singh, Lorenzo Torresani

Figure 1 for Ego-Only: Egocentric Action Detection without Exocentric Pretraining
Figure 2 for Ego-Only: Egocentric Action Detection without Exocentric Pretraining
Figure 3 for Ego-Only: Egocentric Action Detection without Exocentric Pretraining
Figure 4 for Ego-Only: Egocentric Action Detection without Exocentric Pretraining
Viaarxiv icon

Unleashing the Power of Visual Prompting At the Pixel Level

Dec 20, 2022
Junyang Wu, Xianhang Li, Chen Wei, Huiyu Wang, Alan Yuille, Yuyin Zhou, Cihang Xie

Figure 1 for Unleashing the Power of Visual Prompting At the Pixel Level
Figure 2 for Unleashing the Power of Visual Prompting At the Pixel Level
Figure 3 for Unleashing the Power of Visual Prompting At the Pixel Level
Figure 4 for Unleashing the Power of Visual Prompting At the Pixel Level
Viaarxiv icon

SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training

Nov 30, 2022
Yuanze Lin, Chen Wei, Huiyu Wang, Alan Yuille, Cihang Xie

Figure 1 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Figure 2 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Figure 3 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Figure 4 for SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
Viaarxiv icon

Masked Autoencoders Enable Efficient Knowledge Distillers

Aug 25, 2022
Yutong Bai, Zeyu Wang, Junfei Xiao, Chen Wei, Huiyu Wang, Alan Yuille, Yuyin Zhou, Cihang Xie

Figure 1 for Masked Autoencoders Enable Efficient Knowledge Distillers
Figure 2 for Masked Autoencoders Enable Efficient Knowledge Distillers
Figure 3 for Masked Autoencoders Enable Efficient Knowledge Distillers
Figure 4 for Masked Autoencoders Enable Efficient Knowledge Distillers
Viaarxiv icon

k-means Mask Transformer

Jul 08, 2022
Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hatwig Adam, Alan Yuille, Liang-Chieh Chen

Figure 1 for k-means Mask Transformer
Figure 2 for k-means Mask Transformer
Figure 3 for k-means Mask Transformer
Figure 4 for k-means Mask Transformer
Viaarxiv icon

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

Jun 17, 2022
Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen

Figure 1 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Figure 2 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Figure 3 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Figure 4 for CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Viaarxiv icon