Alert button
Picture for Zhiding Yu

Zhiding Yu

Alert button

FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation

Add code
Bookmark button
Alert button
Jul 04, 2023
Zhiqi Li, Zhiding Yu, David Austin, Mingsheng Fang, Shiyi Lan, Jan Kautz, Jose M. Alvarez

Figure 1 for FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
Figure 2 for FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
Figure 3 for FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
Figure 4 for FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
Viaarxiv icon

Differentially Private Video Activity Recognition

Add code
Bookmark button
Alert button
Jun 27, 2023
Zelun Luo, Yuliang Zou, Yijin Yang, Zane Durante, De-An Huang, Zhiding Yu, Chaowei Xiao, Li Fei-Fei, Animashree Anandkumar

Figure 1 for Differentially Private Video Activity Recognition
Figure 2 for Differentially Private Video Activity Recognition
Figure 3 for Differentially Private Video Activity Recognition
Figure 4 for Differentially Private Video Activity Recognition
Viaarxiv icon

SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving

Add code
Bookmark button
Alert button
Jun 15, 2023
Yiming Li, Sihang Li, Xinhao Liu, Moonjun Gong, Kenan Li, Nuo Chen, Zijun Wang, Zhiheng Li, Tao Jiang, Fisher Yu, Yue Wang, Hang Zhao, Zhiding Yu, Chen Feng

Figure 1 for SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
Figure 2 for SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
Figure 3 for SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
Figure 4 for SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
Viaarxiv icon

Real-Time Radiance Fields for Single-Image Portrait View Synthesis

Add code
Bookmark button
Alert button
May 03, 2023
Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, Zhiding Yu, Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, Koki Nagano

Figure 1 for Real-Time Radiance Fields for Single-Image Portrait View Synthesis
Figure 2 for Real-Time Radiance Fields for Single-Image Portrait View Synthesis
Figure 3 for Real-Time Radiance Fields for Single-Image Portrait View Synthesis
Figure 4 for Real-Time Radiance Fields for Single-Image Portrait View Synthesis
Viaarxiv icon

Prismer: A Vision-Language Model with An Ensemble of Experts

Add code
Bookmark button
Alert button
Mar 12, 2023
Shikun Liu, Linxi Fan, Edward Johns, Zhiding Yu, Chaowei Xiao, Anima Anandkumar

Figure 1 for Prismer: A Vision-Language Model with An Ensemble of Experts
Figure 2 for Prismer: A Vision-Language Model with An Ensemble of Experts
Figure 3 for Prismer: A Vision-Language Model with An Ensemble of Experts
Figure 4 for Prismer: A Vision-Language Model with An Ensemble of Experts
Viaarxiv icon

VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion

Add code
Bookmark button
Alert button
Feb 23, 2023
Yiming Li, Zhiding Yu, Christopher Choy, Chaowei Xiao, Jose M. Alvarez, Sanja Fidler, Chen Feng, Anima Anandkumar

Figure 1 for VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
Figure 2 for VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
Figure 3 for VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
Figure 4 for VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
Viaarxiv icon

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

Add code
Bookmark button
Alert button
Feb 09, 2023
Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar

Figure 1 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Figure 2 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Figure 3 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Figure 4 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Viaarxiv icon

Vision Transformers Are Good Mask Auto-Labelers

Add code
Bookmark button
Alert button
Jan 10, 2023
Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, Jose M. Alvarez, Anima Anandkumar

Figure 1 for Vision Transformers Are Good Mask Auto-Labelers
Figure 2 for Vision Transformers Are Good Mask Auto-Labelers
Figure 3 for Vision Transformers Are Good Mask Auto-Labelers
Figure 4 for Vision Transformers Are Good Mask Auto-Labelers
Viaarxiv icon

1st Place Solution of The Robust Vision Challenge 2022 Semantic Segmentation Track

Add code
Bookmark button
Alert button
Nov 07, 2022
Junfei Xiao, Zhichao Xu, Shiyi Lan, Zhiding Yu, Alan Yuille, Anima Anandkumar

Figure 1 for 1st Place Solution of The Robust Vision Challenge 2022 Semantic Segmentation Track
Figure 2 for 1st Place Solution of The Robust Vision Challenge 2022 Semantic Segmentation Track
Figure 3 for 1st Place Solution of The Robust Vision Challenge 2022 Semantic Segmentation Track
Viaarxiv icon

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Add code
Bookmark button
Alert button
Sep 15, 2022
Manli Shu, Weili Nie, De-An Huang, Zhiding Yu, Tom Goldstein, Anima Anandkumar, Chaowei Xiao

Figure 1 for Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
Figure 2 for Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
Figure 3 for Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
Figure 4 for Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
Viaarxiv icon