Zun Wang

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Dec 03, 2023
Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao


Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields

Aug 11, 2023
Yatao Li, Wanling Gao, Lei Wang, Lixin Sun, Zun Wang, Jianfeng Zhan


Scaling Data Generation in Vision-and-Language Navigation

Aug 09, 2023
Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao


ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments

Apr 07, 2023
Dong An, Hanqing Wang, Wenguan Wang, Zun Wang, Yan Huang, Keji He, Liang Wang


InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Dec 07, 2022
Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao


An ensemble of VisNet, Transformer-M, and pretraining models for molecular property prediction in OGB Large-Scale Challenge @ NeurIPS 2022

Nov 23, 2022
Yusong Wang, Shaoning Li, Tong Wang, Zun Wang, Xinheng He, Bin Shao, Tie-Yan Liu


InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Nov 17, 2022
Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao


1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)

Jun 26, 2022
Dong An, Zun Wang, Yangguang Li, Yi Wang, Yicong Hong, Yan Huang, Liang Wang, Jing Shao
