Picture for Zun Wang

Zun Wang

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Add code
Jul 17, 2024
Viaarxiv icon

Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models

Add code
Jul 09, 2024
Viaarxiv icon

Infusing Self-Consistency into Density Functional Theory Hamiltonian Prediction via Deep Equilibrium Models

Add code
Jun 06, 2024
Viaarxiv icon

SE3Set: Harnessing equivariant hypergraph neural networks for molecular representation learning

Add code
May 26, 2024
Viaarxiv icon

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Add code
Mar 22, 2024
Viaarxiv icon

Self-Consistency Training for Hamiltonian Prediction

Add code
Mar 14, 2024
Figure 1 for Self-Consistency Training for Hamiltonian Prediction
Figure 2 for Self-Consistency Training for Hamiltonian Prediction
Figure 3 for Self-Consistency Training for Hamiltonian Prediction
Figure 4 for Self-Consistency Training for Hamiltonian Prediction
Viaarxiv icon

Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey

Add code
Mar 05, 2024
Figure 1 for Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
Figure 2 for Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
Figure 3 for Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
Figure 4 for Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
Viaarxiv icon

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Add code
Dec 03, 2023
Figure 1 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 2 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 3 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 4 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Viaarxiv icon

Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields

Add code
Aug 11, 2023
Figure 1 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Figure 2 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Figure 3 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Figure 4 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Viaarxiv icon

Scaling Data Generation in Vision-and-Language Navigation

Add code
Aug 09, 2023
Figure 1 for Scaling Data Generation in Vision-and-Language Navigation
Figure 2 for Scaling Data Generation in Vision-and-Language Navigation
Figure 3 for Scaling Data Generation in Vision-and-Language Navigation
Figure 4 for Scaling Data Generation in Vision-and-Language Navigation
Viaarxiv icon