Picture for Hongjie Zhang

Hongjie Zhang

Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning

Add code
Aug 27, 2024
Viaarxiv icon

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

Add code
Mar 24, 2024
Viaarxiv icon

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Add code
Mar 22, 2024
Viaarxiv icon

MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding

Add code
Dec 08, 2023
Figure 1 for MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding
Figure 2 for MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding
Figure 3 for MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding
Figure 4 for MoVQA: A Benchmark of Versatile Question-Answering for Long-Form Movie Understanding
Viaarxiv icon

Multi-view Feature Extraction based on Triple Contrastive Heads

Add code
Mar 22, 2023
Figure 1 for Multi-view Feature Extraction based on Triple Contrastive Heads
Figure 2 for Multi-view Feature Extraction based on Triple Contrastive Heads
Figure 3 for Multi-view Feature Extraction based on Triple Contrastive Heads
Figure 4 for Multi-view Feature Extraction based on Triple Contrastive Heads
Viaarxiv icon

Multi-view Feature Extraction based on Dual Contrastive Head

Add code
Feb 08, 2023
Figure 1 for Multi-view Feature Extraction based on Dual Contrastive Head
Figure 2 for Multi-view Feature Extraction based on Dual Contrastive Head
Figure 3 for Multi-view Feature Extraction based on Dual Contrastive Head
Viaarxiv icon

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Add code
Dec 07, 2022
Figure 1 for InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Figure 2 for InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Figure 3 for InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Figure 4 for InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Viaarxiv icon

AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning

Add code
Nov 28, 2022
Figure 1 for AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Figure 2 for AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Figure 3 for AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Figure 4 for AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Viaarxiv icon

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Add code
Nov 17, 2022
Figure 1 for InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Figure 2 for InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Figure 3 for InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Figure 4 for InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Viaarxiv icon

Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples

Add code
Jan 11, 2022
Figure 1 for Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples
Figure 2 for Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples
Figure 3 for Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples
Figure 4 for Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples
Viaarxiv icon