Picture for Zilong Zheng

Zilong Zheng

Mars: Situated Inductive Reasoning in an Open-World Environment

Add code
Oct 10, 2024
Figure 1 for Mars: Situated Inductive Reasoning in an Open-World Environment
Figure 2 for Mars: Situated Inductive Reasoning in an Open-World Environment
Figure 3 for Mars: Situated Inductive Reasoning in an Open-World Environment
Figure 4 for Mars: Situated Inductive Reasoning in an Open-World Environment
Viaarxiv icon

Alignment Between the Decision-Making Logic of LLMs and Human Cognition: A Case Study on Legal LLMs

Add code
Oct 06, 2024
Viaarxiv icon

VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges

Add code
Sep 02, 2024
Figure 1 for VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges
Figure 2 for VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges
Figure 3 for VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges
Figure 4 for VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges
Viaarxiv icon

ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning

Add code
Aug 05, 2024
Viaarxiv icon

Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing

Add code
Jul 12, 2024
Figure 1 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Figure 2 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Figure 3 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Figure 4 for Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Viaarxiv icon

VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models

Add code
Jun 24, 2024
Viaarxiv icon

LangSuitE: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments

Add code
Jun 24, 2024
Viaarxiv icon

Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels

Add code
Jun 24, 2024
Viaarxiv icon

Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers

Add code
Jun 24, 2024
Viaarxiv icon

In-Context Editing: Learning Knowledge from Self-Induced Distributions

Add code
Jun 17, 2024
Figure 1 for In-Context Editing: Learning Knowledge from Self-Induced Distributions
Figure 2 for In-Context Editing: Learning Knowledge from Self-Induced Distributions
Figure 3 for In-Context Editing: Learning Knowledge from Self-Induced Distributions
Figure 4 for In-Context Editing: Learning Knowledge from Self-Induced Distributions
Viaarxiv icon