Picture for Kaiyang Zhou

Kaiyang Zhou

4D Panoptic Scene Graph Generation

Add code
May 16, 2024
Figure 1 for 4D Panoptic Scene Graph Generation
Figure 2 for 4D Panoptic Scene Graph Generation
Figure 3 for 4D Panoptic Scene Graph Generation
Figure 4 for 4D Panoptic Scene Graph Generation
Viaarxiv icon

Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

Add code
Mar 26, 2024
Figure 1 for Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Figure 2 for Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Figure 3 for Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Figure 4 for Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Viaarxiv icon

Open-Vocabulary Calibration for Vision-Language Models

Add code
Feb 15, 2024
Figure 1 for Open-Vocabulary Calibration for Vision-Language Models
Figure 2 for Open-Vocabulary Calibration for Vision-Language Models
Figure 3 for Open-Vocabulary Calibration for Vision-Language Models
Figure 4 for Open-Vocabulary Calibration for Vision-Language Models
Viaarxiv icon

Panoptic Video Scene Graph Generation

Add code
Nov 28, 2023
Figure 1 for Panoptic Video Scene Graph Generation
Figure 2 for Panoptic Video Scene Graph Generation
Figure 3 for Panoptic Video Scene Graph Generation
Figure 4 for Panoptic Video Scene Graph Generation
Viaarxiv icon

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Add code
Oct 12, 2023
Figure 1 for Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Figure 2 for Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Figure 3 for Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Figure 4 for Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Viaarxiv icon

OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection

Add code
Jun 17, 2023
Figure 1 for OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection
Figure 2 for OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection
Figure 3 for OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection
Figure 4 for OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection
Viaarxiv icon

Contextual Object Detection with Multimodal Large Language Models

Add code
May 29, 2023
Figure 1 for Contextual Object Detection with Multimodal Large Language Models
Figure 2 for Contextual Object Detection with Multimodal Large Language Models
Figure 3 for Contextual Object Detection with Multimodal Large Language Models
Figure 4 for Contextual Object Detection with Multimodal Large Language Models
Viaarxiv icon

Semi-Supervised and Long-Tailed Object Detection with CascadeMatch

Add code
May 24, 2023
Viaarxiv icon

What Makes Good Examples for Visual In-Context Learning?

Add code
Feb 01, 2023
Figure 1 for What Makes Good Examples for Visual In-Context Learning?
Figure 2 for What Makes Good Examples for Visual In-Context Learning?
Figure 3 for What Makes Good Examples for Visual In-Context Learning?
Figure 4 for What Makes Good Examples for Visual In-Context Learning?
Viaarxiv icon

Learning to Augment via Implicit Differentiation for Domain Generalization

Add code
Oct 25, 2022
Figure 1 for Learning to Augment via Implicit Differentiation for Domain Generalization
Figure 2 for Learning to Augment via Implicit Differentiation for Domain Generalization
Figure 3 for Learning to Augment via Implicit Differentiation for Domain Generalization
Figure 4 for Learning to Augment via Implicit Differentiation for Domain Generalization
Viaarxiv icon