Picture for Kaiyang Zhou

Kaiyang Zhou

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

Add code
May 20, 2025
Viaarxiv icon

Training-Free Watermarking for Autoregressive Image Generation

Add code
May 20, 2025
Viaarxiv icon

Fine-tuning Quantized Neural Networks with Zeroth-order Optimization

Add code
May 19, 2025
Viaarxiv icon

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

Add code
Jan 30, 2025
Viaarxiv icon

4D Panoptic Scene Graph Generation

Add code
May 16, 2024
Figure 1 for 4D Panoptic Scene Graph Generation
Figure 2 for 4D Panoptic Scene Graph Generation
Figure 3 for 4D Panoptic Scene Graph Generation
Figure 4 for 4D Panoptic Scene Graph Generation
Viaarxiv icon

Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

Add code
Mar 26, 2024
Viaarxiv icon

Open-Vocabulary Calibration for Vision-Language Models

Add code
Feb 15, 2024
Figure 1 for Open-Vocabulary Calibration for Vision-Language Models
Figure 2 for Open-Vocabulary Calibration for Vision-Language Models
Figure 3 for Open-Vocabulary Calibration for Vision-Language Models
Figure 4 for Open-Vocabulary Calibration for Vision-Language Models
Viaarxiv icon

Panoptic Video Scene Graph Generation

Add code
Nov 28, 2023
Viaarxiv icon

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Add code
Oct 12, 2023
Viaarxiv icon

OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection

Add code
Jun 17, 2023
Viaarxiv icon