Chaoyang Zhao

Foundation Model Research Center, Institute of Automation, Chinese Academy of Sciences, objecteye.Inc

GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation

Oct 08, 2025

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability

Mar 13, 2025

LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning

Mar 11, 2025

Mitigating Hallucination in Visual Language Models with Visual Supervision

Nov 27, 2023

ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground Selection

Mar 26, 2023

Efficient Masked Autoencoders with Self-Consistency

Feb 28, 2023

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks

Sep 28, 2022

Transfering Low-Frequency Features for Domain Adaptation

Aug 31, 2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training

Mar 14, 2022

Pruning-aware Sparse Regularization for Network Pruning

Jan 18, 2022