Cheng Shi

Joint Graph Rewiring and Feature Denoising via Spectral Resonance

Aug 13, 2024

A spring-block theory of feature learning in deep neural networks

Jul 28, 2024

Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention

Jul 23, 2024

Part2Object: Hierarchical Unsupervised 3D Instance Segmentation

Jul 14, 2024

Plain-Det: A Plain Multi-Dataset Object Detector

Jul 14, 2024

The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models

Apr 18, 2024

Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator

Sep 25, 2023

LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models

Sep 22, 2023

Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding

Sep 03, 2023

EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment

Sep 03, 2023