Zhenyu He

GRAPE: Generalizable and Robust Multi-view Facial Capture

Jul 14, 2024

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Jul 10, 2024

Let the Code LLM Edit Itself When You Edit the Code

Jul 03, 2024

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Jun 24, 2024

1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation

Jun 07, 2024

Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models

May 17, 2024

Spatial-Temporal Multi-level Association for Video Object Segmentation

Apr 09, 2024

RTracker: Recoverable Tracking via PN Tree Structured Memory

Mar 28, 2024

Do Efficient Transformers Really Save Computation?

Feb 21, 2024

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Jan 29, 2024