Picture for De-An Huang

De-An Huang

What is Point Supervision Worth in Video Instance Segmentation?

Add code
Apr 01, 2024
Figure 1 for What is Point Supervision Worth in Video Instance Segmentation?
Figure 2 for What is Point Supervision Worth in Video Instance Segmentation?
Figure 3 for What is Point Supervision Worth in Video Instance Segmentation?
Figure 4 for What is Point Supervision Worth in Video Instance Segmentation?
Viaarxiv icon

LITA: Language Instructed Temporal-Localization Assistant

Add code
Mar 27, 2024
Viaarxiv icon

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

Add code
Mar 21, 2024
Viaarxiv icon

T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching

Add code
Feb 21, 2024
Viaarxiv icon

Deep Multimodal Fusion for Surgical Feedback Classification

Add code
Dec 06, 2023
Viaarxiv icon

Eureka: Human-Level Reward Design via Coding Large Language Models

Add code
Oct 19, 2023
Figure 1 for Eureka: Human-Level Reward Design via Coding Large Language Models
Figure 2 for Eureka: Human-Level Reward Design via Coding Large Language Models
Figure 3 for Eureka: Human-Level Reward Design via Coding Large Language Models
Figure 4 for Eureka: Human-Level Reward Design via Coding Large Language Models
Viaarxiv icon

Differentially Private Video Activity Recognition

Add code
Jun 27, 2023
Figure 1 for Differentially Private Video Activity Recognition
Figure 2 for Differentially Private Video Activity Recognition
Figure 3 for Differentially Private Video Activity Recognition
Figure 4 for Differentially Private Video Activity Recognition
Viaarxiv icon

PerAda: Parameter-Efficient and Generalizable Federated Learning Personalization with Guarantees

Add code
Feb 13, 2023
Viaarxiv icon

I$^2$SB: Image-to-Image Schrödinger Bridge

Add code
Feb 12, 2023
Viaarxiv icon

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning

Add code
Feb 09, 2023
Figure 1 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Figure 2 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Figure 3 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Figure 4 for Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
Viaarxiv icon