Picture for Difei Gao

Difei Gao

ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation

Add code
Jan 01, 2024
Viaarxiv icon

ViT-Lens-2: Gateway to Omni-modal Intelligence

Add code
Nov 27, 2023
Figure 1 for ViT-Lens-2: Gateway to Omni-modal Intelligence
Figure 2 for ViT-Lens-2: Gateway to Omni-modal Intelligence
Figure 3 for ViT-Lens-2: Gateway to Omni-modal Intelligence
Figure 4 for ViT-Lens-2: Gateway to Omni-modal Intelligence
Viaarxiv icon

CVPR 2023 Text Guided Video Editing Competition

Add code
Oct 24, 2023
Viaarxiv icon

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Add code
Sep 27, 2023
Viaarxiv icon

Recap: Detecting Deepfake Video with Unpredictable Tampered Traces via Recovering Faces and Mapping Recovered Faces

Add code
Aug 19, 2023
Viaarxiv icon

UniVTG: Towards Unified Video-Language Temporal Grounding

Add code
Aug 18, 2023
Viaarxiv icon

AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn

Add code
Jun 28, 2023
Viaarxiv icon

GroundNLQ @ Ego4D Natural Language Queries Challenge 2023

Add code
Jun 27, 2023
Viaarxiv icon

Affordance Grounding from Demonstration Video to Target Image

Add code
Mar 26, 2023
Viaarxiv icon

DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection

Add code
Mar 03, 2023
Figure 1 for DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection
Figure 2 for DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection
Figure 3 for DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection
Figure 4 for DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection
Viaarxiv icon