Yi Yang

The Hong Kong University of Science and Technology, Hong Kong SAR, China

DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval

Jan 19, 2024
Via arXiv

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models

Jan 16, 2024
Via arXiv

AntEval: Quantitatively Evaluating Informativeness and Expressiveness of Agent Social Interactions

Jan 12, 2024
Via arXiv

MS-DETR: Efficient DETR Training with Mixed Supervision

Jan 08, 2024
Via arXiv

GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields

Jan 02, 2024
Via arXiv

Model Stealing Attack against Recommender System

Dec 26, 2023
Via arXiv

Model Stealing Attack against Graph Classification with Authenticity, Uncertainty and Diversity

Dec 26, 2023
Via arXiv

SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance

Dec 26, 2023
Via arXiv

Human101: Training 100+FPS Human Gaussians in 100s from 1 View

Dec 23, 2023
Via arXiv

Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens

Dec 12, 2023
Via arXiv