Picture for Yi Yang

Yi Yang

The Hong Kong University of Science and Technology, Hong Kong SAR, China

FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention

Add code
Jul 29, 2024
Viaarxiv icon

PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning

Add code
Jul 24, 2024
Figure 1 for PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning
Figure 2 for PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning
Figure 3 for PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning
Figure 4 for PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning
Viaarxiv icon

Navigation Instruction Generation with BEV Perception and Large Language Models

Add code
Jul 21, 2024
Figure 1 for Navigation Instruction Generation with BEV Perception and Large Language Models
Figure 2 for Navigation Instruction Generation with BEV Perception and Large Language Models
Figure 3 for Navigation Instruction Generation with BEV Perception and Large Language Models
Figure 4 for Navigation Instruction Generation with BEV Perception and Large Language Models
Viaarxiv icon

Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion

Add code
Jul 15, 2024
Figure 1 for Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Figure 2 for Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Figure 3 for Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Figure 4 for Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
Viaarxiv icon

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Add code
Jul 14, 2024
Figure 1 for Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data
Figure 2 for Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data
Figure 3 for Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data
Figure 4 for Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data
Viaarxiv icon

VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation

Add code
Jul 13, 2024
Figure 1 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Figure 2 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Figure 3 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Figure 4 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Viaarxiv icon

Nonverbal Interaction Detection

Add code
Jul 11, 2024
Figure 1 for Nonverbal Interaction Detection
Figure 2 for Nonverbal Interaction Detection
Figure 3 for Nonverbal Interaction Detection
Figure 4 for Nonverbal Interaction Detection
Viaarxiv icon

Controllable Navigation Instruction Generation with Chain of Thought Prompting

Add code
Jul 10, 2024
Viaarxiv icon

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Add code
Jul 10, 2024
Figure 1 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 2 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 3 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 4 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Viaarxiv icon

General and Task-Oriented Video Segmentation

Add code
Jul 09, 2024
Figure 1 for General and Task-Oriented Video Segmentation
Figure 2 for General and Task-Oriented Video Segmentation
Figure 3 for General and Task-Oriented Video Segmentation
Figure 4 for General and Task-Oriented Video Segmentation
Viaarxiv icon