Picture for Hehe Fan

Hehe Fan

VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation

Add code
Jul 13, 2024
Viaarxiv icon

Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models

Add code
May 24, 2024
Viaarxiv icon

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

Add code
May 22, 2024
Figure 1 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Figure 2 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Figure 3 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Figure 4 for TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
Viaarxiv icon

Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

Add code
Apr 30, 2024
Viaarxiv icon

Clustering for Protein Representation Learning

Add code
Mar 30, 2024
Figure 1 for Clustering for Protein Representation Learning
Figure 2 for Clustering for Protein Representation Learning
Figure 3 for Clustering for Protein Representation Learning
Figure 4 for Clustering for Protein Representation Learning
Viaarxiv icon

EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing

Add code
Mar 24, 2024
Viaarxiv icon

ProtChatGPT: Towards Understanding Proteins with Large Language Models

Add code
Feb 15, 2024
Viaarxiv icon

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Add code
Feb 09, 2024
Figure 1 for HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Figure 2 for HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Figure 3 for HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Figure 4 for HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Viaarxiv icon

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling

Add code
Jan 29, 2024
Viaarxiv icon

DocMSU: A Comprehensive Benchmark for Document-level Multimodal Sarcasm Understanding

Add code
Dec 26, 2023
Viaarxiv icon