Picture for Yuan Zhang

Yuan Zhang

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Add code
Jul 03, 2024
Viaarxiv icon

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding

Add code
Jun 17, 2024
Viaarxiv icon

Artemis: Towards Referential Understanding in Complex Videos

Add code
Jun 01, 2024
Figure 1 for Artemis: Towards Referential Understanding in Complex Videos
Figure 2 for Artemis: Towards Referential Understanding in Complex Videos
Figure 3 for Artemis: Towards Referential Understanding in Complex Videos
Figure 4 for Artemis: Towards Referential Understanding in Complex Videos
Viaarxiv icon

Benchmarking and Improving Detail Image Caption

Add code
May 29, 2024
Figure 1 for Benchmarking and Improving Detail Image Caption
Figure 2 for Benchmarking and Improving Detail Image Caption
Figure 3 for Benchmarking and Improving Detail Image Caption
Figure 4 for Benchmarking and Improving Detail Image Caption
Viaarxiv icon

SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance

Add code
May 24, 2024
Viaarxiv icon

Unveiling the Tapestry of Consistency in Large Vision-Language Models

Add code
May 23, 2024
Viaarxiv icon

A rapid approach to urban traffic noise mapping with a generative adversarial network

Add code
May 21, 2024
Viaarxiv icon

UDUC: An Uncertainty-driven Approach for Learning-based Robust Control

Add code
May 04, 2024
Viaarxiv icon

Multimodal Emotion Recognition by Fusing Video Semantic in MOOC Learning Scenarios

Add code
Apr 11, 2024
Viaarxiv icon

ReALM: Reference Resolution As Language Modeling

Add code
Mar 29, 2024
Viaarxiv icon