Picture for Yuan Zhang

Yuan Zhang

PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Add code
Jul 19, 2024
Viaarxiv icon

4Dynamic: Text-to-4D Generation with Hybrid Priors

Add code
Jul 17, 2024
Viaarxiv icon

Latent Linear Quadratic Regulator for Robotic Control Tasks

Add code
Jul 15, 2024
Viaarxiv icon

DSCENet: Dynamic Screening and Clinical-Enhanced Multimodal Fusion for MPNs Subtype Classification

Add code
Jul 11, 2024
Viaarxiv icon

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Add code
Jul 03, 2024
Viaarxiv icon

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding

Add code
Jun 17, 2024
Viaarxiv icon

Artemis: Towards Referential Understanding in Complex Videos

Add code
Jun 01, 2024
Figure 1 for Artemis: Towards Referential Understanding in Complex Videos
Figure 2 for Artemis: Towards Referential Understanding in Complex Videos
Figure 3 for Artemis: Towards Referential Understanding in Complex Videos
Figure 4 for Artemis: Towards Referential Understanding in Complex Videos
Viaarxiv icon

Benchmarking and Improving Detail Image Caption

Add code
May 29, 2024
Figure 1 for Benchmarking and Improving Detail Image Caption
Figure 2 for Benchmarking and Improving Detail Image Caption
Figure 3 for Benchmarking and Improving Detail Image Caption
Figure 4 for Benchmarking and Improving Detail Image Caption
Viaarxiv icon

SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance

Add code
May 24, 2024
Viaarxiv icon

Unveiling the Tapestry of Consistency in Large Vision-Language Models

Add code
May 23, 2024
Figure 1 for Unveiling the Tapestry of Consistency in Large Vision-Language Models
Figure 2 for Unveiling the Tapestry of Consistency in Large Vision-Language Models
Figure 3 for Unveiling the Tapestry of Consistency in Large Vision-Language Models
Figure 4 for Unveiling the Tapestry of Consistency in Large Vision-Language Models
Viaarxiv icon