Daquan Zhou

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

May 02, 2024

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Apr 29, 2024

Chain of Thought Explanation for Dialogue State Tracking

Mar 09, 2024

Sora Generates Videos with Stunning Geometrical Consistency

Feb 27, 2024

Magic-Me: Identity-Specific Video Customized Diffusion

Feb 14, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

Jan 09, 2024

Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost

Dec 14, 2023

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration

Nov 16, 2023

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome

Nov 12, 2023

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Nov 12, 2023