Daquan Zhou

MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation

Dec 16, 2024

HunyuanVideo: A Systematic Framework For Large Video Generative Models

Dec 03, 2024

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Oct 14, 2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Oct 03, 2024

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

May 02, 2024

PLLaVA: Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

Apr 29, 2024

Chain of Thought Explanation for Dialogue State Tracking

Mar 09, 2024

Sora Generates Videos with Stunning Geometrical Consistency

Feb 27, 2024

Magic-Me: Identity-Specific Video Customized Diffusion

Feb 14, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

Jan 09, 2024