Picture for Chong Luo

Chong Luo

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Add code
Jun 13, 2024
Figure 1 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 2 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 3 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 4 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Viaarxiv icon

MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion

Add code
May 30, 2024
Viaarxiv icon

Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild

Add code
Apr 29, 2024
Viaarxiv icon

OmniVid: A Generative Framework for Universal Video Understanding

Add code
Mar 26, 2024
Figure 1 for OmniVid: A Generative Framework for Universal Video Understanding
Figure 2 for OmniVid: A Generative Framework for Universal Video Understanding
Figure 3 for OmniVid: A Generative Framework for Universal Video Understanding
Figure 4 for OmniVid: A Generative Framework for Universal Video Understanding
Viaarxiv icon

Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

Add code
Mar 14, 2024
Figure 1 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Figure 2 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Figure 3 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Figure 4 for Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Viaarxiv icon

Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs

Add code
Dec 12, 2023
Figure 1 for Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs
Figure 2 for Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs
Figure 3 for Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs
Figure 4 for Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs
Viaarxiv icon

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

Add code
Nov 30, 2023
Figure 1 for MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Figure 2 for MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Figure 3 for MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Figure 4 for MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Viaarxiv icon

ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models

Add code
Nov 30, 2023
Figure 1 for ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Figure 2 for ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Figure 3 for ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Figure 4 for ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Viaarxiv icon

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

Add code
Nov 28, 2023
Figure 1 for Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Figure 2 for Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Figure 3 for Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Figure 4 for Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Viaarxiv icon

CCEdit: Creative and Controllable Video Editing via Diffusion Models

Add code
Sep 28, 2023
Figure 1 for CCEdit: Creative and Controllable Video Editing via Diffusion Models
Figure 2 for CCEdit: Creative and Controllable Video Editing via Diffusion Models
Figure 3 for CCEdit: Creative and Controllable Video Editing via Diffusion Models
Figure 4 for CCEdit: Creative and Controllable Video Editing via Diffusion Models
Viaarxiv icon