Zhibo Chen

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Dec 13, 2024

Light Field Image Quality Assessment With Auxiliary Learning Based on Depthwise and Anglewise Separable Convolutions

Dec 10, 2024

UniMIC: Towards Universal Multi-modality Perceptual Image Compression

Dec 09, 2024

LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents

Dec 05, 2024

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Nov 15, 2024

Towards Defining an Efficient and Expandable File Format for AI-Generated Contents

Oct 15, 2024

Compositional 3D-aware Video Generation with LLM Director

Aug 31, 2024

MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs

Aug 21, 2024

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs

Aug 16, 2024

Rethinking Domain Adaptation and Generalization in the Era of CLIP

Jul 21, 2024