Picture for Liang-Chieh Chen

Liang-Chieh Chen

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models

Add code
Jun 13, 2024
Figure 1 for Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Figure 2 for Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Figure 3 for Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Figure 4 for Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Viaarxiv icon

An Image is Worth 32 Tokens for Reconstruction and Generation

Add code
Jun 11, 2024
Viaarxiv icon

Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting

Add code
Jun 04, 2024
Viaarxiv icon

COCONut: Modernizing COCO Segmentation

Add code
Apr 12, 2024
Viaarxiv icon

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

Add code
Apr 03, 2024
Viaarxiv icon

SPFormer: Enhancing Vision Transformer with Superpixel Representation

Add code
Jan 05, 2024
Viaarxiv icon

MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation

Add code
Dec 11, 2023
Figure 1 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 2 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 3 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 4 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Viaarxiv icon

MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Add code
Nov 30, 2023
Figure 1 for MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation
Figure 2 for MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation
Figure 3 for MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation
Figure 4 for MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation
Viaarxiv icon

Towards Open-Ended Visual Recognition with Large Language Model

Add code
Nov 14, 2023
Figure 1 for Towards Open-Ended Visual Recognition with Large Language Model
Figure 2 for Towards Open-Ended Visual Recognition with Large Language Model
Figure 3 for Towards Open-Ended Visual Recognition with Large Language Model
Figure 4 for Towards Open-Ended Visual Recognition with Large Language Model
Viaarxiv icon

PolyMaX: General Dense Prediction with Mask Transformer

Add code
Nov 09, 2023
Figure 1 for PolyMaX: General Dense Prediction with Mask Transformer
Figure 2 for PolyMaX: General Dense Prediction with Mask Transformer
Figure 3 for PolyMaX: General Dense Prediction with Mask Transformer
Figure 4 for PolyMaX: General Dense Prediction with Mask Transformer
Viaarxiv icon