Picture for Weisi Lin

Weisi Lin

MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model

Add code
Feb 29, 2024
Figure 1 for MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model
Figure 2 for MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model
Figure 3 for MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model
Figure 4 for MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model
Viaarxiv icon

Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction

Add code
Feb 29, 2024
Figure 1 for Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction
Figure 2 for Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction
Figure 3 for Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction
Figure 4 for Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction
Viaarxiv icon

A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs

Add code
Feb 11, 2024
Viaarxiv icon

AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception

Add code
Jan 16, 2024
Viaarxiv icon

Towards A Better Metric for Text-to-Video Generation

Add code
Jan 15, 2024
Viaarxiv icon

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

Add code
Jan 02, 2024
Viaarxiv icon

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels

Add code
Dec 28, 2023
Figure 1 for Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Figure 2 for Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Figure 3 for Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Figure 4 for Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Viaarxiv icon

Q-Boost: On Visual Quality Assessment Ability of Low-level Multi-Modality Foundation Models

Add code
Dec 23, 2023
Viaarxiv icon

Iterative Token Evaluation and Refinement for Real-World Super-Resolution

Add code
Dec 09, 2023
Figure 1 for Iterative Token Evaluation and Refinement for Real-World Super-Resolution
Figure 2 for Iterative Token Evaluation and Refinement for Real-World Super-Resolution
Figure 3 for Iterative Token Evaluation and Refinement for Real-World Super-Resolution
Figure 4 for Iterative Token Evaluation and Refinement for Real-World Super-Resolution
Viaarxiv icon

Enhancing Diffusion Models with Text-Encoder Reinforcement Learning

Add code
Nov 27, 2023
Figure 1 for Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Figure 2 for Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Figure 3 for Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Figure 4 for Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Viaarxiv icon