Picture for Shiqi Wang

Shiqi Wang

Meta

When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding

Add code
Aug 15, 2024
Figure 1 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Figure 2 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Figure 3 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Figure 4 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Viaarxiv icon

High Efficiency Image Compression for Large Visual-Language Models

Add code
Jul 24, 2024
Viaarxiv icon

DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor

Add code
Jun 12, 2024
Figure 1 for DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor
Figure 2 for DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor
Figure 3 for DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor
Figure 4 for DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor
Viaarxiv icon

Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics

Add code
May 29, 2024
Figure 1 for Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics
Figure 2 for Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics
Figure 3 for Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics
Figure 4 for Opinion-Unaware Blind Image Quality Assessment using Multi-Scale Deep Feature Statistics
Viaarxiv icon

Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

Add code
May 29, 2024
Figure 1 for Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
Figure 2 for Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
Figure 3 for Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
Figure 4 for Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
Viaarxiv icon

Training LLMs to Better Self-Debug and Explain Code

Add code
May 28, 2024
Figure 1 for Training LLMs to Better Self-Debug and Explain Code
Figure 2 for Training LLMs to Better Self-Debug and Explain Code
Figure 3 for Training LLMs to Better Self-Debug and Explain Code
Figure 4 for Training LLMs to Better Self-Debug and Explain Code
Viaarxiv icon

Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method

Add code
May 07, 2024
Viaarxiv icon

Gradient-Congruity Guided Federated Sparse Training

Add code
May 02, 2024
Figure 1 for Gradient-Congruity Guided Federated Sparse Training
Figure 2 for Gradient-Congruity Guided Federated Sparse Training
Figure 3 for Gradient-Congruity Guided Federated Sparse Training
Figure 4 for Gradient-Congruity Guided Federated Sparse Training
Viaarxiv icon

FakeBench: Uncover the Achilles' Heels of Fake Images with Large Multimodal Models

Add code
Apr 20, 2024
Figure 1 for FakeBench: Uncover the Achilles' Heels of Fake Images with Large Multimodal Models
Figure 2 for FakeBench: Uncover the Achilles' Heels of Fake Images with Large Multimodal Models
Figure 3 for FakeBench: Uncover the Achilles' Heels of Fake Images with Large Multimodal Models
Figure 4 for FakeBench: Uncover the Achilles' Heels of Fake Images with Large Multimodal Models
Viaarxiv icon

CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting

Add code
Apr 15, 2024
Viaarxiv icon