Picture for Zheng-Jun Zha

Zheng-Jun Zha

University of Science and Technology of China

ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization

Add code
Oct 14, 2024
Figure 1 for ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
Figure 2 for ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
Figure 3 for ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
Figure 4 for ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
Viaarxiv icon

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Add code
Oct 07, 2024
Figure 1 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 2 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 3 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 4 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Viaarxiv icon

Grounding 3D Scene Affordance From Egocentric Interactions

Add code
Sep 29, 2024
Figure 1 for Grounding 3D Scene Affordance From Egocentric Interactions
Figure 2 for Grounding 3D Scene Affordance From Egocentric Interactions
Figure 3 for Grounding 3D Scene Affordance From Egocentric Interactions
Figure 4 for Grounding 3D Scene Affordance From Egocentric Interactions
Viaarxiv icon

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Add code
Sep 25, 2024
Figure 1 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Figure 2 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Figure 3 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Figure 4 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Viaarxiv icon

QMambaBSR: Burst Image Super-Resolution with Query State Space Model

Add code
Aug 16, 2024
Figure 1 for QMambaBSR: Burst Image Super-Resolution with Query State Space Model
Figure 2 for QMambaBSR: Burst Image Super-Resolution with Query State Space Model
Figure 3 for QMambaBSR: Burst Image Super-Resolution with Query State Space Model
Figure 4 for QMambaBSR: Burst Image Super-Resolution with Query State Space Model
Viaarxiv icon

Downstream-Pretext Domain Knowledge Traceback for Active Learning

Add code
Jul 20, 2024
Figure 1 for Downstream-Pretext Domain Knowledge Traceback for Active Learning
Figure 2 for Downstream-Pretext Domain Knowledge Traceback for Active Learning
Figure 3 for Downstream-Pretext Domain Knowledge Traceback for Active Learning
Figure 4 for Downstream-Pretext Domain Knowledge Traceback for Active Learning
Viaarxiv icon

FC3DNet: A Fully Connected Encoder-Decoder for Efficient Demoir'eing

Add code
Jun 21, 2024
Viaarxiv icon

DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera

Add code
Jun 12, 2024
Figure 1 for DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera
Figure 2 for DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera
Figure 3 for DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera
Figure 4 for DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera
Viaarxiv icon

Towards Realistic Data Generation for Real-World Super-Resolution

Add code
Jun 12, 2024
Figure 1 for Towards Realistic Data Generation for Real-World Super-Resolution
Figure 2 for Towards Realistic Data Generation for Real-World Super-Resolution
Figure 3 for Towards Realistic Data Generation for Real-World Super-Resolution
Figure 4 for Towards Realistic Data Generation for Real-World Super-Resolution
Viaarxiv icon

Context-aware Difference Distilling for Multi-change Captioning

Add code
May 31, 2024
Viaarxiv icon