Picture for Tongda Xu

Tongda Xu

Benchmarking and Enhancing VLM for Compressed Image Understanding

Add code
Dec 24, 2025
Figure 1 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Figure 2 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Figure 3 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Figure 4 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Viaarxiv icon

GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting

Add code
Dec 22, 2025
Viaarxiv icon

Optimizing Input of Denoising Score Matching is Biased Towards Higher Score Norm

Add code
Nov 13, 2025
Viaarxiv icon

Generative AI Meets 6G and Beyond: Diffusion Models for Semantic Communications

Add code
Nov 11, 2025
Viaarxiv icon

V-Shuffle: Zero-Shot Style Transfer via Value Shuffle

Add code
Nov 09, 2025
Figure 1 for V-Shuffle: Zero-Shot Style Transfer via Value Shuffle
Figure 2 for V-Shuffle: Zero-Shot Style Transfer via Value Shuffle
Figure 3 for V-Shuffle: Zero-Shot Style Transfer via Value Shuffle
Figure 4 for V-Shuffle: Zero-Shot Style Transfer via Value Shuffle
Viaarxiv icon

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

Add code
May 09, 2025
Figure 1 for PICD: Versatile Perceptual Image Compression with Diffusion Rendering
Figure 2 for PICD: Versatile Perceptual Image Compression with Diffusion Rendering
Figure 3 for PICD: Versatile Perceptual Image Compression with Diffusion Rendering
Figure 4 for PICD: Versatile Perceptual Image Compression with Diffusion Rendering
Viaarxiv icon

CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query

Add code
Feb 26, 2025
Figure 1 for CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
Figure 2 for CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
Figure 3 for CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
Figure 4 for CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
Viaarxiv icon

Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior

Add code
Jan 31, 2025
Viaarxiv icon

MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes

Add code
Oct 17, 2024
Viaarxiv icon

Task-Aware Encoder Control for Deep Video Compression

Add code
Apr 07, 2024
Figure 1 for Task-Aware Encoder Control for Deep Video Compression
Figure 2 for Task-Aware Encoder Control for Deep Video Compression
Figure 3 for Task-Aware Encoder Control for Deep Video Compression
Figure 4 for Task-Aware Encoder Control for Deep Video Compression
Viaarxiv icon