Picture for Chao Dong

Chao Dong

How far have we gone in Generative Image Restoration? A study on its capability, limitations and evaluation practices

Add code
Mar 05, 2026
Viaarxiv icon

Position: Evaluation of Visual Processing Should Be Human-Centered, Not Metric-Centered

Add code
Feb 28, 2026
Viaarxiv icon

Prompt-Driven Low-Altitude Edge Intelligence: Modular Agents and Generative Reasoning

Add code
Feb 15, 2026
Viaarxiv icon

Harnessing Diffusion-Yielded Score Priors for Image Restoration

Add code
Jul 28, 2025
Viaarxiv icon

SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution

Add code
Jun 24, 2025
Viaarxiv icon

Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels

Add code
Jun 18, 2025
Figure 1 for Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels
Figure 2 for Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels
Figure 3 for Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels
Figure 4 for Joint Computation Offloading and Resource Allocation for Uncertain Maritime MEC via Cooperation of UAVs and Vessels
Viaarxiv icon

DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation

Add code
Jun 05, 2025
Viaarxiv icon

Semantics-Aware Human Motion Generation from Audio Instructions

Add code
May 29, 2025
Viaarxiv icon

SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity

Add code
May 15, 2025
Figure 1 for SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity
Figure 2 for SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity
Figure 3 for SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity
Figure 4 for SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity
Viaarxiv icon

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Add code
Apr 08, 2025
Viaarxiv icon