Chao Dong

DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation

Jun 05, 2025
Via arXiv

Semantics-Aware Human Motion Generation from Audio Instructions

May 29, 2025
Via arXiv

SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and $\mathcal{O}(T)$ Complexity

May 15, 2025
Via arXiv

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Apr 08, 2025
Via arXiv

TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting

Apr 01, 2025
Via arXiv

UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models

Mar 21, 2025
Via arXiv

Joint ADS-B in B5G for Hierarchical UAV Networks: Performance Analysis and MEC Based Optimization

Mar 18, 2025
Via arXiv

Revisiting the Generalization Problem of Low-level Vision Models Through the Lens of Image Deraining

Feb 18, 2025
Via arXiv

Generative AI-Enhanced Cooperative MEC of UAVs and Ground Stations for Unmanned Surface Vehicles

Feb 12, 2025
Via arXiv

Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Jan 20, 2025
Via arXiv