Picture for Tao Zhang

Tao Zhang

Cooperative Causal GraphSAGE

Add code
May 20, 2025
Viaarxiv icon

EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation

Add code
May 20, 2025
Viaarxiv icon

Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses

Add code
May 19, 2025
Viaarxiv icon

On Path to Multimodal Generalist: General-Level and General-Bench

Add code
May 07, 2025
Viaarxiv icon

Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance

Add code
May 04, 2025
Figure 1 for Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Figure 2 for Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Figure 3 for Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Figure 4 for Unaligned RGB Guided Hyperspectral Image Super-Resolution with Spatial-Spectral Concordance
Viaarxiv icon

LL-Gaussian: Low-Light Scene Reconstruction and Enhancement via Gaussian Splatting for Novel View Synthesis

Add code
Apr 15, 2025
Viaarxiv icon

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

Add code
Apr 15, 2025
Figure 1 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 2 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 3 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 4 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Viaarxiv icon

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Add code
Apr 14, 2025
Figure 1 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Figure 2 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Figure 3 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Figure 4 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Viaarxiv icon

MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation

Add code
Apr 11, 2025
Viaarxiv icon

An Empirical Study of GPT-4o Image Generation Capabilities

Add code
Apr 08, 2025
Figure 1 for An Empirical Study of GPT-4o Image Generation Capabilities
Figure 2 for An Empirical Study of GPT-4o Image Generation Capabilities
Figure 3 for An Empirical Study of GPT-4o Image Generation Capabilities
Figure 4 for An Empirical Study of GPT-4o Image Generation Capabilities
Viaarxiv icon