Picture for Zhenhong Sun

Zhenhong Sun

Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision

Add code
Apr 22, 2025
Viaarxiv icon

Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers

Add code
Apr 14, 2025
Viaarxiv icon

T$^3$-S2S: Training-free Triplet Tuning for Sketch to Scene Generation

Add code
Dec 18, 2024
Viaarxiv icon

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

Add code
Mar 08, 2024
Viaarxiv icon

Learning Informative Latent Representation for Quantum State Tomography

Add code
Sep 30, 2023
Viaarxiv icon

Attention-Based Transformer Networks for Quantum State Tomography

Add code
May 09, 2023
Viaarxiv icon

Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition

Add code
Mar 05, 2023
Viaarxiv icon

Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search

Add code
Nov 26, 2021
Figure 1 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Figure 2 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Figure 3 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Figure 4 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Viaarxiv icon

Interpolation variable rate image compression

Add code
Sep 20, 2021
Figure 1 for Interpolation variable rate image compression
Figure 2 for Interpolation variable rate image compression
Figure 3 for Interpolation variable rate image compression
Figure 4 for Interpolation variable rate image compression
Viaarxiv icon

Spatiotemporal Entropy Model is All You Need for Learned Video Compression

Add code
Apr 13, 2021
Figure 1 for Spatiotemporal Entropy Model is All You Need for Learned Video Compression
Figure 2 for Spatiotemporal Entropy Model is All You Need for Learned Video Compression
Figure 3 for Spatiotemporal Entropy Model is All You Need for Learned Video Compression
Figure 4 for Spatiotemporal Entropy Model is All You Need for Learned Video Compression
Viaarxiv icon