
Zhangyang Wang

Atlas

Are Large Kernels Better Teachers than Transformers for ConvNets?

May 30, 2023
Via arXiv

Dynamic Sparsity Is Channel-Level Sparsity Learner

May 30, 2023
Via arXiv

Towards Constituting Mathematical Structures for Learning to Optimize

May 29, 2023
Via arXiv

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models

May 25, 2023
Via arXiv

POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference

May 25, 2023
Via arXiv

MMG-Ego4D: Multi-Modal Generalization in Egocentric Action Recognition

May 12, 2023
Via arXiv

Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation

May 08, 2023
Via arXiv

In-Context Learning Unlocked for Diffusion Models

May 01, 2023
Via arXiv

Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models

Apr 25, 2023
Via arXiv

AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection

Apr 12, 2023
Via arXiv