Picture for Kai Han

Kai Han

and Other Contributors

SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution

Add code
Feb 27, 2024
Viaarxiv icon

Assortment Planning with Sponsored Products

Add code
Feb 09, 2024
Viaarxiv icon

Data-efficient Large Vision Models through Sequential Autoregression

Add code
Feb 07, 2024
Figure 1 for Data-efficient Large Vision Models through Sequential Autoregression
Figure 2 for Data-efficient Large Vision Models through Sequential Autoregression
Figure 3 for Data-efficient Large Vision Models through Sequential Autoregression
Figure 4 for Data-efficient Large Vision Models through Sequential Autoregression
Viaarxiv icon

Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Add code
Feb 06, 2024
Viaarxiv icon

Rethinking Optimization and Architecture for Tiny Language Models

Add code
Feb 06, 2024
Figure 1 for Rethinking Optimization and Architecture for Tiny Language Models
Figure 2 for Rethinking Optimization and Architecture for Tiny Language Models
Figure 3 for Rethinking Optimization and Architecture for Tiny Language Models
Figure 4 for Rethinking Optimization and Architecture for Tiny Language Models
Viaarxiv icon

A Survey on Transformer Compression

Add code
Feb 05, 2024
Figure 1 for A Survey on Transformer Compression
Figure 2 for A Survey on Transformer Compression
Figure 3 for A Survey on Transformer Compression
Figure 4 for A Survey on Transformer Compression
Viaarxiv icon

FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition

Add code
Feb 05, 2024
Viaarxiv icon

Large OCR Model:An Empirical Study of Scaling Law for OCR

Add code
Jan 02, 2024
Viaarxiv icon

PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation

Add code
Dec 27, 2023
Figure 1 for PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
Figure 2 for PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
Figure 3 for PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
Figure 4 for PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
Viaarxiv icon

LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models

Add code
Dec 01, 2023
Viaarxiv icon