
Yuyin Zhou

Causal Image Modeling for Efficient Visual Understanding

Oct 10, 2024

Story-Adapter: A Training-free Iterative Framework for Long Story Visualization

Oct 08, 2024

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Aug 06, 2024

What If We Recaption Billions of Web Images with LLaMA-3?

Jun 12, 2024

DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor

Jun 12, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

Jun 08, 2024

Scaling White-Box Transformers for Vision

Jun 03, 2024

Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

May 24, 2024

Mamba-R: Vision Mamba ALSO Needs Registers

May 23, 2024

Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

May 23, 2024