"Image": models, code, and papers

Text-guided Explorable Image Super-resolution

Mar 02, 2024
Kanchana Vaishnavi Gandikota, Paramanand Chandramouli

Via arXiv

ICC: Quantifying Image Caption Concreteness for Multimodal Dataset Curation

Mar 02, 2024
Moran Yanuka, Morris Alper, Hadar Averbuch-Elor, Raja Giryes

Via arXiv

Order-One Rolling Shutter Cameras

Mar 17, 2024
Marvin Anas Hahn, Kathlén Kohn, Orlando Marigliano, Tomas Pajdla

Via arXiv

LightIt: Illumination Modeling and Control for Diffusion Models

Mar 15, 2024
Peter Kocsis, Julien Philip, Kalyan Sunkavalli, Matthias Nießner, Yannick Hold-Geoffroy

Via arXiv

Learning to Project for Cross-Task Knowledge Distillation

Mar 21, 2024
Dylan Auty, Roy Miles, Benedikt Kolbeinsson, Krystian Mikolajczyk

Via arXiv

On the Concept Trustworthiness in Concept Bottleneck Models

Mar 21, 2024
Qihan Huang, Jie Song, Jingwen Hu, Haofei Zhang, Yong Wang, Mingli Song

Via arXiv

Lightator: An Optical Near-Sensor Accelerator with Compressive Acquisition Enabling Versatile Image Processing

Mar 08, 2024
Mehrdad Morsali, Brendan Reidy, Deniz Najafi, Sepehr Tabrizchi, Mohsen Imani, Mahdi Nikdast, Arman Roohi, Ramtin Zand, Shaahin Angizi

Via arXiv

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Mar 05, 2024
Weizhi Wang, Khalil Mrini, Linjie Yang, Sateesh Kumar, Yu Tian, Xifeng Yan, Heng Wang

Via arXiv

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Mar 18, 2024
Ruyi Xu, Yuan Yao, Zonghao Guo, Junbo Cui, Zanlin Ni, Chunjiang Ge, Tat-Seng Chua, Zhiyuan Liu, Maosong Sun, Gao Huang

Via arXiv

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Mar 18, 2024
Zixin Zhu, Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua

Via arXiv