Alert button

"Image": models, code, and papers
Alert button

DataComp: In search of the next generation of multimodal datasets

Add code
Bookmark button
Alert button
May 03, 2023
Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt

Figure 1 for DataComp: In search of the next generation of multimodal datasets
Figure 2 for DataComp: In search of the next generation of multimodal datasets
Figure 3 for DataComp: In search of the next generation of multimodal datasets
Figure 4 for DataComp: In search of the next generation of multimodal datasets
Viaarxiv icon

MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models

Add code
Bookmark button
Alert button
Mar 25, 2023
Jing Zhao, Heliang Zheng, Chaoyue Wang, Long Lan, Wenjing Yang

Figure 1 for MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Figure 2 for MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Figure 3 for MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Figure 4 for MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion Models
Viaarxiv icon

LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images

Add code
Bookmark button
Alert button
Apr 16, 2023
Hui Li, Tianyang Xu, Xiao-Jun Wu, Jiwen Lu, Josef Kittler

Figure 1 for LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images
Figure 2 for LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images
Figure 3 for LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images
Figure 4 for LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images
Viaarxiv icon

DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution

Add code
Bookmark button
Alert button
Jan 05, 2023
Xiang Li, Jinshan Pan, Jinhui Tang, Jiangxin Dong

Figure 1 for DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution
Figure 2 for DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution
Figure 3 for DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution
Figure 4 for DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution
Viaarxiv icon

Fully Sparse Fusion for 3D Object Detection

Add code
Bookmark button
Alert button
Apr 25, 2023
Yingyan Li, Lue Fan, Yang Liu, Zehao Huang, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang, Tieniu Tan

Figure 1 for Fully Sparse Fusion for 3D Object Detection
Figure 2 for Fully Sparse Fusion for 3D Object Detection
Figure 3 for Fully Sparse Fusion for 3D Object Detection
Figure 4 for Fully Sparse Fusion for 3D Object Detection
Viaarxiv icon

Context-Aware Classification of Legal Document Pages

Apr 25, 2023
Pavlos Fragkogiannis, Martina Forster, Grace E. Lee, Dell Zhang

Figure 1 for Context-Aware Classification of Legal Document Pages
Figure 2 for Context-Aware Classification of Legal Document Pages
Figure 3 for Context-Aware Classification of Legal Document Pages
Figure 4 for Context-Aware Classification of Legal Document Pages
Viaarxiv icon

RobCaps: Evaluating the Robustness of Capsule Networks against Affine Transformations and Adversarial Attacks

Apr 25, 2023
Alberto Marchisio, Antonio De Marco, Alessio Colucci, Maurizio Martina, Muhammad Shafique

Figure 1 for RobCaps: Evaluating the Robustness of Capsule Networks against Affine Transformations and Adversarial Attacks
Figure 2 for RobCaps: Evaluating the Robustness of Capsule Networks against Affine Transformations and Adversarial Attacks
Figure 3 for RobCaps: Evaluating the Robustness of Capsule Networks against Affine Transformations and Adversarial Attacks
Figure 4 for RobCaps: Evaluating the Robustness of Capsule Networks against Affine Transformations and Adversarial Attacks
Viaarxiv icon

iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer

Add code
Bookmark button
Alert button
Apr 25, 2023
Toshihiro Ota, Masato Taki

Figure 1 for iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer
Figure 2 for iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer
Figure 3 for iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer
Figure 4 for iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer
Viaarxiv icon

On the Use of Singular Value Decomposition as a Clutter Filter for Ultrasound Flow Imaging

Apr 25, 2023
Kai Riemer, Marcelo Lerendegui, Matthieu Toulemonde, Jiaqi Zhu, Christopher Dunsby, Peter D. Weinberg, Meng-Xing Tang

Figure 1 for On the Use of Singular Value Decomposition as a Clutter Filter for Ultrasound Flow Imaging
Figure 2 for On the Use of Singular Value Decomposition as a Clutter Filter for Ultrasound Flow Imaging
Figure 3 for On the Use of Singular Value Decomposition as a Clutter Filter for Ultrasound Flow Imaging
Figure 4 for On the Use of Singular Value Decomposition as a Clutter Filter for Ultrasound Flow Imaging
Viaarxiv icon

Text-guided Eyeglasses Manipulation with Spatial Constraints

Add code
Bookmark button
Alert button
Apr 25, 2023
Jiacheng Wang, Ping Liu, Jingen Liu, Wei Xu

Figure 1 for Text-guided Eyeglasses Manipulation with Spatial Constraints
Figure 2 for Text-guided Eyeglasses Manipulation with Spatial Constraints
Figure 3 for Text-guided Eyeglasses Manipulation with Spatial Constraints
Figure 4 for Text-guided Eyeglasses Manipulation with Spatial Constraints
Viaarxiv icon