"Image": models, code, and papers

Inferring the Future by Imagining the Past

May 26, 2023
Kartik Chandra, Tony Chen, Tzu-Mao Li, Jonathan Ragan-Kelley, Josh Tenenbaum

Via arXiv

ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing

May 26, 2023
Min Zhao, Rongzhen Wang, Fan Bao, Chongxuan Li, Jun Zhu

Via arXiv

Manifold Regularization for Memory-Efficient Training of Deep Neural Networks

May 26, 2023
Shadi Sartipi, Edgar A. Bernal

Via arXiv

BIG-C: a Multimodal Multi-Purpose Dataset for Bemba

May 26, 2023
Claytone Sikasote, Eunice Mukonde, Md Mahfuz Ibn Alam, Antonios Anastasopoulos

Via arXiv

Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis

Feb 17, 2023
Haoran Sun, Yang Wang, Haipeng Liu, Biao Qian, Meng Wang

Via arXiv

Attacking Perceptual Similarity Metrics

May 15, 2023
Abhijay Ghildyal, Feng Liu

Via arXiv

Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models

May 15, 2023
Zhimin Chen, Bing Li

Via arXiv

CMSG Cross-Media Semantic-Graph Feature Matching Algorithm for Autonomous Vehicle Relocalization

May 15, 2023
Shuhang Tan, Hengyu Liu, Zhiling Wang

Via arXiv

RADIFUSION: A multi-radiomics deep learning based breast cancer risk prediction model using sequential mammographic images with image attention and bilateral asymmetry refinement

Apr 01, 2023
Hong Hui Yeoh, Andrea Liew, Raphaël Phan, Fredrik Strand, Kartini Rahmat, Tuong Linh Nguyen, John L. Hopper, Maxine Tan

Via arXiv

Concept Decomposition for Visual Exploration and Inspiration

May 31, 2023
Yael Vinker, Andrey Voynov, Daniel Cohen-Or, Ariel Shamir

Via arXiv