Alert button

"Image": models, code, and papers
Alert button

Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection

Sep 25, 2022
Xiang Zhang, Huiyuan Yang, Taoyue Wang, Xiaotian Li, Lijun Yin

Figure 1 for Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection
Figure 2 for Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection
Figure 3 for Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection
Figure 4 for Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit Detection
Viaarxiv icon

StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation

Add code
Bookmark button
Alert button
Dec 15, 2021
Umut Kocasari, Alara Dirik, Mert Tiftikci, Pinar Yanardag

Figure 1 for StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation
Figure 2 for StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation
Figure 3 for StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation
Figure 4 for StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation
Viaarxiv icon

Approaching the Limit of Image Rescaling via Flow Guidance

Nov 09, 2021
Shang Li, Guixuan Zhang, Zhengxiong Luo, Jie Liu, Zhi Zeng, Shuwu Zhang

Figure 1 for Approaching the Limit of Image Rescaling via Flow Guidance
Figure 2 for Approaching the Limit of Image Rescaling via Flow Guidance
Figure 3 for Approaching the Limit of Image Rescaling via Flow Guidance
Figure 4 for Approaching the Limit of Image Rescaling via Flow Guidance
Viaarxiv icon

Data Poisoning Attacks Against Multimodal Encoders

Add code
Bookmark button
Alert button
Sep 30, 2022
Ziqing Yang, Xinlei He, Zheng Li, Michael Backes, Mathias Humbert, Pascal Berrang, Yang Zhang

Figure 1 for Data Poisoning Attacks Against Multimodal Encoders
Figure 2 for Data Poisoning Attacks Against Multimodal Encoders
Figure 3 for Data Poisoning Attacks Against Multimodal Encoders
Figure 4 for Data Poisoning Attacks Against Multimodal Encoders
Viaarxiv icon

MD-Net: Multi-Detector for Local Feature Extraction

Aug 10, 2022
Emanuele Santellani, Christian Sormann, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer

Figure 1 for MD-Net: Multi-Detector for Local Feature Extraction
Figure 2 for MD-Net: Multi-Detector for Local Feature Extraction
Figure 3 for MD-Net: Multi-Detector for Local Feature Extraction
Figure 4 for MD-Net: Multi-Detector for Local Feature Extraction
Viaarxiv icon

RGB-Event Fusion for Moving Object Detection in Autonomous Driving

Add code
Bookmark button
Alert button
Sep 17, 2022
Zhuyun Zhou, Zongwei Wu, Rémi Boutteau, Fan Yang, Cédric Demonceaux, Dominique Ginhac

Figure 1 for RGB-Event Fusion for Moving Object Detection in Autonomous Driving
Figure 2 for RGB-Event Fusion for Moving Object Detection in Autonomous Driving
Figure 3 for RGB-Event Fusion for Moving Object Detection in Autonomous Driving
Figure 4 for RGB-Event Fusion for Moving Object Detection in Autonomous Driving
Viaarxiv icon

Deep AUC Maximization for Medical Image Classification: Challenges and Opportunities

Add code
Bookmark button
Alert button
Nov 01, 2021
Tianbao Yang

Figure 1 for Deep AUC Maximization for Medical Image Classification: Challenges and Opportunities
Viaarxiv icon

CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer

Sep 14, 2022
Youngseok Kim, Sanmin Kim, Jun Won Choi, Dongsuk Kum

Figure 1 for CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer
Figure 2 for CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer
Figure 3 for CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer
Figure 4 for CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer
Viaarxiv icon

Single Image Object Counting and Localizing using Active-Learning

Nov 16, 2021
Inbar Huberman-Spiegelglas, Raanan Fattal

Figure 1 for Single Image Object Counting and Localizing using Active-Learning
Figure 2 for Single Image Object Counting and Localizing using Active-Learning
Figure 3 for Single Image Object Counting and Localizing using Active-Learning
Figure 4 for Single Image Object Counting and Localizing using Active-Learning
Viaarxiv icon

What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs

Add code
Bookmark button
Alert button
Jun 27, 2022
Tal Shaharabany, Yoad Tewel, Lior Wolf

Figure 1 for What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Figure 2 for What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Figure 3 for What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Figure 4 for What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Viaarxiv icon