Alert button

"Image": models, code, and papers
Alert button

Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models

Nov 26, 2023
Yu-Wei Zhan, Fan Liu, Xin Luo, Liqiang Nie, Xin-Shun Xu, Mohan Kankanhalli

Viaarxiv icon

OCT2Confocal: 3D CycleGAN based Translation of Retinal OCT Images to Confocal Microscopy

Nov 26, 2023
Xin Tian, Nantheera Anantrasirichai, Lindsay Nicholson, Alin Achim

Viaarxiv icon

Domain Aligned CLIP for Few-shot Classification

Add code
Bookmark button
Alert button
Nov 15, 2023
Muhammad Waleed Gondal, Jochen Gast, Inigo Alonso Ruiz, Richard Droste, Tommaso Macri, Suren Kumar, Luitpold Staudigl

Viaarxiv icon

Learning in Deep Factor Graphs with Gaussian Belief Propagation

Nov 24, 2023
Seth Nabarro, Mark van der Wilk, Andrew J Davison

Viaarxiv icon

Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images

Nov 27, 2023
Aiyu Cui, Jay Mahajan, Viraj Shah, Preeti Gomathinayagam, Svetlana Lazebnik

Figure 1 for Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Figure 2 for Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Figure 3 for Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Figure 4 for Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Viaarxiv icon

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Add code
Bookmark button
Alert button
Nov 27, 2023
Zhongyi Shui, Yunlong Zhang, Kai Yao, Chenglu Zhu, Yuxuan Sun, Lin Yang

Viaarxiv icon

One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls

Add code
Bookmark button
Alert button
Nov 27, 2023
Minghui Hu, Jianbin Zheng, Chuanxia Zheng, Chaoyue Wang, Dacheng Tao, Tat-Jen Cham

Figure 1 for One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Figure 2 for One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Figure 3 for One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Figure 4 for One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Viaarxiv icon

DAS: A Deformable Attention to Capture Salient Information in CNNs

Nov 20, 2023
Farzad Salajegheh, Nader Asadi, Soroush Saryazdi, Sudhir Mudur

Viaarxiv icon

AdvGen: Physical Adversarial Attack on Face Presentation Attack Detection Systems

Nov 20, 2023
Sai Amrit Patnaik, Shivali Chansoriya, Anil K. Jain, Anoop M. Namboodiri

Viaarxiv icon

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Add code
Bookmark button
Alert button
Nov 28, 2023
Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao

Figure 1 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 2 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 3 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Figure 4 for MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Viaarxiv icon