Alert button

"Image": models, code, and papers
Alert button

Release of Pre-Trained Models for the Japanese Language

Apr 02, 2024
Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki, Koh Mitsuda

Viaarxiv icon

Fashion Style Editing with Generative Human Prior

Apr 02, 2024
Chaerin Kong, Seungyong Lee, Soohyeok Im, Wonsuk Yang

Viaarxiv icon

Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation

Add code
Bookmark button
Alert button
Apr 02, 2024
Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Junjie Hu, Ming Jiang, Shuqiang Jiang

Viaarxiv icon

A Comprehensive Survey on AI-based Methods for Patents

Apr 02, 2024
Homaira Huda Shomee, Zhu Wang, Sathya N. Ravi, Sourav Medya

Viaarxiv icon

VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis

Mar 16, 2024
Hao Wei, Bowen Liu, Minqing Zhang, Peilun Shi, Wu Yuan

Figure 1 for VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis
Figure 2 for VisionCLIP: An Med-AIGC based Ethical Language-Image Foundation Model for Generalizable Retina Image Analysis
Viaarxiv icon

Segment Anything Model for Road Network Graph Extraction

Add code
Bookmark button
Alert button
Mar 31, 2024
Congrui Hetang, Haoru Xue, Cindy Le, Tianwei Yue, Wenping Wang, Yihui He

Viaarxiv icon

Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation

Add code
Bookmark button
Alert button
Mar 16, 2024
Soumyajyoti Dey, Sukanta Chakraborty, Utso Guha Roy, Nibaran Das

Figure 1 for Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation
Figure 2 for Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation
Figure 3 for Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation
Figure 4 for Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation
Viaarxiv icon

LocCa: Visual Pretraining with Location-aware Captioners

Add code
Bookmark button
Alert button
Mar 28, 2024
Bo Wan, Michael Tschannen, Yongqin Xian, Filip Pavetic, Ibrahim Alabdulmohsin, Xiao Wang, André Susano Pinto, Andreas Steiner, Lucas Beyer, Xiaohua Zhai

Figure 1 for LocCa: Visual Pretraining with Location-aware Captioners
Figure 2 for LocCa: Visual Pretraining with Location-aware Captioners
Figure 3 for LocCa: Visual Pretraining with Location-aware Captioners
Figure 4 for LocCa: Visual Pretraining with Location-aware Captioners
Viaarxiv icon

Bidirectional Consistency Models

Add code
Bookmark button
Alert button
Mar 30, 2024
Liangchen Li, Jiajun He

Viaarxiv icon

Towards 3D Vision with Low-Cost Single-Photon Cameras

Add code
Bookmark button
Alert button
Mar 29, 2024
Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin Li

Viaarxiv icon