Alert button

"Image": models, code, and papers
Alert button

CLIP-Loc: Multi-modal Landmark Association for Global Localization in Object-based Maps

Feb 08, 2024
Shigemichi Matsuzaki, Takuma Sugino, Kazuhito Tanaka, Zijun Sha, Shintaro Nakaoka, Shintaro Yoshizawa, Kazuhiro Shintani

Viaarxiv icon

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive

Add code
Bookmark button
Alert button
Jan 16, 2024
Yumeng Li, Margret Keuper, Dan Zhang, Anna Khoreva

Viaarxiv icon

Segment Any Change

Feb 02, 2024
Zhuo Zheng, Yanfei Zhong, Liangpei Zhang, Stefano Ermon

Viaarxiv icon

MixNet: Towards Effective and Efficient UHD Low-Light Image Enhancement

Add code
Bookmark button
Alert button
Jan 19, 2024
Chen Wu, Zhuoran Zheng, Xiuyi Jia, Wenqi Ren

Viaarxiv icon

Embedded Hyperspectral Band Selection with Adaptive Optimization for Image Semantic Segmentation

Jan 21, 2024
Yaniv Zimmer, Oren Glickman

Viaarxiv icon

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Feb 09, 2024
Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, Qixing Huang, Li Erran Li

Viaarxiv icon

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

Add code
Bookmark button
Alert button
Feb 09, 2024
Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

Viaarxiv icon

GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting

Feb 06, 2024
Joanna Waczyńska, Piotr Borycki, Sławomir Tadeja, Jacek Tabor, Przemysław Spurek

Viaarxiv icon

DPAFNet:Dual Path Attention Fusion Network for Single Image Deraining

Jan 16, 2024
Bingcai Wei

Viaarxiv icon

Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

Add code
Bookmark button
Alert button
Jan 21, 2024
Katherine Crowson, Stefan Andreas Baumann, Alex Birch, Tanishq Mathew Abraham, Daniel Z. Kaplan, Enrico Shippole

Viaarxiv icon