Alert button

"Image": models, code, and papers
Alert button

Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification

Jan 02, 2024
Xuelin Zhu, Jian Liu, Dongqi Tang, Jiawei Ge, Weijia Liu, Bo Liu, Jiuxin Cao

Viaarxiv icon

Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation

Add code
Bookmark button
Alert button
Jan 02, 2024
Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li

Viaarxiv icon

Q-Seg: Quantum Annealing-based Unsupervised Image Segmentation

Nov 30, 2023
Supreeth Mysore Venkatesh, Antonio Macaluso, Marlon Nuske, Matthias Klusch, Andreas Dengel

Viaarxiv icon

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

Nov 28, 2023
Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel

Viaarxiv icon

DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition

Jan 01, 2024
Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do

Viaarxiv icon

Geometry Depth Consistency in RGBD Relative Pose Estimation

Jan 01, 2024
Sourav Kumar, Chiang-Heng Chien, Benjamin Kimia

Viaarxiv icon

C3: High-performance and low-complexity neural compression from a single image or video

Dec 05, 2023
Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz, Emilien Dupont

Figure 1 for C3: High-performance and low-complexity neural compression from a single image or video
Figure 2 for C3: High-performance and low-complexity neural compression from a single image or video
Figure 3 for C3: High-performance and low-complexity neural compression from a single image or video
Figure 4 for C3: High-performance and low-complexity neural compression from a single image or video
Viaarxiv icon

Neural Born Series Operator for Biomedical Ultrasound Computed Tomography

Dec 25, 2023
Zhijun Zeng, Yihang Zheng, Youjia Zheng, Yubing Li, Zuoqiang Shi, He Sun

Viaarxiv icon

HyperDID: Hyperspectral Intrinsic Image Decomposition with Deep Feature Embedding

Add code
Bookmark button
Alert button
Nov 25, 2023
Zhiqiang Gong, Xian Zhou, Wen Yao, Xiaohu Zheng, Ping Zhong

Viaarxiv icon

UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs

Nov 29, 2023
Yanwu Xu, Yang Zhao, Zhisheng Xiao, Tingbo Hou

Figure 1 for UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Figure 2 for UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Figure 3 for UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Figure 4 for UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Viaarxiv icon