"Image": models, code, and papers

Cameras as Rays: Pose Estimation via Ray Diffusion

Feb 22, 2024
Jason Y. Zhang, Amy Lin, Moneish Kumar, Tzu-Hsuan Yang, Deva Ramanan, Shubham Tulsiani

ToDo: Token Downsampling for Efficient Generation of High-Resolution Images

Feb 21, 2024
Ethan Smith, Nayan Saxena, Aninda Saha

Scaling Supervised Local Learning with Augmented Auxiliary Networks

Feb 27, 2024
Chenxiang Ma, Jibin Wu, Chenyang Si, Kay Chen Tan

An Interpretable Evaluation of Entropy-based Novelty of Generative Models

Feb 27, 2024
Jingwei Zhang, Cheuk Ting Li, Farzan Farnia

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Feb 27, 2024
Hanan Gani, Muzammal Naseer, Fahad Khan, Salman Khan

FuseFormer: A Transformer for Visual and Thermal Image Fusion

Feb 01, 2024
Aytekin Erdogan, Erdem Akagunduz

PAC-FNO: Parallel-Structured All-Component Fourier Neural Operators for Recognizing Low-Quality Images

Feb 20, 2024
Jinsung Jeon, Hyundong Jin, Jonghyun Choi, Sanghyun Hong, Dongeun Lee, Kookjin Lee, Noseong Park

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

Feb 26, 2024
Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

Mysterious Projections: Multimodal LLMs Gain Domain-Specific Visual Capabilities Without Richer Cross-Modal Projections

Feb 26, 2024
Gaurav Verma, Minje Choi, Kartik Sharma, Jamelle Watson-Daniels, Sejoon Oh, Srijan Kumar

Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing

Feb 26, 2024
Ling Yang, Zhilong Zhang, Zhaochen Yu, Jingwei Liu, Minkai Xu, Stefano Ermon, Bin Cui
