Alert button

"Image": models, code, and papers
Alert button

Segment Any Medical Model Extended

Add code
Bookmark button
Alert button
Mar 26, 2024
Yihao Liu, Jiaming Zhang, Andres Diaz-Pinto, Haowei Li, Alejandro Martin-Gomez, Amir Kheradmand, Mehran Armand

Viaarxiv icon

Versatile Defense Against Adversarial Attacks on Image Recognition

Mar 13, 2024
Haibo Zhang, Zhihua Yao, Kouichi Sakurai

Figure 1 for Versatile Defense Against Adversarial Attacks on Image Recognition
Figure 2 for Versatile Defense Against Adversarial Attacks on Image Recognition
Figure 3 for Versatile Defense Against Adversarial Attacks on Image Recognition
Figure 4 for Versatile Defense Against Adversarial Attacks on Image Recognition
Viaarxiv icon

Modeling uncertainty for Gaussian Splatting

Mar 27, 2024
Luca Savant, Diego Valsesia, Enrico Magli

Viaarxiv icon

Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Mar 12, 2024
Sahand Sharifzadeh, Christos Kaplanis, Shreya Pathak, Dharshan Kumaran, Anastasija Ilic, Jovana Mitrovic, Charles Blundell, Andrea Banino

Figure 1 for Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings
Figure 2 for Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings
Figure 3 for Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings
Figure 4 for Synth$^2$: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings
Viaarxiv icon

Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization

Mar 31, 2024
Yu Xu, Fan Tang, Juan Cao, Yuxin Zhang, Oliver Deussen, Weiming Dong, Jintao Li, Tong-Yee Lee

Figure 1 for Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization
Figure 2 for Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization
Figure 3 for Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization
Figure 4 for Break-for-Make: Modular Low-Rank Adaptations for Composable Content-Style Customization
Viaarxiv icon

IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

Mar 30, 2024
Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar

Viaarxiv icon

Denoising Monte Carlo Renders With Diffusion Models

Mar 30, 2024
Vaibhav Vavilala, Rahul Vasanth, David Forsyth

Viaarxiv icon

T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image

Mar 20, 2024
Shijie Zhang, Boyan Jiang, Keke He, Junwei Zhu, Ying Tai, Chengjie Wang, Yinda Zhang, Yanwei Fu

Figure 1 for T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image
Figure 2 for T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image
Figure 3 for T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image
Figure 4 for T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image
Viaarxiv icon

Low Rank Groupwise Deformations for Motion Tracking in Cardiac Cine MRI

Mar 24, 2024
Sean Rendell, Jinming Duan

Viaarxiv icon

Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling

Mar 15, 2024
Baoquan Zhang, Huaibin Wang, Luo Chuyao, Xutao Li, Liang Guotao, Yunming Ye, Xiaochen Qi, Yao He

Figure 1 for Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Figure 2 for Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Figure 3 for Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Figure 4 for Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Viaarxiv icon