Alert button

"Image": models, code, and papers
Alert button

Image-Caption Encoding for Improving Zero-Shot Generalization

Add code
Bookmark button
Alert button
Feb 05, 2024
Eric Yang Yu, Christopher Liao, Sathvik Ravi, Theodoros Tsiligkaridis, Brian Kulis

Viaarxiv icon

ManiFPT: Defining and Analyzing Fingerprints of Generative Models

Feb 29, 2024
Hae Jin Song, Mahyar Khayatkhoei, Wael AbdAlmageed

Viaarxiv icon

DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization

Add code
Bookmark button
Alert button
Feb 15, 2024
Jisu Nam, Heesu Kim, DongJae Lee, Siyoon Jin, Seungryong Kim, Seunggyu Chang

Viaarxiv icon

GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video

Add code
Bookmark button
Alert button
Feb 26, 2024
Xinqi Liu, Chenming Wu, Xing Liu, Jialun Liu, Jinbo Wu, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang

Viaarxiv icon

A Multispectral Automated Transfer Technique (MATT) for machine-driven image labeling utilizing the Segment Anything Model (SAM)

Feb 18, 2024
James E. Gallagher, Aryav Gogia, Edward J. Oughton

Viaarxiv icon

Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection

Feb 16, 2024
Genki Osada, Tsubasa Takahashi, Takashi Nishide

Viaarxiv icon

Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation Strategies

Add code
Bookmark button
Alert button
Feb 17, 2024
Yifei Chen, Chenyan Zhang, Yifan Ke, Yiyu Huang, Xuezhou Dai, Feiwei Qin, Yongquan Zhang, Xiaodong Zhang, Changmiao Wang

Viaarxiv icon

Compressed image quality assessment using stacking

Feb 01, 2024
S. Farhad Hosseini-Benvidi, Hossein Motamednia, Azadeh Mansouri, Mohammadreza Raei, Ahmad Mahmoudi-Aznaveh

Viaarxiv icon

TempCompass: Do Video LLMs Really Understand Videos?

Add code
Bookmark button
Alert button
Mar 01, 2024
Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Figure 1 for TempCompass: Do Video LLMs Really Understand Videos?
Figure 2 for TempCompass: Do Video LLMs Really Understand Videos?
Figure 3 for TempCompass: Do Video LLMs Really Understand Videos?
Figure 4 for TempCompass: Do Video LLMs Really Understand Videos?
Viaarxiv icon

Compact and De-biased Negative Instance Embedding for Multi-Instance Learning on Whole-Slide Image Classification

Add code
Bookmark button
Alert button
Feb 16, 2024
Joohyung Lee, Heejeong Nam, Kwanhyung Lee, Sangchul Hahn

Viaarxiv icon