Alert button

"Image": models, code, and papers
Alert button

Unsupervised Modality-Transferable Video Highlight Detection with Representation Activation Sequence Learning

Mar 18, 2024
Tingtian Li, Zixun Sun, Xinyu Xiao

Figure 1 for Unsupervised Modality-Transferable Video Highlight Detection with Representation Activation Sequence Learning
Figure 2 for Unsupervised Modality-Transferable Video Highlight Detection with Representation Activation Sequence Learning
Figure 3 for Unsupervised Modality-Transferable Video Highlight Detection with Representation Activation Sequence Learning
Figure 4 for Unsupervised Modality-Transferable Video Highlight Detection with Representation Activation Sequence Learning
Viaarxiv icon

SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration

Mar 18, 2024
Yanfei Song, Bangzheng Pu, Peng Wang, Hongxu Jiang, Dong Dong, Yongxiang Cao, Yiqing Shen

Figure 1 for SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Figure 2 for SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Figure 3 for SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Figure 4 for SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Viaarxiv icon

When Semantic Segmentation Meets Frequency Aliasing

Add code
Bookmark button
Alert button
Mar 18, 2024
Linwei Chen, Lin Gu, Ying Fu

Figure 1 for When Semantic Segmentation Meets Frequency Aliasing
Figure 2 for When Semantic Segmentation Meets Frequency Aliasing
Figure 3 for When Semantic Segmentation Meets Frequency Aliasing
Figure 4 for When Semantic Segmentation Meets Frequency Aliasing
Viaarxiv icon

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Add code
Bookmark button
Alert button
Mar 18, 2024
Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang

Figure 1 for GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Figure 2 for GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Figure 3 for GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Figure 4 for GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Viaarxiv icon

UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling

Add code
Bookmark button
Alert button
Mar 18, 2024
Yujiao Jiang, Qingmin Liao, Xiaoyu Li, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan

Figure 1 for UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling
Figure 2 for UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling
Figure 3 for UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling
Figure 4 for UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling
Viaarxiv icon

Arc2Face: A Foundation Model of Human Faces

Add code
Bookmark button
Alert button
Mar 18, 2024
Foivos Paraperas Papantoniou, Alexandros Lattas, Stylianos Moschoglou, Jiankang Deng, Bernhard Kainz, Stefanos Zafeiriou

Figure 1 for Arc2Face: A Foundation Model of Human Faces
Figure 2 for Arc2Face: A Foundation Model of Human Faces
Figure 3 for Arc2Face: A Foundation Model of Human Faces
Figure 4 for Arc2Face: A Foundation Model of Human Faces
Viaarxiv icon

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Add code
Bookmark button
Alert button
Mar 18, 2024
Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli Ouyang, Tong He, Yanyong Zhang

Figure 1 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Figure 2 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Figure 3 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Figure 4 for Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Viaarxiv icon

Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

Feb 28, 2024
Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura

Viaarxiv icon

Quantifying the Resolution of a Template after Image Registration

Add code
Bookmark button
Alert button
Feb 27, 2024
Matthias Glock, Thomas Hotz

Viaarxiv icon

Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening

Add code
Bookmark button
Alert button
Mar 14, 2024
Andrew Wang, Mike Davies

Figure 1 for Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening
Figure 2 for Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening
Figure 3 for Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening
Figure 4 for Perspective-Equivariant Imaging: an Unsupervised Framework for Multispectral Pansharpening
Viaarxiv icon