Alert button

"Image": models, code, and papers
Alert button

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Add code
Bookmark button
Alert button
Mar 18, 2024
Zixin Zhu, Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua

Figure 1 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 2 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 3 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 4 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Viaarxiv icon

HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances

Add code
Bookmark button
Alert button
Mar 04, 2024
Supreeth Narasimhaswamy, Uttaran Bhattacharya, Xiang Chen, Ishita Dasgupta, Saayan Mitra, Minh Hoai

Figure 1 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Figure 2 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Figure 3 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Figure 4 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Viaarxiv icon

General Purpose Image Encoder DINOv2 for Medical Image Registration

Feb 24, 2024
Xinrui Song, Xuanang Xu, Pingkun Yan

Viaarxiv icon

Randomized Principal Component Analysis for Hyperspectral Image Classification

Mar 14, 2024
Mustafa Ustuner

Figure 1 for Randomized Principal Component Analysis for Hyperspectral Image Classification
Figure 2 for Randomized Principal Component Analysis for Hyperspectral Image Classification
Figure 3 for Randomized Principal Component Analysis for Hyperspectral Image Classification
Figure 4 for Randomized Principal Component Analysis for Hyperspectral Image Classification
Viaarxiv icon

ChartReformer: Natural Language-Driven Chart Image Editing

Add code
Bookmark button
Alert button
Mar 01, 2024
Pengyu Yan, Mahesh Bhosale, Jay Lal, Bikhyat Adhikari, David Doermann

Figure 1 for ChartReformer: Natural Language-Driven Chart Image Editing
Figure 2 for ChartReformer: Natural Language-Driven Chart Image Editing
Figure 3 for ChartReformer: Natural Language-Driven Chart Image Editing
Figure 4 for ChartReformer: Natural Language-Driven Chart Image Editing
Viaarxiv icon

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Add code
Bookmark button
Alert button
Mar 14, 2024
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Figure 1 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 2 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 3 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 4 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Viaarxiv icon

Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning

Add code
Bookmark button
Alert button
Mar 05, 2024
Zhaoxin Fan, Runmin Jiang, Junhao Wu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu

Figure 1 for Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning
Figure 2 for Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning
Figure 3 for Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning
Figure 4 for Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning
Viaarxiv icon

Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders

Mar 05, 2024
Daniele Mari, Simone Milani

Figure 1 for Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders
Figure 2 for Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders
Figure 3 for Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders
Viaarxiv icon

ConGeo: Robust Cross-view Geo-localization across Ground View Variations

Add code
Bookmark button
Alert button
Mar 20, 2024
Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia

Figure 1 for ConGeo: Robust Cross-view Geo-localization across Ground View Variations
Figure 2 for ConGeo: Robust Cross-view Geo-localization across Ground View Variations
Figure 3 for ConGeo: Robust Cross-view Geo-localization across Ground View Variations
Figure 4 for ConGeo: Robust Cross-view Geo-localization across Ground View Variations
Viaarxiv icon

SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning

Add code
Bookmark button
Alert button
Mar 20, 2024
Hongjun Wang, Sagar Vaze, Kai Han

Figure 1 for SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
Figure 2 for SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
Figure 3 for SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
Figure 4 for SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
Viaarxiv icon