"photo": models, code, and papers

Towards Realistic Scene Generation with LiDAR Diffusion Models

Mar 31, 2024
Haoxi Ran, Vitor Guizilini, Yue Wang

3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos

Mar 05, 2024
Jiakai Sun, Han Jiao, Guangyuan Li, Zhanjie Zhang, Lei Zhao, Wei Xing

Disentangling Racial Phenotypes: Fine-Grained Control of Race-related Facial Phenotype Characteristics

Mar 29, 2024
Seyma Yucer, Amir Atapour Abarghouei, Noura Al Moubayed, Toby P. Breckon

CosmicMan: A Text-to-Image Foundation Model for Humans

Apr 01, 2024
Shikai Li, Jianglin Fu, Kaiyuan Liu, Wentao Wang, Kwan-Yee Lin, Wayne Wu

Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos

Mar 19, 2024
Hadi Alzayer, Zhihao Xia, Xuaner Zhang, Eli Shechtman, Jia-Bin Huang, Michael Gharbi

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

Mar 28, 2024
Zhicai Wang, Longhui Wei, Tan Wang, Heyu Chen, Yanbin Hao, Xiang Wang, Xiangnan He, Qi Tian

Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers

Mar 20, 2024
Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images

Mar 28, 2024
Zaid Tasneem, Akshat Dave, Abhishek Singh, Kushagra Tiwary, Praneeth Vepakomma, Ashok Veeraraghavan, Ramesh Raskar

IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation

Mar 28, 2024
Jiacui Huang, Hongtao Zhang, Mingbo Zhao, Zhou Wu

If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions

Mar 25, 2024
Reza Esfandiarpoor, Cristina Menghini, Stephen H. Bach
