Alert button

"Image": models, code, and papers
Alert button

One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

Jun 29, 2023
Minghua Liu, Chao Xu, Haian Jin, Linghao Chen, Mukund Varma T, Zexiang Xu, Hao Su

Figure 1 for One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
Figure 2 for One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
Figure 3 for One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
Figure 4 for One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
Viaarxiv icon

Improving 3D Pose Estimation for Sign Language

Add code
Bookmark button
Alert button
Aug 18, 2023
Maksym Ivashechkin, Oscar Mendez, Richard Bowden

Figure 1 for Improving 3D Pose Estimation for Sign Language
Figure 2 for Improving 3D Pose Estimation for Sign Language
Figure 3 for Improving 3D Pose Estimation for Sign Language
Figure 4 for Improving 3D Pose Estimation for Sign Language
Viaarxiv icon

Transferring Knowledge for Food Image Segmentation using Transformers and Convolutions

Jun 15, 2023
Grant Sinha, Krish Parmar, Hilda Azimi, Amy Tai, Yuhao Chen, Alexander Wong, Pengcheng Xi

Figure 1 for Transferring Knowledge for Food Image Segmentation using Transformers and Convolutions
Figure 2 for Transferring Knowledge for Food Image Segmentation using Transformers and Convolutions
Figure 3 for Transferring Knowledge for Food Image Segmentation using Transformers and Convolutions
Figure 4 for Transferring Knowledge for Food Image Segmentation using Transformers and Convolutions
Viaarxiv icon

ACE-HetEM for ab initio Heterogenous Cryo-EM 3D Reconstruction

Aug 09, 2023
Weijie Chen, Lin Yao, Zeqing Xia, Yuhang Wang

Viaarxiv icon

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt

Add code
Bookmark button
Alert button
Jun 28, 2023
Kai Chen, Enze Xie, Zhe Chen, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung

Figure 1 for Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt
Figure 2 for Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt
Figure 3 for Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt
Figure 4 for Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt
Viaarxiv icon

Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP

Add code
Bookmark button
Alert button
Aug 27, 2023
Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang

Figure 1 for Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP
Figure 2 for Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP
Figure 3 for Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP
Figure 4 for Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP
Viaarxiv icon

Pruning the Unlabeled Data to Improve Semi-Supervised Learning

Aug 27, 2023
Guy Hacohen, Daphna Weinshall

Viaarxiv icon

Joint learning of images and videos with a single Vision Transformer

Aug 21, 2023
Shuki Shimizu, Toru Tamaki

Viaarxiv icon

Bang and the Artefacts are Gone! Rapid Artefact Removal and Tissue Segmentation in Haematoxylin and Eosin Stained Biopsies

Add code
Bookmark button
Alert button
Aug 25, 2023
B. A. Schreiber, J. Denholm, F. Jaeckle, M. J. Arends, K. M. Branson, C. -B. Schönlieb, E. J. Soilleux

Viaarxiv icon

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

Aug 25, 2023
Mei-Yuh Hwang, Yangyang Shi, Ankit Ramchandani, Guan Pang, Praveen Krishnan, Lucas Kabela, Frank Seide, Samyak Datta, Jun Liu

Figure 1 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 2 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 3 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 4 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Viaarxiv icon