Alert button

"Image": models, code, and papers
Alert button

Dolphins: Multimodal Language Model for Driving

Add code
Bookmark button
Alert button
Dec 01, 2023
Yingzi Ma, Yulong Cao, Jiachen Sun, Marco Pavone, Chaowei Xiao

Figure 1 for Dolphins: Multimodal Language Model for Driving
Figure 2 for Dolphins: Multimodal Language Model for Driving
Figure 3 for Dolphins: Multimodal Language Model for Driving
Figure 4 for Dolphins: Multimodal Language Model for Driving
Viaarxiv icon

Text-Guided 3D Face Synthesis -- From Generation to Editing

Add code
Bookmark button
Alert button
Dec 01, 2023
Yunjie Wu, Yapeng Meng, Zhipeng Hu, Lincheng Li, Haoqian Wu, Kun Zhou, Weiwei Xu, Xin Yu

Figure 1 for Text-Guided 3D Face Synthesis -- From Generation to Editing
Figure 2 for Text-Guided 3D Face Synthesis -- From Generation to Editing
Figure 3 for Text-Guided 3D Face Synthesis -- From Generation to Editing
Figure 4 for Text-Guided 3D Face Synthesis -- From Generation to Editing
Viaarxiv icon

Self-Supervised Learning of Whole and Component-Based Semantic Representations for Person Re-Identification

Dec 01, 2023
Siyuan Huang, Yifan Zhou, Ram Prabhakar Kathirvel, Rama Chellappa, Chun Pong Lau

Viaarxiv icon

A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data

Add code
Bookmark button
Alert button
Nov 29, 2023
Ethan Harvey, Wansu Chen, David M. Kent, Michael C. Hughes

Figure 1 for A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data
Figure 2 for A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data
Figure 3 for A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data
Figure 4 for A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data
Viaarxiv icon

Computer Vision for Increased Operative Efficiency via Identification of Instruments in the Neurosurgical Operating Room: A Proof-of-Concept Study

Dec 03, 2023
Tanner J. Zachem, Sully F. Chen, Vishal Venkatraman, David AW Sykes, Ravi Prakash, Samantha Spellicy, Alexander D Suarez, Weston Ross, Patrick J. Codd

Viaarxiv icon

Consistency Prototype Module and Motion Compensation for Few-Shot Action Recognition (CLIP-CP$\mathbf{M^2}$C)

Add code
Bookmark button
Alert button
Dec 02, 2023
Fei Guo, Li Zhu, YiKang Wang, Han Qi

Viaarxiv icon

StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D

Dec 02, 2023
Pengsheng Guo, Hans Hao, Adam Caccavale, Zhongzheng Ren, Edward Zhang, Qi Shan, Aditya Sankar, Alexander G. Schwing, Alex Colburn, Fangchang Ma

Viaarxiv icon

Deep Learning as a Method for Inversion of NMR Signals

Nov 22, 2023
Julian B. B. Beckmann, Mick D. Mantle, Andrew J. Sederman, Lynn F. Gladden

Viaarxiv icon

Adversarial Prompt Tuning for Vision-Language Models

Nov 19, 2023
Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang, Yu-Gang Jiang, Jitao Sang

Viaarxiv icon

EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model

Oct 19, 2023
Zheyuan Zhang, Lanhong Yao, Bin Wang, Debesh Jha, Elif Keles, Alpay Medetalibeyoglu, Ulas Bagci

Viaarxiv icon