Picture for Xiaoke Huang

Xiaoke Huang

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

Add code
Jun 08, 2024
Viaarxiv icon

Segment and Caption Anything

Add code
Dec 01, 2023
Figure 1 for Segment and Caption Anything
Figure 2 for Segment and Caption Anything
Figure 3 for Segment and Caption Anything
Figure 4 for Segment and Caption Anything
Viaarxiv icon

Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

Add code
Jul 05, 2023
Figure 1 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Figure 2 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Figure 3 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Figure 4 for Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data
Viaarxiv icon

Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution

Add code
Jun 03, 2023
Figure 1 for Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Figure 2 for Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Figure 3 for Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Figure 4 for Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Viaarxiv icon

Predicting Token Impact Towards Efficient Vision Transformer

Add code
May 24, 2023
Figure 1 for Predicting Token Impact Towards Efficient Vision Transformer
Figure 2 for Predicting Token Impact Towards Efficient Vision Transformer
Figure 3 for Predicting Token Impact Towards Efficient Vision Transformer
Figure 4 for Predicting Token Impact Towards Efficient Vision Transformer
Viaarxiv icon

Differentiate ChatGPT-generated and Human-written Medical Texts

Add code
Apr 23, 2023
Figure 1 for Differentiate ChatGPT-generated and Human-written Medical Texts
Figure 2 for Differentiate ChatGPT-generated and Human-written Medical Texts
Figure 3 for Differentiate ChatGPT-generated and Human-written Medical Texts
Figure 4 for Differentiate ChatGPT-generated and Human-written Medical Texts
Viaarxiv icon

Efficient Meshy Neural Fields for Animatable Human Avatars

Add code
Mar 23, 2023
Figure 1 for Efficient Meshy Neural Fields for Animatable Human Avatars
Figure 2 for Efficient Meshy Neural Fields for Animatable Human Avatars
Figure 3 for Efficient Meshy Neural Fields for Animatable Human Avatars
Figure 4 for Efficient Meshy Neural Fields for Animatable Human Avatars
Viaarxiv icon

AugGPT: Leveraging ChatGPT for Text Data Augmentation

Add code
Mar 20, 2023
Figure 1 for AugGPT: Leveraging ChatGPT for Text Data Augmentation
Figure 2 for AugGPT: Leveraging ChatGPT for Text Data Augmentation
Figure 3 for AugGPT: Leveraging ChatGPT for Text Data Augmentation
Figure 4 for AugGPT: Leveraging ChatGPT for Text Data Augmentation
Viaarxiv icon

Mask-guided BERT for Few Shot Text Classification

Add code
Mar 09, 2023
Figure 1 for Mask-guided BERT for Few Shot Text Classification
Figure 2 for Mask-guided BERT for Few Shot Text Classification
Figure 3 for Mask-guided BERT for Few Shot Text Classification
Figure 4 for Mask-guided BERT for Few Shot Text Classification
Viaarxiv icon