Vikas Chandra

Taming Mode Collapse in Score Distillation for Text-to-3D Generation

Dec 31, 2023

SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity

Dec 31, 2023

SqueezeSAM: User friendly mobile interactive segmentation

Dec 11, 2023

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Dec 01, 2023

In-Context Prompt Editing For Conditional Audio Generation

Nov 01, 2023

On The Open Prompt Challenge In Conditional Audio Generation

Nov 01, 2023

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

Oct 26, 2023

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

Sep 21, 2023

Exploring Speech Enhancement for Low-resource Speech Synthesis

Sep 19, 2023

FoleyGen: Visually-Guided Audio Generation

Sep 19, 2023