Picture for Thieu Vo

Thieu Vo

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Add code
May 19, 2025
Viaarxiv icon

Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications

Add code
Sep 26, 2024
Figure 1 for Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Figure 2 for Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Figure 3 for Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Figure 4 for Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Viaarxiv icon

Language-driven Grasp Detection with Mask-guided Attention

Add code
Jul 29, 2024
Figure 1 for Language-driven Grasp Detection with Mask-guided Attention
Figure 2 for Language-driven Grasp Detection with Mask-guided Attention
Figure 3 for Language-driven Grasp Detection with Mask-guided Attention
Figure 4 for Language-driven Grasp Detection with Mask-guided Attention
Viaarxiv icon

Lightweight Language-driven Grasp Detection using Conditional Consistency Model

Add code
Jul 25, 2024
Figure 1 for Lightweight Language-driven Grasp Detection using Conditional Consistency Model
Figure 2 for Lightweight Language-driven Grasp Detection using Conditional Consistency Model
Figure 3 for Lightweight Language-driven Grasp Detection using Conditional Consistency Model
Figure 4 for Lightweight Language-driven Grasp Detection using Conditional Consistency Model
Viaarxiv icon

Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

Add code
Jul 18, 2024
Figure 1 for Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance
Figure 2 for Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance
Figure 3 for Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance
Figure 4 for Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance
Viaarxiv icon

Language-driven Grasp Detection

Add code
Jun 13, 2024
Viaarxiv icon

Language-driven Scene Synthesis using Multi-conditional Diffusion Model

Add code
Oct 24, 2023
Viaarxiv icon

Language-Conditioned Affordance-Pose Detection in 3D Point Clouds

Add code
Sep 19, 2023
Figure 1 for Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Figure 2 for Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Figure 3 for Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Figure 4 for Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Viaarxiv icon

Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation

Add code
Sep 19, 2023
Viaarxiv icon

Grasp-Anything: Large-scale Grasp Dataset from Foundation Models

Add code
Sep 18, 2023
Figure 1 for Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
Figure 2 for Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
Figure 3 for Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
Figure 4 for Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
Viaarxiv icon