Picture for Guanyan Chen

Guanyan Chen

VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions

Add code
Oct 29, 2024
Figure 1 for VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions
Figure 2 for VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions
Figure 3 for VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions
Figure 4 for VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions
Viaarxiv icon