Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jason Chen

D-CODA: Diffusion for Coordinated Dual-Arm Data Augmentation

May 08, 2025

I-Chun Arthur Liu, Jason Chen, Gaurav Sukhatme, Daniel Seita

Abstract:Learning bimanual manipulation is challenging due to its high dimensionality and tight coordination required between two arms. Eye-in-hand imitation learning, which uses wrist-mounted cameras, simplifies perception by focusing on task-relevant views. However, collecting diverse demonstrations remains costly, motivating the need for scalable data augmentation. While prior work has explored visual augmentation in single-arm settings, extending these approaches to bimanual manipulation requires generating viewpoint-consistent observations across both arms and producing corresponding action labels that are both valid and feasible. In this work, we propose Diffusion for COordinated Dual-arm Data Augmentation (D-CODA), a method for offline data augmentation tailored to eye-in-hand bimanual imitation learning that trains a diffusion model to synthesize novel, viewpoint-consistent wrist-camera images for both arms while simultaneously generating joint-space action labels. It employs constrained optimization to ensure that augmented states involving gripper-to-object contacts adhere to constraints suitable for bimanual coordination. We evaluate D-CODA on 5 simulated and 3 real-world tasks. Our results across 2250 simulation trials and 300 real-world trials demonstrate that it outperforms baselines and ablations, showing its potential for scalable data augmentation in eye-in-hand bimanual manipulation. Our project website is at: https://dcodaaug.github.io/D-CODA/.

Via

Access Paper or Ask Questions

Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets

Feb 12, 2024

Violet Liu, Jason Chen, Ans Qureshi, Mahla Nejati

Figure 1 for Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets

Figure 2 for Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets

Figure 3 for Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets

Figure 4 for Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets

Abstract:Amidst growing food production demands, early plant disease detection is essential to safeguard crops; this study proposes a visual machine learning approach for plant disease detection, harnessing RGB and NIR data collected in real-world conditions through a JAI FS-1600D-10GE camera to build an RGBN dataset. A two-stage early plant disease detection model with YOLOv8 and a sequential CNN was used to train on a dataset with partial labels, which showed a 3.6% increase in mAP compared to a single-stage end-to-end segmentation model. The sequential CNN model achieved 90.62% validation accuracy utilising RGBN data. An average of 6.25% validation accuracy increase is found using RGBN in classification compared to RGB using ResNet15 and the sequential CNN models. Further research and dataset improvements are needed to meet food production demands.

* Australasian Conference on Robotics and Automation (ACRA 2023)

Via

Access Paper or Ask Questions