Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vikash Kumar

Modified Lagrangian Formulation of Gear Tooth Crack Analysis using Combined Approach of Variable Mode Decomposition (VMD) and Time Synchronous Averaging (TSA)

Aug 29, 2023

Subrata Mukherjee, Vikash Kumar, Somnath Sarangi

Abstract:This paper discusses the possible observation of an integrated gear tooth crack analysis procedure that employs the combined approach of variable mode decomposition (VMD) and time synchronous averaging (TSA) based on the coupled electromechanical gearbox (CEMG) system. This paper also incorporates the modified Lagrangian formulation to model the CEMG system by considering Rayleigh's dissipative potential. An analytical improved time-varying mesh stiffness (IAM-TVMS) with different levels of gear tooth crack depts is also incorporated into the CEMG system to inspect the influence of cracks on the system's dynamic behavior. Dynamic responses of the CEMG system with different tooth crack levels have been used for further investigations. For the first time, the integrated approach of variable mode decomposition (VMD) and time-synchronous averaging (TSA) has been presented to analyze the dynamic behaviour of CEMG systems at the different gear tooth cracks have been experienced as non-stationary and complex vibration signals with noise. Based on the integrated approach of VMD-TSA, two types of nonlinear features, i.e., Lyapunov Exponent (LE) and Correlation Dimension (CD), were calculated to predict the level of chaotic vibration and complexity of the CEMG system at the different levels of gear tooth cracks. Also, the LE and CD are used as chaotic behaviour features to predict the gear tooth crack propagation level. The results of the proposed approach show significant improvements in the gear tooth crack analysis based on the chaotic features. Also, this is one of the first attempts to study the CEMG system using chaotic features based on the combined approach of VMD-TSA.

* 17 pages, 36 figures, 6th Joint International Conference on Multibody System Dynamics and the 10th Asian Conference on Multibody Dynamics 2022

Via

Access Paper or Ask Questions

Integrated Approach of Gearbox Fault Diagnosis

Aug 27, 2023

Vikash Kumar, Subrata Mukherjee, Somnath Sarangi

Figure 1 for Integrated Approach of Gearbox Fault Diagnosis

Figure 2 for Integrated Approach of Gearbox Fault Diagnosis

Figure 3 for Integrated Approach of Gearbox Fault Diagnosis

Figure 4 for Integrated Approach of Gearbox Fault Diagnosis

Abstract:Gearbox fault diagnosis is one of the most important parts in any industrial systems. Failure of components inside gearbox can lead to a catastrophic failure, uneven breakdown, and financial losses in industrial organization. In that case intelligent maintenance of the gearbox comes into context. This paper presents an integrated gearbox fault diagnosis approach which can easily deploy in online condition monitoring. This work introduces a nonparametric data preprocessing technique i.e., calculus enhanced energy operator (CEEO) to preserve the characteristics frequencies in the noisy and inferred vibrational signal. A set of time domain and spectral domain features are calculated from the raw and CEEO vibration signal and inputted to the multiclass support vector machine (MCSVM) to diagnose the faults on the system. An effective comparison between raw signal and CEEO signal are presented to show the impact of CEEO in gearbox fault diagnosis. The obtained results of this work look very promising and can be implemented in any type of industrial system due to its nonparametric nature.

Via

Access Paper or Ask Questions

SAR: Generalization of Physiological Agility and Dexterity via Synergistic Action Representation

Jul 14, 2023

Cameron Berg, Vittorio Caggiano, Vikash Kumar

Abstract:Learning effective continuous control policies in high-dimensional systems, including musculoskeletal agents, remains a significant challenge. Over the course of biological evolution, organisms have developed robust mechanisms for overcoming this complexity to learn highly sophisticated strategies for motor control. What accounts for this robust behavioral flexibility? Modular control via muscle synergies, i.e. coordinated muscle co-contractions, is considered to be one putative mechanism that enables organisms to learn muscle control in a simplified and generalizable action space. Drawing inspiration from this evolved motor control strategy, we use physiologically accurate human hand and leg models as a testbed for determining the extent to which a Synergistic Action Representation (SAR) acquired from simpler tasks facilitates learning more complex tasks. We find in both cases that SAR-exploiting policies significantly outperform end-to-end reinforcement learning. Policies trained with SAR were able to achieve robust locomotion on a wide set of terrains with high sample efficiency, while baseline approaches failed to learn meaningful behaviors. Additionally, policies trained with SAR on a multiobject manipulation task significantly outperformed (>70% success) baseline approaches (<20% success). Both of these SAR-exploiting policies were also found to generalize zero-shot to out-of-domain environmental conditions, while policies that did not adopt SAR failed to generalize. Finally, we establish the generality of SAR on broader high-dimensional control problems using a robotic manipulation task set and a full-body humanoid locomotion task. To the best of our knowledge, this investigation is the first of its kind to present an end-to-end pipeline for discovering synergies and using this representation to learn high-dimensional continuous control across a wide diversity of tasks.

* Presented at RSS 2023

Via

Access Paper or Ask Questions

TorchRL: A data-driven decision-making library for PyTorch

Jun 01, 2023

Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens

Abstract:Striking a balance between integration and modularity is crucial for a machine learning library to be versatile and user-friendly, especially in handling decision and control tasks that involve large development teams and complex, real-world data, and environments. To address this issue, we propose TorchRL, a generalistic control library for PyTorch that provides well-integrated, yet standalone components. With a versatile and robust primitive design, TorchRL facilitates streamlined algorithm development across the many branches of Reinforcement Learning (RL) and control. We introduce a new PyTorch primitive, TensorDict, as a flexible data carrier that empowers the integration of the library's components while preserving their modularity. Hence replay buffers, datasets, distributed data collectors, environments, transforms and objectives can be effortlessly used in isolation or combined. We provide a detailed description of the building blocks, supporting code examples and an extensive overview of the library across domains and tasks. Finally, we show comparative benchmarks to demonstrate its computational efficiency. TorchRL fosters long-term support and is publicly available on GitHub for greater reproducibility and collaboration within the research community. The code is opensourced on https://github.com/pytorch/rl.

Via

Access Paper or Ask Questions

LIV: Language-Image Representations and Rewards for Robotic Control

Jun 01, 2023

Yecheng Jason Ma, William Liang, Vaidehi Som, Vikash Kumar, Amy Zhang, Osbert Bastani, Dinesh Jayaraman

Figure 1 for LIV: Language-Image Representations and Rewards for Robotic Control

Figure 2 for LIV: Language-Image Representations and Rewards for Robotic Control

Figure 3 for LIV: Language-Image Representations and Rewards for Robotic Control

Figure 4 for LIV: Language-Image Representations and Rewards for Robotic Control

Abstract:We present Language-Image Value learning (LIV), a unified objective for vision-language representation and reward learning from action-free videos with text annotations. Exploiting a novel connection between dual reinforcement learning and mutual information contrastive learning, the LIV objective trains a multi-modal representation that implicitly encodes a universal value function for tasks specified as language or image goals. We use LIV to pre-train the first control-centric vision-language representation from large human video datasets such as EpicKitchen. Given only a language or image goal, the pre-trained LIV model can assign dense rewards to each frame in videos of unseen robots or humans attempting that task in unseen environments. Further, when some target domain-specific data is available, the same objective can be used to fine-tune and improve LIV and even other pre-trained representations for robotic control and reward specification in that domain. In our experiments on several simulated and real-world robot environments, LIV models consistently outperform the best prior input state representations for imitation learning, as well as reward specification methods for policy synthesis. Our results validate the advantages of joint vision-language representation and reward learning within the unified, compact LIV framework.

* Extended version of ICML 2023 camera-ready; Project website: https://penn-pal-lab.github.io/LIV/

Via

Access Paper or Ask Questions

MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Apr 27, 2023

Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu(+13 more)

Figure 1 for MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Figure 2 for MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Figure 3 for MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Figure 4 for MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Abstract:Depth completion from RGB images and sparse Time-of-Flight (ToF) measurements is an important problem in computer vision and robotics. While traditional methods for depth completion have relied on stereo vision or structured light techniques, recent advances in deep learning have enabled more accurate and efficient completion of depth maps from RGB images and sparse ToF measurements. To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition. The competition aimed to encourage research in this area by providing a standardized dataset and evaluation metrics to compare the accuracy of different approaches. In this report, we present the results of the competition and analyze the strengths and weaknesses of the top-performing methods. We also discuss the implications of our findings for future research in RGB+sparse ToF depth completion. We hope that this competition and report will help to advance the state-of-the-art in this important area of research. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2023.

* arXiv admin note: substantial text overlap with arXiv:2209.07057

Via

Access Paper or Ask Questions

Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Apr 23, 2023

Tony Z. Zhao, Vikash Kumar, Sergey Levine, Chelsea Finn

Figure 1 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Figure 2 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Figure 3 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Figure 4 for Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware

Abstract:Fine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously difficult for robots because they require precision, careful coordination of contact forces, and closed-loop visual feedback. Performing these tasks typically requires high-end robots, accurate sensors, or careful calibration, which can be expensive and difficult to set up. Can learning enable low-cost and imprecise hardware to perform these fine manipulation tasks? We present a low-cost system that performs end-to-end imitation learning directly from real demonstrations, collected with a custom teleoperation interface. Imitation learning, however, presents its own challenges, particularly in high-precision domains: errors in the policy can compound over time, and human demonstrations can be non-stationary. To address these challenges, we develop a simple yet novel algorithm, Action Chunking with Transformers (ACT), which learns a generative model over action sequences. ACT allows the robot to learn 6 difficult tasks in the real world, such as opening a translucent condiment cup and slotting a battery with 80-90% success, with only 10 minutes worth of demonstrations. Project website: https://tonyzhaozh.github.io/aloha/

Via

Access Paper or Ask Questions

GenAug: Retargeting behaviors to unseen situations via Generative Augmentation

Feb 22, 2023

Zoey Chen, Sho Kiami, Abhishek Gupta, Vikash Kumar

Figure 1 for GenAug: Retargeting behaviors to unseen situations via Generative Augmentation

Figure 2 for GenAug: Retargeting behaviors to unseen situations via Generative Augmentation

Figure 3 for GenAug: Retargeting behaviors to unseen situations via Generative Augmentation

Figure 4 for GenAug: Retargeting behaviors to unseen situations via Generative Augmentation

Abstract:Robot learning methods have the potential for widespread generalization across tasks, environments, and objects. However, these methods require large diverse datasets that are expensive to collect in real-world robotics settings. For robot learning to generalize, we must be able to leverage sources of data or priors beyond the robot's own experience. In this work, we posit that image-text generative models, which are pre-trained on large corpora of web-scraped data, can serve as such a data source. We show that despite these generative models being trained on largely non-robotics data, they can serve as effective ways to impart priors into the process of robot learning in a way that enables widespread generalization. In particular, we show how pre-trained generative models can serve as effective tools for semantically meaningful data augmentation. By leveraging these pre-trained models for generating appropriate "semantic" data augmentations, we propose a system GenAug that is able to significantly improve policy generalization. We apply GenAug to tabletop manipulation tasks, showing the ability to re-target behavior to novel scenarios, while only requiring marginal amounts of real-world data. We demonstrate the efficacy of this system on a number of object manipulation problems in the real world, showing a 40% improvement in generalization to novel scenes and objects.

Via

Access Paper or Ask Questions

Zero-Shot Robot Manipulation from Passive Human Videos

Feb 03, 2023

Homanga Bharadhwaj, Abhinav Gupta, Shubham Tulsiani, Vikash Kumar

Abstract:Can we learn robot manipulation for everyday tasks, only by watching videos of humans doing arbitrary tasks in different unstructured settings? Unlike widely adopted strategies of learning task-specific behaviors or direct imitation of a human video, we develop a a framework for extracting agent-agnostic action representations from human videos, and then map it to the agent's embodiment during deployment. Our framework is based on predicting plausible human hand trajectories given an initial image of a scene. After training this prediction model on a diverse set of human videos from the internet, we deploy the trained model zero-shot for physical robot manipulation tasks, after appropriate transformations to the robot's embodiment. This simple strategy lets us solve coarse manipulation tasks like opening and closing drawers, pushing, and tool use, without access to any in-domain robot manipulation trajectories. Our real-world deployment results establish a strong baseline for action prediction information that can be acquired from diverse arbitrary videos of human activities, and be useful for zero-shot robotic manipulation in unseen scenes.

* Preprint. Under review

Via

Access Paper or Ask Questions

Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Dec 19, 2022

Kelvin Xu, Zheyuan Hu, Ria Doshi, Aaron Rovinsky, Vikash Kumar, Abhishek Gupta, Sergey Levine

Figure 1 for Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Figure 2 for Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Figure 3 for Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Figure 4 for Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance

Abstract:Complex and contact-rich robotic manipulation tasks, particularly those that involve multi-fingered hands and underactuated object manipulation, present a significant challenge to any control method. Methods based on reinforcement learning offer an appealing choice for such settings, as they can enable robots to learn to delicately balance contact forces and dexterously reposition objects without strong modeling assumptions. However, running reinforcement learning on real-world dexterous manipulation systems often requires significant manual engineering. This negates the benefits of autonomous data collection and ease of use that reinforcement learning should in principle provide. In this paper, we describe a system for vision-based dexterous manipulation that provides a "programming-free" approach for users to define new tasks and enable robots with complex multi-fingered hands to learn to perform them through interaction. The core principle underlying our system is that, in a vision-based setting, users should be able to provide high-level intermediate supervision that circumvents challenges in teleoperation or kinesthetic teaching which allow a robot to not only learn a task efficiently but also to autonomously practice. Our system includes a framework for users to define a final task and intermediate sub-tasks with image examples, a reinforcement learning procedure that learns the task autonomously without interventions, and experimental results with a four-finger robotic hand learning multi-stage object manipulation tasks directly in the real world, without simulation, manual modeling, or reward engineering.

* First two authors contributed equally

Via

Access Paper or Ask Questions