Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shubham Singh

Phasing Through the Flames: Rapid Motion Planning with the AGHF PDE for Arbitrary Objective Functions and Constraints

May 02, 2025

Challen Enninful Adu, César E. Ramos Chuquiure, Yutong Zhou, Pearl Lin, Ruikai Yang, Bohao Zhang, Shubham Singh, Ram Vasudevan

Abstract:The generation of optimal trajectories for high-dimensional robotic systems under constraints remains computationally challenging due to the need to simultaneously satisfy dynamic feasibility, input limits, and task-specific objectives while searching over high-dimensional spaces. Recent approaches using the Affine Geometric Heat Flow (AGHF) Partial Differential Equation (PDE) have demonstrated promising results, generating dynamically feasible trajectories for complex systems like the Digit V3 humanoid within seconds. These methods efficiently solve trajectory optimization problems over a two-dimensional domain by evolving an initial trajectory to minimize control effort. However, these AGHF approaches are limited to a single type of optimal control problem (i.e., minimizing the integral of squared control norms) and typically require initial guesses that satisfy constraints to ensure satisfactory convergence. These limitations restrict the potential utility of the AGHF PDE especially when trying to synthesize trajectories for robotic systems. This paper generalizes the AGHF formulation to accommodate arbitrary cost functions, significantly expanding the classes of trajectories that can be generated. This work also introduces a Phase1 - Phase 2 Algorithm that enables the use of constraint-violating initial guesses while guaranteeing satisfactory convergence. The effectiveness of the proposed method is demonstrated through comparative evaluations against state-of-the-art techniques across various dynamical systems and challenging trajectory generation problems. Project Page: https://roahmlab.github.io/BLAZE/

* 15 pages, 5 figures

Via

Access Paper or Ask Questions

Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Sep 22, 2024

Orchid Chetia Phukan, Swarup Ranjan Behera, Shubham Singh, Muskaan Singh, Vandana Rajan, Arun Balaji Buduru, Rajesh Sharma, S. R. Mahadeva Prasanna

Figure 1 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Figure 2 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Figure 3 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Figure 4 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Abstract:In this study, we address the challenge of depression detection from speech, focusing on the potential of non-semantic features (NSFs) to capture subtle markers of depression. While prior research has leveraged various features for this task, NSFs-extracted from pre-trained models (PTMs) designed for non-semantic tasks such as paralinguistic speech processing (TRILLsson), speaker recognition (x-vector), and emotion recognition (emoHuBERT)-have shown significant promise. However, the potential of combining these diverse features has not been fully explored. In this work, we demonstrate that the amalgamation of NSFs results in complementary behavior, leading to enhanced depression detection performance. Furthermore, to our end, we introduce a simple novel framework, FuSeR, designed to effectively combine these features. Our results show that FuSeR outperforms models utilizing individual NSFs as well as baseline fusion techniques and obtains state-of-the-art (SOTA) performance in E-DAIC benchmark with RMSE of 5.51 and MAE of 4.48, establishing it as a robust approach for depression detection.

* Submitted to ICASSP 2025

Via

Access Paper or Ask Questions

Towards identifying Source credibility on Information Leakage in Digital Gadget Market

Sep 07, 2024

Neha Kumaru, Garvit Gupta, Shreyas Mongia, Shubham Singh, Ponnurangam Kumaraguru, Arun Balaji Buduru

Figure 1 for Towards identifying Source credibility on Information Leakage in Digital Gadget Market

Figure 2 for Towards identifying Source credibility on Information Leakage in Digital Gadget Market

Figure 3 for Towards identifying Source credibility on Information Leakage in Digital Gadget Market

Figure 4 for Towards identifying Source credibility on Information Leakage in Digital Gadget Market

Abstract:The use of Social media to share content is on a constant rise. One of the capsize effect of information sharing on Social media includes the spread of sensitive information on the public domain. With the digital gadget market becoming highly competitive and ever-evolving, the trend of an increasing number of sensitive posts leaking information on devices in social media is observed. Many web-blogs on digital gadget market have mushroomed recently, making the problem of information leak all pervasive. Credible leaks on specifics of an upcoming device can cause a lot of financial damage to the respective organization. Hence, it is crucial to assess the credibility of the platforms that continuously post about a smartphone or digital gadget leaks. In this work, we analyze the headlines of leak web-blog posts and their corresponding official press-release. We first collect 54, 495 leak and press-release headlines for different smartphones. We train our custom NER model to capture the evolving smartphone names with an accuracy of 82.14% on manually annotated results. We further propose a credibility score metric for the web-blog, based on the number of falsified and authentic smartphone leak posts.

Via

Access Paper or Ask Questions

KAN based Autoencoders for Factor Models

Aug 04, 2024

Tianqi Wang, Shubham Singh

Abstract:Inspired by recent advances in Kolmogorov-Arnold Networks (KANs), we introduce a novel approach to latent factor conditional asset pricing models. While previous machine learning applications in asset pricing have predominantly used Multilayer Perceptrons with ReLU activation functions to model latent factor exposures, our method introduces a KAN-based autoencoder which surpasses MLP models in both accuracy and interpretability. Our model offers enhanced flexibility in approximating exposures as nonlinear functions of asset characteristics, while simultaneously providing users with an intuitive framework for interpreting latent factors. Empirical backtesting demonstrates our model's superior ability to explain cross-sectional risk exposures. Moreover, long-short portfolios constructed using our model's predictions achieve higher Sharpe ratios, highlighting its practical value in investment management.

* 7 pages

Via

Access Paper or Ask Questions

ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

Jun 10, 2024

Orchid Chetia Phukan, Sarthak Jain, Shubham Singh, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma

Figure 1 for ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

Figure 2 for ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

Figure 3 for ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

Abstract:In this work, we focus on the detection of depression through speech analysis. Previous research has widely explored features extracted from pre-trained models (PTMs) primarily trained for paralinguistic tasks. Although these features have led to sufficient advances in speech-based depression detection, their performance declines in real-world settings. To address this, in this paper, we introduce ComFeAT, an application that employs a CNN model trained on a combination of features extracted from PTMs, a.k.a. neural features and spectral features to enhance depression detection. Spectral features are robust to domain variations, but, they are not as good as neural features in performance, suprisingly, combining them shows complementary behavior and improves over both neural and spectral features individually. The proposed method also improves over previous state-of-the-art (SOTA) works on E-DAIC benchmark.

* Accepted to INTERSPEECH 2024 Show & Tell Demonstrations

Via

Access Paper or Ask Questions

Stabilizing Circular Motion Within Nonconcentric Circular Boundary: A Mobius Transformation-Based Approach

May 11, 2024

Shubham Singh, Anoop Jain

Figure 1 for Stabilizing Circular Motion Within Nonconcentric Circular Boundary: A Mobius Transformation-Based Approach

Figure 2 for Stabilizing Circular Motion Within Nonconcentric Circular Boundary: A Mobius Transformation-Based Approach

Figure 3 for Stabilizing Circular Motion Within Nonconcentric Circular Boundary: A Mobius Transformation-Based Approach

Figure 4 for Stabilizing Circular Motion Within Nonconcentric Circular Boundary: A Mobius Transformation-Based Approach

Abstract:Nonuniform motion constraints are ubiquitous in robotic applications. Geofencing control is one such paradigm where the motion of a robot must be constrained within a predefined boundary. This paper addresses the problem of stabilizing a unicycle robot around a desired circular orbit while confining its motion within a nonconcentric external circular boundary. Our solution approach relies on the concept of the so-called Mobius transformation that, under certain practical conditions, maps two nonconcentric circles to a pair of concentric circles, and hence, results in uniform spatial motion constraints. The choice of such a Mobius transformation is governed by the roots of a quadratic equation in the post-design analysis that decides how the regions enclosed by the two circles are mapped onto the two planes. We show that the problem can be formulated either as a trajectory-constraining problem or an obstacle-avoidance problem in the transformed plane, depending on these roots. Exploiting the idea of the barrier Lyapunov function, we propose a unique control law that solves both these contrasting problems in the transformed plane and renders a solution to the original problem in the actual plane. By relating parameters of two planes under Mobius transformation and its inverse map, we further establish a connection between the control laws in two planes and determine the control law to be applied in the actual plane. Simulation and experimental results are provided to illustrate the key theoretical developments.

Via

Access Paper or Ask Questions

Transformer-based approach for Ethereum Price Prediction Using Crosscurrency correlation and Sentiment Analysis

Jan 16, 2024

Shubham Singh, Mayur Bhat

Abstract:The research delves into the capabilities of a transformer-based neural network for Ethereum cryptocurrency price forecasting. The experiment runs around the hypothesis that cryptocurrency prices are strongly correlated with other cryptocurrencies and the sentiments around the cryptocurrency. The model employs a transformer architecture for several setups from single-feature scenarios to complex configurations incorporating volume, sentiment, and correlated cryptocurrency prices. Despite a smaller dataset and less complex architecture, the transformer model surpasses ANN and MLP counterparts on some parameters. The conclusion presents a hypothesis on the illusion of causality in cryptocurrency price movements driven by sentiments.

* 12 pages

Via

Access Paper or Ask Questions

BrainVoxGen: Deep learning framework for synthesis of Ultrasound to MRI

Oct 11, 2023

Shubham Singh, Dr. Mrunal Bewoor, Ammar Ranapurwala, Satyam Rai, Sheetal Patil

Abstract:The study presents a deep learning framework aimed at synthesizing 3D MRI volumes from three-dimensional ultrasound images of the brain utilizing the Pix2Pix GAN model. The process involves inputting a 3D volume of ultrasounds into a UNET generator and patch discriminator, generating a corresponding 3D volume of MRI. Model performance was evaluated using losses on the discriminator and generator applied to a dataset of 3D ultrasound and MRI images. The results indicate that the synthesized MRI images exhibit some similarity to the expected outcomes. Despite challenges related to dataset size, computational resources, and technical complexities, the method successfully generated MRI volume with a satisfactory similarity score meant to serve as a baseline for further research. It underscores the potential of deep learning-based volume synthesis techniques for ultrasound to MRI conversion, showcasing their viability for medical applications. Further refinement and exploration are warranted for enhanced clinical relevance.

* 6 pages

Via

Access Paper or Ask Questions

Systematic Review of Techniques in Brain Image Synthesis using Deep Learning

Sep 08, 2023

Shubham Singh, Ammar Ranapurwala, Mrunal Bewoor, Sheetal Patil, Satyam Rai

Figure 1 for Systematic Review of Techniques in Brain Image Synthesis using Deep Learning

Figure 2 for Systematic Review of Techniques in Brain Image Synthesis using Deep Learning

Figure 3 for Systematic Review of Techniques in Brain Image Synthesis using Deep Learning

Figure 4 for Systematic Review of Techniques in Brain Image Synthesis using Deep Learning

Abstract:This review paper delves into the present state of medical imaging, with a specific focus on the use of deep learning techniques for brain image synthesis. The need for medical image synthesis to improve diagnostic accuracy and decrease invasiveness in medical procedures is emphasized, along with the role of deep learning in enabling these advancements. The paper examines various methods and techniques for brain image synthesis, including 2D to 3D constructions, MRI synthesis, and the use of transformers. It also addresses limitations and challenges faced in these methods, such as obtaining well-curated training data and addressing brain ultrasound issues. The review concludes by exploring the future potential of this field and the opportunities for further advancements in medical imaging using deep learning techniques. The significance of transformers and their potential to revolutionize the medical imaging field is highlighted. Additionally, the paper discusses the potential solutions to the shortcomings and limitations faced in this field. The review provides researchers with an updated reference on the present state of the field and aims to inspire further research and bridge the gap between the present state of medical imaging and the future possibilities offered by deep learning techniques.

* 8 pages

Via

Access Paper or Ask Questions

Multi-Shooting Differential Dynamic Programming for Hybrid Systems using Analytical Derivatives

Jul 24, 2023

Shubham Singh, Ryan P. Russell, Patrick M. Wensing

Figure 1 for Multi-Shooting Differential Dynamic Programming for Hybrid Systems using Analytical Derivatives

Figure 2 for Multi-Shooting Differential Dynamic Programming for Hybrid Systems using Analytical Derivatives

Figure 3 for Multi-Shooting Differential Dynamic Programming for Hybrid Systems using Analytical Derivatives

Figure 4 for Multi-Shooting Differential Dynamic Programming for Hybrid Systems using Analytical Derivatives

Abstract:Differential Dynamic Programming (DDP) is a popular technique used to generate motion for dynamic-legged robots in the recent past. However, in most cases, only the first-order partial derivatives of the underlying dynamics are used, resulting in the iLQR approach. Neglecting the second-order terms often slows down the convergence rate compared to full DDP. Multi-Shooting is another popular technique to improve robustness, especially if the dynamics are highly non-linear. In this work, we consider Multi-Shooting DDP for trajectory optimization of a bounding gait for a simplified quadruped model. As the main contribution, we develop Second-Order analytical partial derivatives of the rigid-body contact dynamics, extending our previous results for fixed/floating base models with multi-DoF joints. Finally, we show the benefits of a novel Quasi-Newton method for approximating second-order derivatives of the dynamics, leading to order-of-magnitude speedups in the convergence compared to the full DDP method.

* https://www.youtube.com/watch?v=C0h6mEpcnAE

Via

Access Paper or Ask Questions