Yu Zheng

Uncovering the Secrets of Human-Like Movement: A Fresh Perspective on Motion Planning

Sep 18, 2024

Lei Shi, Qichao Liu, Cheng Zhou, Wentao Gao, Haotian Wu, Yu Zheng, Xiong Li

Abstract: This article explores human-like movement from a fresh perspective on motion planning. We analyze the coordinated and compliant movement mechanisms of the human body from the perspective of biomechanics. Based on these mechanisms, we propose an optimal control framework that integrates compliant control dynamics, optimizing robotic arm motion through a response time matrix. This matrix sets the timing parameters for joint movements, turning the system into a time-parameterized optimal control problem. The model focuses on the interaction between active and passive joints under external disturbances, improving adaptability and compliance. This method achieves optimal trajectory generation and balances precision and compliance. Experimental results on both a manipulator and a humanoid robot validate the approach.

* 7 pages

Large-scale Urban Facility Location Selection with Knowledge-informed Reinforcement Learning

Sep 03, 2024

Hongyuan Su, Yu Zheng, Jingtao Ding, Depeng Jin, Yong Li

Abstract: The facility location problem (FLP) is a classical combinatorial optimization challenge aimed at strategically laying out facilities to maximize their accessibility. In this paper, we propose a reinforcement learning method tailored to solve large-scale urban FLP, capable of producing near-optimal solutions at superfast inference speed. We distill the essential swap operation from local search, and simulate it by intelligently selecting edges on a graph of urban regions, guided by a knowledge-informed graph neural network, thus sidestepping the need for heavy computation of local search. Extensive experiments on four US cities with different geospatial conditions demonstrate that our approach can achieve comparable performance to commercial solvers with less than 5% accessibility loss, while displaying up to 1000 times speedup. We deploy our model as an online geospatial application at https://huggingface.co/spaces/randommmm/MFLP.

* 4 pages
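
The swap operation mentioned in the abstract comes from classical local search, which closes one open facility and opens a closed one whenever that lowers the total cost. The sketch below shows only that classical baseline, not the paper's learned method. It assumes a p-median-style objective (the sum of each region's distance to its nearest facility) and a toy one-dimensional distance matrix; the names `total_cost` and `swap_local_search` are hypothetical.

```python
from itertools import product


def total_cost(dist, facilities):
    """Sum over regions of the distance to the nearest open facility."""
    return sum(min(row[f] for f in facilities) for row in dist)


def swap_local_search(dist, facilities):
    """Apply the first improving swap (close one site, open another) until none is left."""
    current = set(facilities)
    best = total_cost(dist, current)
    improved = True
    while improved:
        improved = False
        closed = set(range(len(dist[0]))) - current
        for out_site, in_site in product(sorted(current), sorted(closed)):
            candidate = (current - {out_site}) | {in_site}
            cost = total_cost(dist, candidate)
            if cost < best:
                current, best, improved = candidate, cost, True
                break  # rescan all swaps from the improved solution
    return sorted(current), best


# Toy instance (an assumption for illustration): six regions on a line,
# each region is also a candidate facility site.
positions = [0, 1, 2, 10, 11, 12]
dist = [[abs(a - b) for b in positions] for a in positions]
solution, cost = swap_local_search(dist, [0, 1])
print(solution, cost)
```

Every pass evaluates all (open, closed) pairs, so the cost of one scan grows with the number of facilities times the number of candidate sites; this is the heavy computation that the abstract says the learned edge selection sidesteps.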

SFR-GNN: Simple and Fast Robust GNNs against Structural Attacks

Sep 01, 2024

Xing Ai, Guanyu Zhu, Yulin Zhu, Yu Zheng, Gaolei Li, Jianhua Li, Kai Zhou

Abstract: Graph Neural Networks (GNNs) have demonstrated commendable performance for graph-structured data. Yet, GNNs are often vulnerable to adversarial structural attacks as embedding generation relies on graph topology. Existing efforts are dedicated to purifying the maliciously modified structure or applying adaptive aggregation, thereby enhancing the robustness against adversarial structural attacks. Because the defender lacks prior knowledge about the modified structures, these approaches inevitably incur heavy computational costs. To this end, we propose an efficient defense method, called Simple and Fast Robust Graph Neural Network (SFR-GNN), supported by mutual information theory. The SFR-GNN first pre-trains a GNN model using node attributes and then fine-tunes it over the modified graph in the manner of contrastive learning, which is free of purifying modified structures and adaptive aggregation, thus achieving great efficiency gains. Consequently, SFR-GNN exhibits a 24% to 162% speedup compared to advanced robust models, demonstrating superior robustness for node classification tasks.

LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them

Jun 26, 2024

Wenya Xie, Qingying Xiao, Yu Zheng, Xidong Wang, Junying Chen, Ke Ji, Anningzhe Gao, Xiang Wan, Feng Jiang, Benyou Wang

Abstract: The recent success of Large Language Models (LLMs) has had a significant impact on the healthcare field, providing patients with medical advice, diagnostic information, and more. However, due to a lack of professional medical knowledge, patients are easily misled by erroneous information generated by LLMs, which may result in serious medical problems. To address this issue, we focus on tuning the LLMs to be medical assistants who collaborate with more experienced doctors. We first conduct a two-stage survey by inspiration-feedback to gain a broad understanding of the real needs of doctors for medical assistants. Based on this, we construct a Chinese medical dataset called DoctorFLAN to support the entire workflow of doctors, which includes 92K Q&A samples from 22 tasks and 27 specialists. Moreover, we evaluate LLMs in doctor-oriented scenarios by constructing the DoctorFLAN-test containing 550 single-turn Q&A and DotaBench containing 74 multi-turn conversations. The evaluation results indicate that being a medical assistant still poses challenges for existing open-source models, but DoctorFLAN can help them significantly. It demonstrates that the doctor-oriented dataset and benchmarks we construct can complement existing patient-oriented work and better promote medical LLMs research.

ARC: A Generalist Graph Anomaly Detector with In-Context Learning

May 27, 2024

Yixin Liu, Shiyuan Li, Yu Zheng, Qingfeng Chen, Chengqi Zhang, Shirui Pan

Abstract: Graph anomaly detection (GAD), which aims to identify abnormal nodes that differ from the majority within a graph, has garnered significant attention. However, current GAD methods necessitate training specific to each dataset, resulting in high training costs, substantial data requirements, and limited generalizability when applied to new datasets and domains. To address these limitations, this paper proposes ARC, a generalist GAD approach that enables a "one-for-all" GAD model to detect anomalies across various graph datasets on-the-fly. Equipped with in-context learning, ARC can directly extract dataset-specific patterns from the target dataset using few-shot normal samples at the inference stage, without the need for retraining or fine-tuning on the target dataset. ARC comprises three components that are well-crafted for capturing universal graph anomaly patterns: 1) smoothness-based feature Alignment module that unifies the features of different datasets into a common and anomaly-sensitive space; 2) ego-neighbor Residual graph encoder that learns abnormality-related node embeddings; and 3) cross-attentive in-Context anomaly scoring module that predicts node abnormality by leveraging few-shot normal samples. Extensive experiments on multiple benchmark datasets from various domains demonstrate the superior anomaly detection performance, efficiency, and generalizability of ARC.

* 25 pages, 10 figures

skscope: Fast Sparsity-Constrained Optimization in Python

Mar 27, 2024

Zezhi Wang, Jin Zhu, Peng Chen, Huiyang Peng, Xiaoke Zhang, Anran Wang, Yu Zheng, Junxian Zhu, Xueqin Wang

Abstract: Applying iterative solvers on sparsity-constrained optimization (SCO) requires tedious mathematical deduction and careful programming/debugging that hinders these solvers' broad impact. In the paper, the library skscope is introduced to overcome such an obstacle. With skscope, users can solve the SCO by just programming the objective function. The convenience of skscope is demonstrated through two examples in the paper, where sparse linear regression and trend filtering are addressed with just four lines of code. More importantly, skscope's efficient implementation allows state-of-the-art solvers to quickly attain the sparse solution regardless of the high dimensionality of parameter space. Numerical experiments reveal the available solvers in skscope can achieve up to 80x speedup on the competing relaxation solutions obtained via the benchmarked convex solver. skscope is published on the Python Package Index (PyPI) and Conda, and its source code is available at: https://github.com/abess-team/skscope.

* 4 pages

Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond

Mar 21, 2024

Wei Chen, Yuxuan Liang, Yuanshao Zhu, Yanchuan Chang, Kang Luo, Haomin Wen, Lei Li, Yanwei Yu, Qingsong Wen, Chao Chen (+4 more)

Abstract: Trajectory computing is a pivotal domain encompassing trajectory data management and mining, garnering widespread attention due to its crucial role in various practical applications such as location services, urban traffic, and public safety. Traditional methods, focusing on simplistic spatio-temporal features, face challenges of complex calculations, limited scalability, and inadequate adaptability to real-world complexities. In this paper, we present a comprehensive review of the development and recent advances in deep learning for trajectory computing (DL4Traj). We first define trajectory data and provide a brief overview of widely-used deep learning models. Systematically, we explore deep learning applications in trajectory management (pre-processing, storage, analysis, and visualization) and mining (trajectory-related forecasting, trajectory-related recommendation, trajectory classification, travel time estimation, anomaly detection, and mobility generation). Notably, we encapsulate recent advancements in Large Language Models (LLMs) that hold the potential to augment trajectory computing. Additionally, we summarize application scenarios, public datasets, and toolkits. Finally, we outline current challenges in DL4Traj research and propose future directions. Relevant papers and open-source resources have been collated and are continuously updated at the DL4Traj Repo: https://github.com/yoshall/Awesome-Trajectory-Computing.

* 25 pages, 12 figures, 5 tables

Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy

Mar 18, 2024

Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang

Abstract: Recently, state space models (SSMs) have gained great attention due to their promising performance, linear complexity, and long sequence modeling ability in both language and image domains. However, it is non-trivial to extend SSMs to the point cloud field, because of the causality requirement of SSMs and the disordered and irregular nature of point clouds. In this paper, we propose a novel SSM-based point cloud processing backbone, named Point Mamba, with a causality-aware ordering mechanism. To construct the causal dependency relationship, we design an octree-based ordering strategy on raw irregular points, globally sorting points in a z-order sequence and also retaining their spatial proximity. Our method achieves state-of-the-art performance compared with transformer-based counterparts, with 93.4% accuracy and 75.7 mIOU respectively on the ModelNet40 classification dataset and ScanNet semantic segmentation dataset. Furthermore, our Point Mamba has linear complexity, which is more efficient than transformer-based methods. Our method demonstrates the great potential that SSMs can serve as a generic backbone in point cloud understanding. Codes are released at https://github.com/IRMVLab/Point-Mamba.
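
A z-order (Morton) sequence of the kind named in the abstract can be obtained by quantizing each point to an integer grid cell and interleaving the bits of its three cell indices, which matches the leaf order of a depth-first octree traversal. The sketch below is a generic illustration of that idea, not the released Point Mamba code. The function names, the `bits` resolution, and the min-max quantization are assumptions.

```python
def morton_code(ix, iy, iz, bits=10):
    """Interleave the low `bits` bits of three non-negative integer grid indices."""
    code = 0
    for b in range(bits):
        code |= ((ix >> b) & 1) << (3 * b)
        code |= ((iy >> b) & 1) << (3 * b + 1)
        code |= ((iz >> b) & 1) << (3 * b + 2)
    return code


def z_order_sort(points, bits=10):
    """Sort 3D points by the Morton code of their quantized coordinates."""
    lo = [min(p[d] for p in points) for d in range(3)]
    hi = [max(p[d] for p in points) for d in range(3)]
    scale = (1 << bits) - 1  # largest grid index along each axis

    def key(p):
        # Min-max normalize each axis to [0, scale]; a flat axis maps to cell 0.
        cell = [
            int((p[d] - lo[d]) / (hi[d] - lo[d]) * scale) if hi[d] > lo[d] else 0
            for d in range(3)
        ]
        return morton_code(cell[0], cell[1], cell[2], bits=bits)

    return sorted(points, key=key)


corners = [(1.0, 1.0, 1.0), (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
print(z_order_sort(corners, bits=1))
```

Points that share high-order bits of the code fall in the same coarse octree cell, so neighbors in the sorted sequence tend to be close in space, which is the proximity property the abstract refers to.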

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Mar 15, 2024

Xin Wang, Yu Zheng, Zhongwei Wan, Mi Zhang

Abstract: The advancements in Large Language Models (LLMs) have been hindered by their substantial sizes, which necessitate LLM compression methods for practical deployment. Singular Value Decomposition (SVD) offers a promising solution for LLM compression. However, state-of-the-art SVD-based LLM compression methods have two key limitations: truncating smaller singular values may lead to higher compression loss, and the remaining model parameters are not updated after SVD truncation. In this work, we propose SVD-LLM, a new SVD-based LLM compression method that addresses the limitations of existing methods. SVD-LLM incorporates a truncation-aware data whitening strategy to ensure a direct mapping between singular values and compression loss. Moreover, SVD-LLM adopts a layer-wise closed-form model parameter update strategy to compensate for accuracy degradation caused by SVD truncation. We evaluate SVD-LLM on a total of 11 datasets and seven models from three different LLM families at four different scales. Our results demonstrate the superiority of SVD-LLM over state-of-the-art methods, especially at high model compression ratios. The source code is available at https://github.com/AIoT-MLSys-Lab/SVD-LLM.

* Under Review
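
To make the notion of a compression ratio concrete for SVD truncation in general: keeping r singular values of an m-by-n weight matrix lets it be stored as two factors of shapes m-by-r and r-by-n, which hold r(m + n) parameters instead of mn. The sketch below computes the rank that meets a target ratio under that accounting. It is a common convention assumed here for illustration, not necessarily the bookkeeping used by SVD-LLM, and both function names are hypothetical.

```python
def factored_params(m, n, rank):
    """Parameter count of an (m x rank) factor plus a (rank x n) factor."""
    return rank * (m + n)


def truncation_rank(m, n, ratio):
    """Largest rank r with r * (m + n) <= (1 - ratio) * m * n.

    `ratio` is the fraction of the original m * n parameters to remove.
    """
    if not 0.0 <= ratio < 1.0:
        raise ValueError("ratio must be in [0, 1)")
    return int((1.0 - ratio) * m * n / (m + n))


# Example: a square 4096 x 4096 projection compressed by 20%.
rank = truncation_rank(4096, 4096, 0.2)
print(rank, factored_params(4096, 4096, rank))
```

For a square matrix the factored form only saves parameters when the rank is below half the dimension, which is why every removed singular value matters more at high compression ratios.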

Semi-Supervised Learning for Anomaly Traffic Detection via Bidirectional Normalizing Flows

Mar 13, 2024

Zhangxuan Dang, Yu Zheng, Xinglin Lin, Chunlei Peng, Qiuyu Chen, Xinbo Gao

Abstract: With the rapid development of the Internet, various types of anomaly traffic are threatening network security. We consider the problem of anomaly network traffic detection and propose a three-stage anomaly detection framework using only normal traffic. Our framework can generate pseudo anomaly samples without prior knowledge of anomalies to achieve the detection of anomaly data. Firstly, we employ a reconstruction method to learn the deep representation of normal samples. Secondly, these representations are normalized to a standard normal distribution using a bidirectional flow module. To simulate anomaly samples, we add noise to the normalized representations, which are then passed through the generation direction of the bidirectional flow module. Finally, a simple classifier is trained to differentiate the normal samples and pseudo anomaly samples in the latent space. During inference, our framework requires only two modules to detect anomalous samples, leading to a considerable reduction in model size. According to the experiments, our method achieves state-of-the-art results on the common benchmarking datasets of anomaly network traffic detection. The code is available at https://github.com/ZxuanDang/ATD-via-Flows.git.