Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jie Yang

Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search

Feb 09, 2022
Lu Wang, Jie Yang, Masoumeh Zareapoor, Zhonglong Zheng

Figure 1 for Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search

Figure 2 for Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search

Figure 3 for Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search

Figure 4 for Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search

Cross-modal hashing still has some challenges needed to address: (1) most existing CMH methods take graphs as input to model data distribution. These methods omit to consider the correlation of graph structure among multiple modalities; (2) most existing CMH methods ignores considering the fusion affinity among multi-modalities data; (3) most existing CMH methods relax the discrete constraints to solve the optimization objective, significantly degrading the retrieval performance. To solve the above limitations, we propose a novel Anchor Graph Structure Fusion Hashing (AGSFH). AGSFH constructs the anchor graph structure fusion matrix from different anchor graphs of multiple modalities with the Hadamard product, which can fully exploit the geometric property of underlying data structure. Based on the anchor graph structure fusion matrix, AGSFH attempts to directly learn an intrinsic anchor graph, where the structure of the intrinsic anchor graph is adaptively tuned so that the number of components of the intrinsic graph is exactly equal to the number of clusters. Besides, AGSFH preserves the anchor fusion affinity into the common binary Hamming space. Furthermore, a discrete optimization framework is designed to learn the unified binary codes. Extensive experimental results on three public social datasets demonstrate the superiority of AGSFH.

Via

Access Paper or Ask Questions

BREAK: Bronchi Reconstruction by gEodesic transformation And sKeleton embedding

Jan 29, 2022
Weihao Yu, Hao Zheng, Minghui Zhang, Hanxiao Zhang, Jiayuan Sun, Jie Yang

Figure 1 for BREAK: Bronchi Reconstruction by gEodesic transformation And sKeleton embedding

Figure 2 for BREAK: Bronchi Reconstruction by gEodesic transformation And sKeleton embedding

Figure 3 for BREAK: Bronchi Reconstruction by gEodesic transformation And sKeleton embedding

Figure 4 for BREAK: Bronchi Reconstruction by gEodesic transformation And sKeleton embedding

Airway segmentation is critical for virtual bronchoscopy and computer-aided pulmonary disease analysis. In recent years, convolutional neural networks (CNNs) have been widely used to delineate the bronchial tree. However, the segmentation results of the CNN-based methods usually include many discontinuous branches, which need manual repair in clinical use. A major reason for the breakages is that the appearance of the airway wall can be affected by the lung disease as well as the adjacency of the vessels, while the network tends to overfit to these special patterns in the training set. To learn robust features for these areas, we design a multi-branch framework that adopts the geodesic distance transform to capture the intensity changes between airway lumen and wall. Another reason for the breakages is the intra-class imbalance. Since the volume of the peripheral bronchi may be much smaller than the large branches in an input patch, the common segmentation loss is not sensitive to the breakages among the distal branches. Therefore, in this paper, a breakage-sensitive regularization term is designed and can be easily combined with other loss functions. Extensive experiments are conducted on publicly available datasets. Compared with state-of-the-art methods, our framework can detect more branches while maintaining competitive segmentation performance.

* Accept as IEEE ISBI 2022 oral

Via

Access Paper or Ask Questions

On the Value of ML Models

Dec 13, 2021
Fabio Casati, Pierre-André Noël, Jie Yang

We argue that, when establishing and benchmarking Machine Learning (ML) models, the research community should favour evaluation metrics that better capture the value delivered by their model in practical applications. For a specific class of use cases -- selective classification -- we show that not only can it be simple enough to do, but that it has import consequences and provides insights what to look for in a ``good'' ML model.

* Fabio Casati, Pierre-Andr\'e No\"el and Jie Yang (2021, December 14). On the Value of ML Models [Poster presentation]. Workshop on Human and Machine Decisions, NeurIPS 2021, virtual. https://sites.google.com/view/whmd2021
* Poster presentation at Workshop on Human and Machine Decisions at NeurIPS 2021 (WHMD 2021). https://sites.google.com/view/whmd2021

Via

Access Paper or Ask Questions

Using Conversational Artificial Intelligence to Support Children's Search in the Classroom

Nov 30, 2021
Garrett Allen, Jie Yang, Maria Soledad Pera, Ujwal Gadiraju

We present pathways of investigation regarding conversational user interfaces (CUIs) for children in the classroom. We highlight anticipated challenges to be addressed in order to advance knowledge on CUIs for children. Further, we discuss preliminary ideas on strategies for evaluation.

* Presented at CUI@CSCW 2021 -- https://www.conversationaluserinterfaces.org/workshops/CSCW2021/pdfs/2-Allen.pdf

Via

Access Paper or Ask Questions

The Science of Rejection: A Research Area for Human Computation

Nov 11, 2021
Burcu Sayin, Jie Yang, Andrea Passerini, Fabio Casati

Figure 1 for The Science of Rejection: A Research Area for Human Computation

We motivate why the science of learning to reject model predictions is central to ML, and why human computation has a lead role in this effort.

* To appear in the Proceedings of The 9th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2021)

Via

Access Paper or Ask Questions

OctField: Hierarchical Implicit Functions for 3D Modeling

Nov 01, 2021
Jia-Heng Tang, Weikai Chen, Jie Yang, Bo Wang, Songrun Liu, Bo Yang, Lin Gao

Figure 1 for OctField: Hierarchical Implicit Functions for 3D Modeling

Figure 2 for OctField: Hierarchical Implicit Functions for 3D Modeling

Figure 3 for OctField: Hierarchical Implicit Functions for 3D Modeling

Figure 4 for OctField: Hierarchical Implicit Functions for 3D Modeling

Recent advances in localized implicit functions have enabled neural implicit representation to be scalable to large scenes. However, the regular subdivision of 3D space employed by these approaches fails to take into account the sparsity of the surface occupancy and the varying granularities of geometric details. As a result, its memory footprint grows cubically with the input volume, leading to a prohibitive computational cost even at a moderately dense decomposition. In this work, we present a learnable hierarchical implicit representation for 3D surfaces, coded OctField, that allows high-precision encoding of intricate surfaces with low memory and computational budget. The key to our approach is an adaptive decomposition of 3D scenes that only distributes local implicit functions around the surface of interest. We achieve this goal by introducing a hierarchical octree structure to adaptively subdivide the 3D space according to the surface occupancy and the richness of part geometry. As octree is discrete and non-differentiable, we further propose a novel hierarchical network that models the subdivision of octree cells as a probabilistic process and recursively encodes and decodes both octree structure and surface geometry in a differentiable manner. We demonstrate the value of OctField for a range of shape modeling and reconstruction tasks, showing superiority over alternative approaches.

* 13 pages, 9 figures, NeurIPS 2021

Via

Access Paper or Ask Questions

C$^2$SP-Net: Joint Compression and Classification Network for Epilepsy Seizure Prediction

Oct 26, 2021
Di Wu, Yi Shi, Ziyu Wang, Jie Yang, Mohamad Sawan

Figure 1 for C$^2$SP-Net: Joint Compression and Classification Network for Epilepsy Seizure Prediction

Figure 2 for C$^2$SP-Net: Joint Compression and Classification Network for Epilepsy Seizure Prediction

Figure 3 for C$^2$SP-Net: Joint Compression and Classification Network for Epilepsy Seizure Prediction

Figure 4 for C$^2$SP-Net: Joint Compression and Classification Network for Epilepsy Seizure Prediction

Recent development in brain-machine interface technology has made seizure prediction possible. However, the communication of large volume of electrophysiological signals between sensors and processing apparatus and related computation become two major bottlenecks for seizure prediction systems due to the constrained bandwidth and limited computation resource, especially for wearable and implantable medical devices. Although compressive sensing (CS) can be adopted to compress the signals to reduce communication bandwidth requirement, it needs a complex reconstruction procedure before the signal can be used for seizure prediction. In this paper, we propose C$^2$SP-Net, to jointly solve compression, prediction, and reconstruction with a single neural network. A plug-and-play in-sensor compression matrix is constructed to reduce transmission bandwidth requirement. The compressed signal can be used for seizure prediction without additional reconstruction steps. Reconstruction of the original signal can also be carried out in high fidelity. Prediction accuracy, sensitivity, false prediction rate, and reconstruction quality of the proposed framework are evaluated under various compression ratios. The experimental results illustrate that our model outperforms the competitive state-of-the-art baselines by a large margin in prediction accuracy. In particular, our proposed method produces an average loss of 0.35 % in prediction accuracy with a compression ratio ranging from 1/2 to 1/16.

Via

Access Paper or Ask Questions

AEFE: Automatic Embedded Feature Engineering for Categorical Features

Oct 19, 2021
Zhenyuan Zhong, Jie Yang, Yacong Ma, Shoubin Dong, Jinlong Hu

Figure 1 for AEFE: Automatic Embedded Feature Engineering for Categorical Features

Figure 2 for AEFE: Automatic Embedded Feature Engineering for Categorical Features

Figure 3 for AEFE: Automatic Embedded Feature Engineering for Categorical Features

Figure 4 for AEFE: Automatic Embedded Feature Engineering for Categorical Features

The challenge of solving data mining problems in e-commerce applications such as recommendation system (RS) and click-through rate (CTR) prediction is how to make inferences by constructing combinatorial features from a large number of categorical features while preserving the interpretability of the method. In this paper, we propose Automatic Embedded Feature Engineering(AEFE), an automatic feature engineering framework for representing categorical features, which consists of various components including custom paradigm feature construction and multiple feature selection. By selecting the potential field pairs intelligently and generating a series of interpretable combinatorial features, our framework can provide a set of unseen generated features for enhancing model performance and then assist data analysts in discovering the feature importance for particular data mining tasks. Furthermore, AEFE is distributed implemented by task-parallelism, data sampling, and searching schema based on Matrix Factorization field combination, to optimize the performance and enhance the efficiency and scalability of the framework. Experiments conducted on some typical e-commerce datasets indicate that our method outperforms the classical machine learning models and state-of-the-art deep learning models.

* 24 pages, 6 figures, 13 tables

Via

Access Paper or Ask Questions

Demographic Biases of Crowd Workers in Key Opinion Leaders Finding

Oct 19, 2021
Hossein A. Rahmani, Jie Yang

Key Opinion Leaders (KOLs) are people that have a strong influence and their opinions are listened to by people when making important decisions. Crowdsourcing provides an efficient and cost-effective means to gather data for the KOL finding task. However, data collected through crowdsourcing is affected by the inherent demographic biases of crowd workers. To avoid such demographic biases, we need to measure how biased each crowd worker is. In this paper, we propose a simple yet effective approach based on demographic information of candidate KOLs and their counterfactual value. We argue that it is effectiveness because of the extra information that we can consider together with labeled data to curate a less biased dataset.

* 3 pages, CSCW 2021 Workshop - Investigating and Mitigating Biases in Crowdsourced Data

Via

Access Paper or Ask Questions

Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism

Oct 19, 2021
Heng Fan, Jiaxiang Ren, Jie Yang, Yi-Xian Qin, Haibin Ling

Figure 1 for Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism

Figure 2 for Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism

Figure 3 for Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism

Figure 4 for Osteoporosis Prescreening using Panoramic Radiographs through a Deep Convolutional Neural Network with Attention Mechanism

Objectives. The aim of this study was to investigate whether a deep convolutional neural network (CNN) with an attention module can detect osteoporosis on panoramic radiographs. Study Design. A dataset of 70 panoramic radiographs (PRs) from 70 different subjects of age between 49 to 60 was used, including 49 subjects with osteoporosis and 21 normal subjects. We utilized the leave-one-out cross-validation approach to generate 70 training and test splits. Specifically, for each split, one image was used for testing and the remaining 69 images were used for training. A deep convolutional neural network (CNN) using the Siamese architecture was implemented through a fine-tuning process to classify an PR image using patches extracted from eight representative trabecula bone areas (Figure 1). In order to automatically learn the importance of different PR patches, an attention module was integrated into the deep CNN. Three metrics, including osteoporosis accuracy (OPA), non-osteoporosis accuracy (NOPA) and overall accuracy (OA), were utilized for performance evaluation. Results. The proposed baseline CNN approach achieved the OPA, NOPA and OA scores of 0.667, 0.878 and 0.814, respectively. With the help of the attention module, the OPA, NOPA and OA scores were further improved to 0.714, 0.939 and 0.871, respectively. Conclusions. The proposed method obtained promising results using deep CNN with an attention module, which might be applied to osteoporosis prescreening.

* 9 pages

Via

Access Paper or Ask Questions