Zitao Liu

pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models

Jun 23, 2022

Zitao Liu, Qiongqiong Liu, Jiahao Chen, Shuyan Huang, Jiliang Tang, Weiqi Luo

Abstract: Knowledge tracing (KT) is the task of using students' historical learning interaction data to model their knowledge mastery over time so as to make predictions on their future interaction performance. Recently, remarkable progress has been made in using various deep learning techniques to solve the KT problem. However, the success behind deep learning based knowledge tracing (DLKT) approaches is still left somewhat mysterious, and proper measurement and analysis of these DLKT approaches remain a challenge. First, data preprocessing procedures in existing works are often private and/or custom, which limits experimental standardization. Furthermore, existing DLKT studies often differ in terms of the evaluation protocol and are far removed from real-world educational contexts. To address these problems, we introduce a comprehensive Python-based benchmark platform, pyKT, to guarantee valid comparisons across DLKT methods via thorough evaluations. The pyKT library consists of a standardized set of integrated data preprocessing procedures on 7 popular datasets across different domains, and 10 frequently compared DLKT model implementations for transparent experiments. Results from our fine-grained and rigorous empirical KT studies yield a set of observations and suggestions for effective DLKT, e.g., a wrong evaluation setting may cause label leakage that generally leads to performance inflation; and the improvement of many DLKT approaches is minimal compared to the very first DLKT model proposed by Piech et al. (2015). We have open sourced pyKT and our experimental results at https://pykt.org/. We welcome contributions from other research groups and practitioners.

A Knowledge-Based Decision Support System for In Vitro Fertilization Treatment

Jan 27, 2022

Xizhe Wang, Ning Zhang, Jia Wang, Jing Ni, Xinzi Sun, John Zhang, Zitao Liu, Yu Cao, Benyuan Liu

Abstract: In Vitro Fertilization (IVF) is the most widely used Assisted Reproductive Technology (ART). IVF usually involves controlled ovarian stimulation, oocyte retrieval, and fertilization in the laboratory with subsequent embryo transfer. The first two steps correspond to the follicular phase and ovulation in the menstrual cycle. Therefore, we refer to them as the treatment cycle in our paper. The treatment cycle is crucial because the stimulation medications in IVF treatment are applied directly to patients. To optimize the stimulation effects and lower the side effects of the stimulation medications, prompt treatment adjustments are needed. In addition, the quality and quantity of the retrieved oocytes have a significant effect on the outcome of the following procedures. To improve the IVF success rate, we propose a knowledge-based decision support system that can provide medical advice on the treatment protocol and medication adjustment for each patient visit during the IVF treatment cycle. Our system is lightweight and efficient in data processing, so it can be easily embedded into electronic medical record systems. Moreover, an oocyte retrieval oriented evaluation demonstrates that our system performs well in terms of accuracy of advice for the protocols and medications.

* 8 pages, 2020 IEEE International Conference on E-health Networking, Application & Services (HEALTHCOM). IEEE, 2021

CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations

Sep 01, 2021

Hang Li, Yu Kang, Tianqiao Liu, Wenbiao Ding, Zitao Liu

Abstract: Existing audio-language task-specific predictive approaches focus on building complicated late-fusion mechanisms. However, these models face the challenges of overfitting with limited labels and low generalization ability. In this paper, we present a Cross-modal Transformer for Audio-and-Language, i.e., CTAL, which aims to learn the intra-modality and inter-modality connections between audio and language through two proxy tasks on a large amount of audio-and-language pairs: masked language modeling and masked cross-modal acoustic modeling. After fine-tuning our pre-trained model on multiple downstream audio-and-language tasks, we observe significant improvements across various tasks, such as emotion classification, sentiment analysis, and speaker verification. On this basis, we further propose a specially designed fusion mechanism that can be used in the fine-tuning phase, which allows our pre-trained model to achieve better performance. Lastly, we present detailed ablation studies showing that both our novel cross-modality fusion component and audio-language pre-training methods significantly contribute to the promising results.

* The 2021 Conference on Empirical Methods in Natural Language Processing

Temporal-aware Language Representation Learning From Crowdsourced Labels

Jul 15, 2021

Yang Hao, Xiao Zhai, Wenbiao Ding, Zitao Liu

Abstract: Learning effective language representations from crowdsourced labels is crucial for many real-world machine learning tasks. A challenging aspect of this problem is that the quality of crowdsourced labels suffers from high intra- and inter-observer variability. Since high-capacity deep neural networks can easily memorize all disagreements among crowdsourced labels, directly applying existing supervised language representation learning algorithms may yield suboptimal solutions. In this paper, we propose TACMA, a temporal-aware language representation learning heuristic for crowdsourced labels with multiple annotators. The proposed approach (1) explicitly models the intra-observer variability with an attention mechanism; (2) computes and aggregates per-sample confidence scores from multiple workers to address the inter-observer disagreements. The proposed heuristic is extremely easy to implement in around 5 lines of code. The proposed heuristic is evaluated on four synthetic and four real-world data sets. The results show that our approach outperforms a wide range of state-of-the-art baselines in terms of prediction accuracy and AUC. To encourage reproducible results, we make our code publicly available at https://github.com/CrowdsourcingMining/TACMA.

* The 59th Annual Meeting of the Association for Computational Linguistics Workshop on Representation Learning for NLP (ACL RepL4NLP 2021)
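
The TACMA abstract above mentions aggregating per-sample confidence scores from multiple workers. As a rough illustration only, and not the authors' method (which is in their repository), the sketch below uses a simple stand-in: the majority label for a sample and the fraction of annotators who agree with it. The function name `vote_confidence` is hypothetical.

```python
from collections import Counter


def vote_confidence(labels):
    """Return (majority_label, confidence) for one sample.

    Assumption for illustration: confidence is the share of annotators
    who voted for the most common label. This is a simple stand-in, not
    the scoring used in the TACMA paper.
    """
    if not labels:
        raise ValueError("need at least one annotator label")
    counts = Counter(labels)
    label, votes = counts.most_common(1)[0]
    return label, votes / len(labels)


# Five annotators, four of whom agree on label 1.
label, confidence = vote_confidence([1, 1, 0, 1, 1])
print(label, confidence)
```

A score like this could be used to down-weight samples on which annotators disagree, which is one common way to keep a high-capacity model from memorizing label noise.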

Automatic Task Requirements Writing Evaluation via Machine Reading Comprehension

Jul 15, 2021

Shiting Xu, Guowei Xu, Peilei Jia, Wenbiao Ding, Zhongqin Wu, Zitao Liu

Abstract: Task requirements (TRs) writing is an important question type in the Key English Test and the Preliminary English Test. A TR writing question may include multiple requirements, and a high-quality essay must respond to each requirement thoroughly and accurately. However, limited teacher resources prevent students from getting detailed grading instantly. The majority of existing automatic essay scoring systems focus on giving a holistic score but rarely provide reasons to support it. In this paper, we propose an end-to-end framework based on machine reading comprehension (MRC) to address this problem to some extent. The framework not only detects whether an essay responds to a requirement question, but also clearly marks where the essay answers the question. Our framework consists of three modules: a question normalization module, an ELECTRA-based MRC module, and a response locating module. We extensively explore state-of-the-art MRC methods. Our approach achieves an accuracy of 0.93 and an F1 score of 0.85 on a real-world educational dataset. To encourage reproducible results, we make our code publicly available at https://github.com/aied2021TRMRC/AIED_2021_TRMRC_code.

* AIED'21: The 22nd International Conference on Artificial Intelligence in Education, 2021

A Multimodal Machine Learning Framework for Teacher Vocal Delivery Evaluation

Jul 15, 2021

Hang Li, Yu Kang, Yang Hao, Wenbiao Ding, Zhongqin Wu, Zitao Liu

Abstract: The quality of vocal delivery is one of the key indicators for evaluating teacher enthusiasm, which has been widely accepted to be connected to overall course quality. However, existing evaluation of vocal delivery is mainly conducted with manual ratings, which face two core challenges: subjectivity and time cost. In this paper, we present a novel machine learning approach that utilizes pairwise comparisons and a multimodal orthogonal fusing algorithm to generate large-scale objective evaluation results of teacher vocal delivery in terms of fluency and passion. We collect two datasets from real-world education scenarios, and the experimental results demonstrate the effectiveness of our algorithm. To encourage reproducible results, we make our code publicly available at https://github.com/tal-ai/ML4VocalDelivery.git.

* AIED'21: The 22nd International Conference on Artificial Intelligence in Education, 2021

An Educational System for Personalized Teacher Recommendation in K-12 Online Classrooms

Jul 15, 2021

Jiahao Chen, Hang Li, Wenbiao Ding, Zitao Liu

Abstract: In this paper, we propose a simple yet effective solution to build practical teacher recommender systems for online one-on-one classes. Our system consists of (1) a pseudo matching score module that provides reliable training labels; (2) a ranking model that scores every candidate teacher; (3) a novelty boosting module that gives additional opportunities to new teachers; and (4) a diversity metric that guardrails the recommended results to reduce the chance of collision. Offline experimental results show that our approach outperforms a wide range of baselines. Furthermore, we show that our approach is able to reduce the number of student-teacher matching attempts from 7.22 to 3.09 in a five-month observation on a third-party online education platform.

* AIED'21: The 22nd International Conference on Artificial Intelligence in Education, 2021

Solving ESL Sentence Completion Questions via Pre-trained Neural Language Models

Jul 15, 2021

Qiongqiong Liu, Tianqiao Liu, Jiafu Zhao, Qiang Fang, Wenbiao Ding, Zhongqin Wu, Feng Xia, Jiliang Tang, Zitao Liu

Abstract: Sentence completion (SC) questions present a sentence with one or more blanks that need to be filled in, with three to five possible words or phrases as options. SC questions are widely used for students learning English as a Second Language (ESL), and building computational approaches to automatically solve such questions is beneficial to language learners. In this work, we propose a neural framework to solve SC questions in English examinations by utilizing pre-trained language models. We conduct extensive experiments on a real-world K-12 ESL SC question dataset, and the results demonstrate the superiority of our model in terms of prediction accuracy. Furthermore, we run a precision-recall trade-off analysis to discuss the practical issues when deploying it in real-life scenarios. To encourage reproducible results, we make our code publicly available at https://github.com/AIED2021/ESL-SentenceCompletion.

* AIED'21: The 22nd International Conference on Artificial Intelligence in Education, 2021

Multi-Task Learning based Online Dialogic Instruction Detection with Pre-trained Language Models

Jul 15, 2021

Yang Hao, Hang Li, Wenbiao Ding, Zhongqin Wu, Jiliang Tang, Rose Luckin, Zitao Liu

Abstract: In this work, we study computational approaches to detect online dialogic instructions, which are widely used to help students understand learning materials and build effective study habits. This task is rather challenging due to the widely varying quality and pedagogical styles of dialogic instructions. To address these challenges, we utilize pre-trained language models and propose a multi-task paradigm which enhances the ability to distinguish instances of different classes by enlarging the margin between categories via contrastive loss. Furthermore, we design a strategy to fully exploit the misclassified examples during the training stage. Extensive experiments on a real-world online educational data set demonstrate that our approach achieves superior performance compared to representative baselines. To encourage reproducible results, we make our implementation available online at https://github.com/AIED2021/multitask-dialogic-instruction.

* AIED'21: The 22nd International Conference on Artificial Intelligence in Education, 2021
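
The abstract above describes enlarging the margin between categories via contrastive loss but does not give the formula. As a minimal sketch, the code below implements the classic pairwise margin-based contrastive loss on two embedding vectors; the paper's exact loss may differ, and the function name and default margin are assumptions made here for illustration.

```python
import math


def contrastive_loss(a, b, same_class, margin=1.0):
    """Classic pairwise contrastive loss on two embedding vectors.

    Same-class pairs are pulled together (squared distance); pairs from
    different classes are pushed apart until they are at least `margin`
    away. Illustrative only; not taken from the paper's implementation.
    """
    distance = math.dist(a, b)  # Euclidean distance, Python 3.8+
    if same_class:
        return distance ** 2
    return max(0.0, margin - distance) ** 2


# A different-class pair closer than the margin is penalized.
print(contrastive_loss([0.0, 0.0], [0.0, 0.5], same_class=False))
# A different-class pair already beyond the margin costs nothing.
print(contrastive_loss([0.0, 0.0], [3.0, 4.0], same_class=False))
```

In a multi-task setup, a term like this would typically be added to the usual classification loss, so the encoder is trained both to classify and to separate classes in embedding space.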

Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining

Jul 15, 2021

Guowei Xu, Wenbiao Ding, Weiping Fu, Zhongqin Wu, Zitao Liu

Abstract: Many real-world applications involve the use of Optical Character Recognition (OCR) engines to transform handwritten images into transcripts on which downstream Natural Language Processing (NLP) models are applied. In this process, OCR engines may introduce errors, and inputs to downstream NLP models become noisy. Although pre-trained models achieve state-of-the-art performance in many NLP benchmarks, we show that they are not robust to noisy texts generated by real OCR engines. This greatly limits the application of NLP models in real-world scenarios. In order to improve model performance on noisy OCR transcripts, it is natural to train the NLP model on labelled noisy texts. However, in most cases there are only labelled clean texts. Since there are no handwritten pictures corresponding to the text, it is impossible to directly use the recognition model to obtain noisy labelled data. Human resources can be employed to copy texts and take pictures, but this is extremely expensive considering the size of data for model training. Consequently, we are interested in making NLP models intrinsically robust to OCR errors in a low resource manner. We propose a novel robust training framework which 1) employs simple but effective methods to directly simulate natural OCR noise from clean texts, 2) iteratively mines the hard examples from a large number of simulated samples for optimal performance, and 3) employs a stability loss so that the model learns noise-invariant representations. Experiments on three real-world datasets show that the proposed framework boosts the robustness of pre-trained models by a large margin. We believe that this work can greatly promote the application of NLP models in actual scenarios, although the algorithm we use is simple and straightforward. We make our code and three datasets publicly available at https://github.com/tal-ai/Robust-learning-MSSHEM.

* ECML-PKDD'21: The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021
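
The abstract above refers to simulating OCR noise directly from clean text without describing the procedure. As a hedged sketch of one possible approach, and not the authors' multi-source simulation, the code below randomly swaps characters for visually similar ones. The confusion table, the function name `add_ocr_noise`, and the `rate` parameter are all hypothetical.

```python
import random

# Hypothetical table of visually confusable characters; a real one
# would be estimated from actual OCR engine errors.
CONFUSIONS = {"o": "0", "l": "1", "i": "l", "s": "5", "e": "c", "m": "rn"}


def add_ocr_noise(text, rate=0.1, seed=None):
    """Replace confusable characters with probability `rate`.

    Illustrative character-level substitution noise only; characters
    without an entry in CONFUSIONS are always kept unchanged.
    """
    rng = random.Random(seed)  # local generator keeps runs reproducible
    noisy = []
    for ch in text:
        if ch in CONFUSIONS and rng.random() < rate:
            noisy.append(CONFUSIONS[ch])
        else:
            noisy.append(ch)
    return "".join(noisy)


# The label stays the same; only the input text is corrupted.
print(add_ocr_noise("the model is robust to noise", rate=0.3, seed=0))
```

Applying a function like this to labelled clean text yields noisy training samples that keep their original labels, which is the low-resource setting the abstract targets.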
