Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dusica Marijan

Industry-Academia Research Collaboration in Software Engineering: The Certus Model

Apr 23, 2022

Dusica Marijan, Arnaud Gotlieb

Figure 1 for Industry-Academia Research Collaboration in Software Engineering: The Certus Model

Figure 2 for Industry-Academia Research Collaboration in Software Engineering: The Certus Model

Figure 3 for Industry-Academia Research Collaboration in Software Engineering: The Certus Model

Figure 4 for Industry-Academia Research Collaboration in Software Engineering: The Certus Model

Abstract:Context: Research collaborations between software engineering industry and academia can provide significant benefits to both sides, including improved innovation capacity for industry, and real-world environment for motivating and validating research ideas. However, building scalable and effective research collaborations in software engineering is known to be challenging. While such challenges can be varied and many, in this paper we focus on the challenges of achieving participative knowledge creation supported by active dialog between industry and academia and continuous commitment to joint problem solving. Objective: This paper aims to understand what are the elements of a successful industry-academia collaboration that enable the culture of participative knowledge creation. Method: We conducted participant observation collecting qualitative data spanning 8 years of collaborative research between a software engineering research group on software V&V and the Norwegian IT sector. The collected data was analyzed and synthesized into a practical collaboration model, named the Certus Model. Results: The model is structured in seven phases, describing activities from setting up research projects to the exploitation of research results. As such, the Certus model advances other collaborations models from literature by delineating different phases covering the complete life cycle of participative research knowledge creation. Conclusion: The Certus model describes the elements of a research collaboration process between researchers and practitioners in software engineering, grounded on the principles of research knowledge co-creation and continuous commitment to joint problem solving. The model can be applied and tested in other contexts where it may be adapted to the local context through experimentation.

* Information and Software Technology, Volume 132, 2021, 106473, ISSN 0950-5849

Via

Access Paper or Ask Questions

Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing

Apr 22, 2022

Dusica Marijan

Figure 1 for Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing

Figure 2 for Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing

Figure 3 for Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing

Figure 4 for Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing

Abstract:There is a growing body of research indicating the potential of machine learning to tackle complex software testing challenges. One such challenge pertains to continuous integration testing, which is highly time-constrained, and generates a large amount of data coming from iterative code commits and test runs. In such a setting, we can use plentiful test data for training machine learning predictors to identify test cases able to speed up the detection of regression bugs introduced during code integration. However, different machine learning models can have different fault prediction performance depending on the context and the parameters of continuous integration testing, for example variable time budget available for continuous integration cycles, or the size of test execution history used for learning to prioritize failing test cases. Existing studies on test case prioritization rarely study both of these factors, which are essential for the continuous integration practice. In this study we perform a comprehensive comparison of the fault prediction performance of machine learning approaches that have shown the best performance on test case prioritization tasks in the literature. We evaluate the accuracy of the classifiers in predicting fault-detecting tests for different values of the continuous integration time budget and with different length of test history used for training the classifiers. In evaluation, we use real-world industrial datasets from a continuous integration practice. The results show that different machine learning models have different performance for different size of test history used for model training and for different time budget available for test case execution. Our results imply that machine learning approaches for test prioritization in continuous integration testing should be carefully configured to achieve optimal performance.

Via

Access Paper or Ask Questions

Evaluating the Robustness of Deep Reinforcement Learning for Autonomous and Adversarial Policies in a Multi-agent Urban Driving Environment

Dec 22, 2021

Aizaz Sharif, Dusica Marijan

Figure 1 for Evaluating the Robustness of Deep Reinforcement Learning for Autonomous and Adversarial Policies in a Multi-agent Urban Driving Environment

Figure 2 for Evaluating the Robustness of Deep Reinforcement Learning for Autonomous and Adversarial Policies in a Multi-agent Urban Driving Environment

Figure 3 for Evaluating the Robustness of Deep Reinforcement Learning for Autonomous and Adversarial Policies in a Multi-agent Urban Driving Environment

Figure 4 for Evaluating the Robustness of Deep Reinforcement Learning for Autonomous and Adversarial Policies in a Multi-agent Urban Driving Environment

Abstract:Deep reinforcement learning is actively used for training autonomous driving agents in a vision-based urban simulated environment. Due to the large availability of various reinforcement learning algorithms, we are still unsure of which one works better while training autonomous cars in single-agent as well as multi-agent driving environments. A comparison of deep reinforcement learning in vision-based autonomous driving will open up the possibilities for training better autonomous car policies. Also, autonomous cars trained on deep reinforcement learning-based algorithms are known for being vulnerable to adversarial attacks, and we have less information on which algorithms would act as a good adversarial agent. In this work, we provide a systematic evaluation and comparative analysis of 6 deep reinforcement learning algorithms for autonomous and adversarial driving in four-way intersection scenario. Specifically, we first train autonomous cars using state-of-the-art deep reinforcement learning algorithms. Second, we test driving capabilities of the trained autonomous policies in single-agent as well as multi-agent scenarios. Lastly, we use the same deep reinforcement learning algorithms to train adversarial driving agents, in order to test the driving performance of autonomous cars and look for possible collision and offroad driving scenarios. We perform experiments by using vision-only high fidelity urban driving simulated environments.

Via

Access Paper or Ask Questions

Adversarial Deep Reinforcement Learning for Trustworthy Autonomous Driving Policies

Dec 22, 2021

Aizaz Sharif, Dusica Marijan

Figure 1 for Adversarial Deep Reinforcement Learning for Trustworthy Autonomous Driving Policies

Figure 2 for Adversarial Deep Reinforcement Learning for Trustworthy Autonomous Driving Policies

Figure 3 for Adversarial Deep Reinforcement Learning for Trustworthy Autonomous Driving Policies

Figure 4 for Adversarial Deep Reinforcement Learning for Trustworthy Autonomous Driving Policies

Abstract:Deep reinforcement learning is widely used to train autonomous cars in a simulated environment. Still, autonomous cars are well known for being vulnerable when exposed to adversarial attacks. This raises the question of whether we can train the adversary as a driving agent for finding failure scenarios in autonomous cars, and then retrain autonomous cars with new adversarial inputs to improve their robustness. In this work, we first train and compare adversarial car policy on two custom reward functions to test the driving control decision of autonomous cars in a multi-agent setting. Second, we verify that adversarial examples can be used not only for finding unwanted autonomous driving behavior, but also for helping autonomous driving cars in improving their deep reinforcement learning policies. By using a high fidelity urban driving simulation environment and vision-based driving agents, we demonstrate that the autonomous cars retrained using the adversary player noticeably increase the performance of their driving policies in terms of reducing collision and offroad steering errors.

Via

Access Paper or Ask Questions

DeepOrder: Deep Learning for Test Case Prioritization in Continuous Integration Testing

Oct 14, 2021

Aizaz Sharif, Dusica Marijan, Marius Liaaen

Abstract:Continuous integration testing is an important step in the modern software engineering life cycle. Test prioritization is a method that can improve the efficiency of continuous integration testing by selecting test cases that can detect faults in the early stage of each cycle. As continuous integration testing produces voluminous test execution data, test history is a commonly used artifact in test prioritization. However, existing test prioritization techniques for continuous integration either cannot handle large test history or are optimized for using a limited number of historical test cycles. We show that such a limitation can decrease fault detection effectiveness of prioritized test suites. This work introduces DeepOrder, a deep learning-based model that works on the basis of regression machine learning. DeepOrder ranks test cases based on the historical record of test executions from any number of previous test cycles. DeepOrder learns failed test cases based on multiple factors including the duration and execution status of test cases. We experimentally show that deep neural networks, as a simple regression model, can be efficiently used for test case prioritization in continuous integration testing. DeepOrder is evaluated with respect to time-effectiveness and fault detection effectiveness in comparison with an industry practice and the state of the art approaches. The results show that DeepOrder outperforms the industry practice and state-of-the-art test prioritization approaches in terms of these two metrics.

* 10 pages, 9 figures, conference paper

Via

Access Paper or Ask Questions

Opening the Software Engineering Toolbox for the Assessment of Trustworthy AI

Jul 14, 2020

Mohit Kumar Ahuja, Mohamed-Bachir Belaid, Pierre Bernabé, Mathieu Collet, Arnaud Gotlieb, Chhagan Lal, Dusica Marijan, Sagar Sen, Aizaz Sharif, Helge Spieker

Figure 1 for Opening the Software Engineering Toolbox for the Assessment of Trustworthy AI

Figure 2 for Opening the Software Engineering Toolbox for the Assessment of Trustworthy AI

Abstract:Trustworthiness is a central requirement for the acceptance and success of human-centered artificial intelligence (AI). To deem an AI system as trustworthy, it is crucial to assess its behaviour and characteristics against a gold standard of Trustworthy AI, consisting of guidelines, requirements, or only expectations. While AI systems are highly complex, their implementations are still based on software. The software engineering community has a long-established toolbox for the assessment of software systems, especially in the context of software testing. In this paper, we argue for the application of software engineering and testing practices for the assessment of trustworthy AI. We make the connection between the seven key requirements as defined by the European Commission's AI high-level expert group and established procedures from software engineering and raise questions for future work.

* 1st International Workshop on New Foundations for Human-Centered AI @ ECAI 2020

Via

Access Paper or Ask Questions

Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

Nov 09, 2018

Helge Spieker, Arnaud Gotlieb, Dusica Marijan, Morten Mossige

Figure 1 for Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

Figure 2 for Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

Figure 3 for Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

Figure 4 for Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

Abstract:Testing in Continuous Integration (CI) involves test case prioritization, selection, and execution at each cycle. Selecting the most promising test cases to detect bugs is hard if there are uncertainties on the impact of committed code changes or, if traceability links between code and tests are not available. This paper introduces Retecs, a new method for automatically learning test case selection and prioritization in CI with the goal to minimize the round-trip time between code commits and developer feedback on failed test cases. The Retecs method uses reinforcement learning to select and prioritize test cases according to their duration, previous last execution and failure history. In a constantly changing environment, where new test cases are created and obsolete test cases are deleted, the Retecs method learns to prioritize error-prone test cases higher under guidance of a reward function and by observing previous CI cycles. By applying Retecs on data extracted from three industrial case studies, we show for the first time that reinforcement learning enables fruitful automatic adaptive test case selection and prioritization in CI and regression testing.

* Spieker, H., Gotlieb, A., Marijan, D., & Mossige, M. (2017). Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration. In Proceedings of 26th International Symposium on Software Testing and Analysis (ISSTA'17) (pp. 12--22). ACM

Via

Access Paper or Ask Questions

Stratified Constructive Disjunction and Negation in Constraint Programming

Nov 09, 2018

Arnaud Gotlieb, Dusica Marijan, Helge Spieker

Figure 1 for Stratified Constructive Disjunction and Negation in Constraint Programming

Figure 2 for Stratified Constructive Disjunction and Negation in Constraint Programming

Figure 3 for Stratified Constructive Disjunction and Negation in Constraint Programming

Abstract:Constraint Programming (CP) is a powerful declarative programming paradigm combining inference and search in order to find solutions to various type of constraint systems. Dealing with highly disjunctive constraint systems is notoriously difficult in CP. Apart from trying to solve each disjunct independently from each other, there is little hope and effort to succeed in constructing intermediate results combining the knowledge originating from several disjuncts. In this paper, we propose If Then Else (ITE), a lightweight approach for implementing stratified constructive disjunction and negation on top of an existing CP solver, namely SICStus Prolog clp(FD). Although constructive disjunction is known for more than three decades, it does not have straightforward implementations in most CP solvers. ITE is a freely available library proposing stratified and constructive reasoning for various operators, including disjunction and negation, implication and conditional. Our preliminary experimental results show that ITE is competitive with existing approaches that handle disjunctive constraint systems.

* Published in the SAT/CSP Track of the International Conference on Tools with Artificial Intelligence (ICTAI 2018)

Via

Access Paper or Ask Questions