NFDI4DS | UHH-SEMS - Publication Details

Better cross company defect prediction

OPENALEX - Publications

Fayola Peters, Tim Menzies, Andrian Marcus

How can we find data for quality prediction? Early in the life cycle, projects may lack the data needed to build such predictors. Prior work assumed that relevant training data was found nearest to the local project. But is this the best approach? This paper introduces the Peters filter, which is based on the following conjecture: When local data is scarce, more information exists in other projects. Accordingly, this filter selects training data via the structure of other projects. To assess the performance of the Peters filter, we compare it with two other approaches for quality prediction. Within-company learning and...

10.1109/msr.2013.6624057 article EN 2013-05-01

Balancing Privacy and Utility in Cross-Company Defect Prediction

Fayola Peters, Tim Menzies, Liang Gong, Hongyu Zhang

Background: Cross-company defect prediction (CCDP) is a field of study where an organization lacking enough local data can use data from other organizations for building defect predictors. To support CCDP, data must be shared. Such shared data must be privatized, but that privatization could severely damage the utility of the data. Aim: To enable effective defect prediction from shared data while preserving privacy. Method: We explore privatization algorithms that maintain class boundaries in a dataset. CLIFF is an instance pruner that deletes irrelevant examples. MORPH is a data mutator that moves the data a random...

10.1109/tse.2013.6 article EN IEEE Transactions on Software Engineering 2013-01-24

Learning from Open-Source Projects: An Empirical Study on Defect Prediction

Zhimin He, Fayola Peters, Tim Menzies, Ye Yang

The fundamental issue in cross project defect prediction is selecting the most appropriate training data for creating quality predictors. Another concern is whether historical data of open-source projects can be used to create predictors for proprietary projects from a practical point-of-view. Current studies have proposed statistical approaches to finding these data; however, thus far no apparent effort has been made to study their success on proprietary data. Also, these methods apply brute force techniques which are computationally...

10.1109/esem.2013.20 article EN 2013-10-01

Text Filtering and Ranking for Security Bug Report Prediction

Fayola Peters, Thein Than Tun, Yijun Yu, Bashar Nuseibeh

Security bug reports can describe security critical vulnerabilities in software products. Bug tracking systems may contain thousands of bug reports, where relatively few of them are security related. Therefore finding unlabelled security bugs among them can be challenging. To help security engineers identify these reports quickly and accurately, text-based prediction models have been proposed. These can often mislabel security bug reports due to a number of reasons such as class imbalance, where the ratio of non-security to security bug reports is very high. More critically, we have observed that the presence...

10.1109/tse.2017.2787653 article EN IEEE Transactions on Software Engineering 2017-12-27

LACE2: Better Privacy-Preserving Data Sharing for Cross Project Defect Prediction

Fayola Peters, Tim Menzies, Lucas Layman

Before a community can learn general principles, it must share individual experiences. Data sharing is the fundamental step of cross project defect prediction, i.e. the process of using data from one project to predict for defects in another. Prior work on secure data sharing allowed data owners to share their data on a single-party basis for defect prediction via data minimization and obfuscation. However, the studied method did not consider that bigger data required the data owner to share more of their data. In this paper, we extend previous work with LACE2 which reduces the amount of data shared by...

10.1109/icse.2015.92 article EN 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering 2015-05-01

Privacy and utility for defect prediction: Experiments with MORPH

Fayola Peters, Tim Menzies

Ideally, we can learn lessons from software projects across multiple organizations. However, a major impediment to such knowledge sharing are the privacy concerns of software development organizations. This paper aims to provide defect data-set owners with an effective means of privatizing their data prior to release. We explore MORPH which understands how to maintain class boundaries in a data-set. MORPH is a data mutator that moves the data a random distance, taking care not to cross class boundaries. The value of training on this MORPHed data is tested via a 10-way...

10.1109/icse.2012.6227194 article EN 2012 34th International Conference on Software Engineering (ICSE) 2012-06-01

LACE2: better privacy-preserving data sharing for cross project defect prediction

Fayola Peters, Tim Menzies, Lucas Layman

Before a community can learn general principles, it must share individual experiences. Data sharing is the fundamental step of cross project defect prediction, i.e. the process of using data from one project to predict for defects in another. Prior work on secure data sharing allowed data owners to share their data on a single-party basis for defect prediction via data minimization and obfuscation. However, the studied method did not consider that bigger data required the data owner to share more of their data. In this paper, we extend previous work with LACE2 which reduces the amount of data shared by...

10.5555/2818754.2818851 article EN International Conference on Software Engineering 2015-05-16

Privacy and utility for defect prediction: experiments with MORPH

Fayola Peters, Tim Menzies

Ideally, we can learn lessons from software projects across multiple organizations. However, a major impediment to such knowledge sharing are the privacy concerns of software development organizations. This paper aims to provide defect data-set owners with an effective means of privatizing their data prior to release. We explore MORPH which understands how to maintain class boundaries in a data-set. MORPH is a data mutator that moves the data a random distance, taking care not to cross class boundaries. The value of training on this MORPHed data is tested via a 10-way...

10.5555/2337223.2337246 article EN International Conference on Software Engineering 2012-06-02

Data science for software engineering

Tim Menzies, Ekrem Kocagüneli, Fayola Peters, Burak Turhan, Leandro L. Minku

Target audience: Software practitioners and researchers wanting to understand the state of the art in using data science for software engineering (SE). Content: In the age of big data, data science (the knowledge of deriving meaningful outcomes from data) is an essential skill that should be equipped by software engineers. It can be used to predict useful information on new projects based on completed projects. This tutorial offers core insights about the state-of-the-art in this important field. What participants will learn: Before data science:...

10.1109/icse.2013.6606752 article EN 2013 35th International Conference on Software Engineering (ICSE) 2013-05-01

The voice, the Word, the books: the sacred scripture of the Jews, Christians, and Muslims

Fayola Peters

10.5860/choice.45-0817 article EN Choice Reviews Online 2007-10-01

Applications of Simulation and AI Search: Assessing the Relative Merits of Agile vs Traditional Software Development

Bryan Lemon, Aaron Riesbeck, Tim Menzies, Justin Price, Joseph M. D'Alessandro, and 5 more

This paper augments Boehm-Turner's model of agile and plan-based software development augmented with an AI search algorithm. The AI search finds the key factors that predict for the success of agile or traditional developments. According to our simulations and AI search algorithm: (1) in no case did agile methods perform worse than plan-based approaches; (2) in some cases, agile performed best. Hence, we recommend that the default development practice for organizations be an agile method. The simplicity of this style of analysis begs the question: why is so much time wasted on evidence-less debates...

10.1109/ase.2009.42 article EN 2009-11-01

Dorothei Sidonii Carmen Astrologicum. Interpretationem Arabicam in Linguam Anglicam versam Una Cum Dorothei Fragmentis Et Graecis Et Latinis

Fayola Peters, David Pingree

10.2307/4348877 article EN The Classical World 1977-01-01

The art and science of analyzing software data; quantitative methods

Tim Menzies, Leandro L. Minku, Fayola Peters

Using the tools of quantitative data science, software engineers can predict useful information on new projects based on past projects. This tutorial reflects on the state-of-the-art in quantitative reasoning in this important field. This tutorial discusses the following: (a) when local data is scarce, we show how to adapt data from other organizations to local problems; (b) when working with data of dubious quality, we show how to prune spurious information; (c) when data or models seem too complex, we show how to simplify data mining results; (d) when the world changes, and old models need to be updated, we show how to handle those...

10.5555/2819009.2819229 article EN 2015-05-16

The Art and Science of Analyzing Software Data; Quantitative Methods

Tim Menzies, Leandro L. Minku, Fayola Peters

Using the tools of quantitative data science, software engineers can predict useful information on new projects based on past projects. This tutorial reflects on the state-of-the-art in quantitative reasoning in this important field. This tutorial discusses the following: (a) when local data is scarce, we show how to adapt data from other organizations to local problems; (b) when working with data of dubious quality, we show how to prune spurious information; (c) when data or models seem too complex, we show how to simplify data mining results; (d) when the world changes, and old models need to be updated, we show how to handle those...

10.1109/icse.2015.306 article EN 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering 2015-05-01

Diocles. On Burning Mirrors. The Arabic Translation of the Lost Greek Original

Fayola Peters, Diocles, G. J. Toomer

10.2307/4348870 article EN The Classical World 1977-01-01

Generating Privacy Zones in Smart Cities

Fayola Peters, Sorren Hanvey, Suresh Veluru, Alie El‐Din Mady, Menouer Boubekeur, and 1 more

Smart cities offer a variety of services to provide citizens with efficient transport, water distribution, crime prevention, and traffic control. Such services are personalized by automatically capturing, storing, and processing personally identifiable data. The disclosure of such data to a service provider raises privacy concerns for application users. As a result, research has recognized the need for privacy aware services in smart cities. In this paper we present PrivacyZones, a privacy awareness framework which requires share meaningful...

10.1109/isc2.2018.8656830 article EN 2018 IEEE International Smart Cities Conference (ISC2) 2018-09-01