NFDI4DS | UHH-SEMS - Publication Details

The Oracle Problem in Software Testing: A Survey

OPENALEX - Publications

Earl T. Barr Mark Harman Phil McMinn Muzammil Shahbaz Shin Yoo

Testing involves examining the behaviour of a system in order to discover potential faults. Given an input for a system, the challenge of distinguishing the corresponding desired, correct behaviour from potentially incorrect behavior is called the "test oracle problem". Test oracle automation is important to remove a current bottleneck that inhibits greater overall test automation. Without test oracle automation, the human has to determine whether observed behaviour is correct. The literature on test oracles has introduced techniques for oracle automation, including modelling, specifications,...

10.1109/tse.2014.2372785 article EN cc-by IEEE Transactions on Software Engineering 2014-11-20

On the naturalness of software

Abram Hindle Earl T. Barr Zhendong Su Mark Gabel Prémkumar Dévanbu

Natural languages like English are rich, complex, and powerful. The highly creative and graceful use of languages like English and Tamil, by masters like Shakespeare and Avvaiyar, can certainly delight and inspire. But in practice, given cognitive constraints and the exigencies of daily life, most human utterances are far simpler and much more repetitive and predictable. In fact, these utterances can be very usefully modeled using modern statistical methods. This fact has led to the phenomenal success of statistical approaches to speech recognition, natural language translation,...

10.5555/2337223.2337322 article EN International Conference on Software Engineering 2012-06-02

Suggesting accurate method and class names

Miltiadis Allamanis Earl T. Barr Christian Bird Charles Sutton

Descriptive names are a vital part of readable, and hence maintainable, code. Recent progress on automatically suggesting names for local variables tantalizes with the prospect of replicating that success with method and class names. However, suggesting names for methods and classes is much more difficult. This is because good method and class names need to be functionally descriptive, but suggesting such names requires that the model goes beyond local context. We introduce a neural probabilistic language model for source code that is specifically designed for the method naming problem. Our model learns which names are semantically similar...

10.1145/2786805.2786849 article EN 2015-08-26

On the naturalness of software

Abram Hindle Earl T. Barr Zhendong Su Mark Gabel Prémkumar Dévanbu

Natural languages like English are rich, complex, and powerful. The highly creative and graceful use of languages like English and Tamil, by masters like Shakespeare and Avvaiyar, can certainly delight and inspire. But in practice, given cognitive constraints and the exigencies of daily life, most human utterances are far simpler and much more repetitive and predictable. In fact, these utterances can be very usefully modeled using modern statistical methods. This fact has led to the phenomenal success of statistical approaches to speech recognition, natural language translation,...

10.1109/icse.2012.6227135 article EN 2012 34th International Conference on Software Engineering (ICSE) 2012-06-01

Learning natural coding conventions

Miltiadis Allamanis Earl T. Barr Christian Bird Charles Sutton

Every programmer has a characteristic style, ranging from preferences about identifier naming to preferences about object relationships and design patterns. Coding conventions define a consistent syntactic style, fostering readability and hence maintainability. When collaborating, programmers strive to obey a project's coding conventions. However, one third of reviews of changes contain feedback about coding conventions, indicating that programmers do not always follow them and that project members care deeply about adherence. Unfortunately, programmers are often unaware of coding conventions because...

10.1145/2635868.2635883 preprint EN 2014-11-04

The promises and perils of mining git

Christian Bird Peter C. Rigby Earl T. Barr David J. Hamilton Daniel M. Germán and 1 more

We are now witnessing the rapid growth of decentralized source code management (DSCM) systems, in which every developer has her own repository. DSCMs facilitate a style of collaboration in which work output can flow sideways (and privately) between collaborators, rather than always up and down (and publicly) via a central repository. Decentralization comes with both the promise of new data and the peril of its misinterpretation. We focus on git, a very popular DSCM used in high-profile projects. Decentralization, and other features of git, such as...

10.1109/msr.2009.5069475 article EN 2009-05-01

Is the cure worse than the disease? overfitting in automated program repair

Edward K. Smith Earl T. Barr Claire Le Goues Yuriy Brun

Automated program repair has shown promise for reducing the significant manual effort debugging requires. This paper addresses a deficit of earlier evaluations of automated repair techniques caused by repairing programs and evaluating generated patches' correctness using the same set of tests. Since tests are an imperfect metric of program correctness, evaluations of this type do not discriminate between correct patches and patches that overfit the available tests and break untested but desired functionality. This paper evaluates two well-studied repair tools, GenProg...

10.1145/2786805.2786825 article EN 2015-08-26

On the naturalness of software

Abram Hindle Earl T. Barr Mark Gabel Zhendong Su Prémkumar Dévanbu

Natural languages like English are rich, complex, and powerful. The highly creative and graceful use of languages like English and Tamil, by masters like Shakespeare and Avvaiyar, can certainly delight and inspire. But in practice, given cognitive constraints and the exigencies of daily life, most human utterances are far simpler and much more repetitive and predictable. In fact, these utterances can be very usefully modeled using modern statistical methods. This fact has led to the phenomenal success of statistical approaches to speech recognition, natural language translation,...

10.1145/2902362 article EN Communications of the ACM 2016-04-26

The plastic surgery hypothesis

Earl T. Barr Yuriy Brun Prémkumar Dévanbu Mark Harman Federica Sarro

Recent work on genetic-programming-based approaches to automatic program patching has relied on the insight that the content of new code can often be assembled out of fragments of code that already exist in the code base. This insight has been dubbed the plastic surgery hypothesis; successful, well-known automatic repair tools such as GenProg rest on this hypothesis, but it has never been validated. We formalize and validate the plastic surgery hypothesis and empirically measure the extent to which raw material for changes actually already exists in projects. In this paper, we mount a large-scale...

10.1145/2635868.2635898 article EN 2014-11-04

Deep learning type inference

Vincent J. Hellendoorn Christian Bird Earl T. Barr Miltiadis Allamanis

Dynamically typed languages such as JavaScript and Python are increasingly popular, yet static typing has not been totally eclipsed: Python now supports type annotations and languages like TypeScript offer a middle-ground for JavaScript: a strict superset of JavaScript, to which it transpiles, coupled with a type system that permits partially typed programs. However, static typing has a cost: adding annotations, reading the added syntax, and wrestling with the type system to fix type errors. Type inference can ease the transition to more statically typed code and unlock the benefits of richer...

10.1145/3236024.3236051 article EN 2018-10-26

Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization)

Toufique Ahmed Kunal Suresh Pai Prémkumar Dévanbu Earl T. Barr

Large Language Models (LLM) are a new class of computation engines, "programmed" via prompt engineering. Researchers are still learning how to best "program" these LLMs to help developers. We start with the intuition that developers tend to consciously and unconsciously collect semantics facts, from the code, while working. Mostly these are shallow, simple facts arising from a quick read. For a function, such facts might include parameter and local variable names, return expressions, simple pre- and post-conditions, and basic control and data flow, etc.

10.1145/3597503.3639183 article EN cc-by 2024-04-12

Automated software transplantation

Earl T. Barr Mark Harman Yue Jia Alexandru Marginean Justyna Petke

Automated transplantation would open many exciting avenues for software development: suppose we could autotransplant code from one system into another, entirely unrelated, system. This paper introduces a theory, an algorithm, and a tool that achieve this. Leveraging lightweight annotation, program analysis identifies an organ (interesting behavior to transplant); testing validates that the organ exhibits the desired behavior during its extraction and after its implantation into a host. While we do not claim automated transplantation is now solved...

10.1145/2771783.2771796 article EN 2015-07-10

Comparing static bug finders and statistical prediction

Foyzur Rahman Sameer S. Khatri Earl T. Barr Prémkumar Dévanbu

The all-important goal of delivering better software at lower cost has led to a vital, enduring quest for ways to find and remove software defects efficiently and accurately. To this end, two parallel lines of research have emerged over the last years. Static analysis seeks to find defects using algorithms that process well-defined semantic abstractions of code. Statistical defect prediction uses historical data to estimate parameters of statistical formulae modeling the phenomena thought to govern defect occurrence and predict where defects are likely to occur....

10.1145/2568225.2568269 article EN Proceedings of the 36th International Conference on Software Engineering 2014-05-20

Typilus: neural type hints

Miltiadis Allamanis Earl T. Barr Soline Ducousso Zheng Gao

Type inference over partial contexts in dynamically typed languages is challenging. In this work, we present a graph neural network model that predicts types by probabilistically reasoning over a program's structure, names, and patterns. The network uses deep similarity learning to learn a TypeSpace (a continuous relaxation of the discrete space of types) and how to embed the type properties of a symbol (i.e. identifier) into it. Importantly, our model can employ one-shot learning to predict an open vocabulary of types, including rare and user-defined ones....

10.1145/3385412.3385997 preprint EN 2020-06-07

Today Was a Good Day: The Daily Life of Software Developers

André N. Meyer Earl T. Barr Christian Bird Thomas Zimmermann

What is a good workday for a software developer? What is a typical workday? We seek to answer these two questions to learn how to make good days typical. Concretely, answering these questions will help to optimize development processes and select tools that increase job satisfaction and productivity. Our work adds to a large body of research on how software developers spend their time. We report the results from 5,971 responses of professional developers at Microsoft, who reflected about what made their workdays good and typical, and self-reported about how they spent their time on various activities at work....

10.1109/tse.2019.2904957 article EN IEEE Transactions on Software Engineering 2019-03-13

ConceptDoppler

Jedidiah R. Crandall Daniel Zinn Michael Byrd Earl T. Barr Rich East

ConceptDoppler: a weather tracker for internet censorship. CCS '07: Proceedings of the 14th ACM Conference on Computer and Communications Security, October 2007, pages 352–365.

10.1145/1315245.1315290 article EN 2007-10-28

BugCache for inspections

Foyzur Rahman Daryl Posnett Abram Hindle Earl T. Barr Prémkumar Dévanbu

Inspection is a highly effective but costly technique for quality control. Most companies do not have the resources to inspect all the code; thus accurate defect prediction can help focus available inspection resources. BugCache is a simple, elegant, award-winning prediction scheme that "caches" files that are likely to contain defects [12]. In this paper, we evaluate the utility of BugCache as a tool for focusing inspection, we examine the assumptions underlying BugCache with the aim of improving it, and finally we compare it with a standard bug-prediction technique....

10.1145/2025113.2025157 article EN 2011-09-06

Automatic detection of floating-point exceptions

Earl T. Barr Thanh Vo Vu Le Zhendong Su

It is well-known that floating-point exceptions can be disastrous and writing exception-free numerical programs is very difficult. Thus, it is important to automatically detect such errors. In this paper, we present Ariadne, a practical symbolic execution system specifically designed and implemented for detecting floating-point exceptions. Ariadne systematically transforms a numerical program to explicitly check each exception triggering condition. Ariadne symbolically executes the transformed program using real arithmetic to find candidate...

10.1145/2429069.2429133 article EN 2013-01-22

Uncertainty, risk, and information value in software requirements and architecture

Emmanuel Letier David Stefan Earl T. Barr

Uncertainty complicates early requirements and architecture decisions and may expose a software project to significant risk. Yet software architects lack support for evaluating uncertainty, its impact on risk, and the value of reducing uncertainty before making critical decisions. We propose to apply decision analysis and multi-objective optimisation techniques to provide such support. We present a systematic method allowing software architects to describe uncertainty about the impact of alternatives on stakeholders' goals; to calculate the consequences of uncertainty through Monte-Carlo...

10.1145/2568225.2568239 article EN Proceedings of the 36th International Conference on Software Engineering 2014-05-20

To Type or Not to Type: Quantifying Detectable Bugs in JavaScript

Zheng Gao Christian Bird Earl T. Barr

JavaScript is growing explosively and is now used in large mature projects even outside the web domain. JavaScript is also a dynamically typed language for which static type systems, notably Facebook's Flow and Microsoft's TypeScript, have been written. What benefits do these static type systems provide? Leveraging JavaScript project histories, we select a fixed bug and check out the code just prior to the fix. We manually add type annotations to the buggy code and test whether Flow and TypeScript report an error on the buggy code, thereby possibly prompting a developer to fix the bug before its...

10.1109/icse.2017.75 article EN 2017-05-01

Learning Python Code Suggestion with a Sparse Pointer Network

Avishkar Bhoopchand Tim Rocktäschel Earl T. Barr Sebastian Riedel

To enhance developer productivity, all modern integrated development environments (IDEs) include code suggestion functionality that proposes likely next tokens at the cursor. While current IDEs work well for statically-typed languages, their reliance on type annotations means that they do not provide the same level of support for dynamic programming languages as for statically-typed languages. Moreover, suggestion engines in modern IDEs do not propose expressions or multi-statement idiomatic code. Recent work has shown that language models can improve code suggestion systems by...

10.48550/arxiv.1611.08307 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Has the bug really been fixed?

Zhongxian Gu Earl T. Barr David J. Hamilton Zhendong Su

Software has bugs, and fixing those bugs pervades the software engineering process. It is folklore that bug fixes are often buggy themselves, resulting in bad fixes, either failing to fix a bug or creating new bugs. To confirm this folklore, we explored bug databases of the Ant, AspectJ, and Rhino projects, and found that bad fixes comprise as much as 9% of all bugs. Thus, detecting and correcting bad fixes is important for improving the quality and reliability of software. However, no prior work has systematically considered this bad fix problem, which this paper introduces and formalizes....

10.1145/1806799.1806812 article EN 2010-05-01