NFDI4DS | UHH-SEMS - Publication Details

Fatemeh H. Fard

ORCID: 0000-0002-4505-6257

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5029327446

Research Areas

Software Engineering Research
Topic Modeling
Natural Language Processing Techniques
Software Testing and Debugging Techniques
Software Engineering Techniques and Practices
Web Data Mining and Analysis
Advanced Malware Detection Techniques
Software System Performance and Reliability
Advanced Software Engineering Methodologies
Multi-Agent Systems and Negotiation
Scientific Computing and Data Management
Data Analysis with R
Software Reliability and Analysis Research
Service-Oriented Architecture and Web Services
Web Application Security Vulnerabilities
Data Visualization and Analytics
Open Source Software Innovations
Mobile Crowdsensing and Crowdsourcing
Geographic Information Systems Studies
Mobile and Web Applications
Complex Network Analysis Techniques
Expert finding and Q&A systems
Formal Methods in Verification
Adversarial Robustness in Machine Learning
Auction Theory and Applications

University of British Columbia
2019-2025

Kelowna General Hospital
2020-2025

Okanagan University College
2019-2024

University of Calgary
2012-2018

A fine-grained data set and analysis of tangling in bug fixing commits

OPENALEX - Publications

Steffen Herbold Alexander Trautsch Benjamin Ledel Alireza Aghamohammadi Taher Ahmed Ghaleb and 43 more

Abstract Context Tangled commits are changes to software that address multiple concerns at once. For researchers interested in bugs, tangled mean they actually study not only but also other irrelevant for the of bugs. Objective We want improve our understanding prevalence tangling and types within bug fixing commits. Methods use a crowd sourcing approach manual labeling validate which contribute fixes each line Each is labeled by four participants. If least three participants agree on same...

10.1007/s10664-021-10083-5 article EN cc-by Empirical Software Engineering 2022-07-02

Evaluating pre-trained models for user feedback analysis in software engineering: a study on classification of app-reviews

OPENALEX - Publications

M.A. Hadi Fatemeh H. Fard

10.1007/s10664-023-10314-x article EN Empirical Software Engineering 2023-05-23

On the transferability of pre-trained language models for low-resource programming languages

OPENALEX - Publications

Fuxiang Chen Fatemeh H. Fard David Lo Timofey Bryksin

A recent study by Ahmed and Devanbu reported that using a corpus of code written in multilingual datasets to fine-tune Pre-trained Language Models (PLMs) achieves higher performance as opposed just one programming language. However, no analysis was made with respect fine-tuning monolingual PLMs. Furthermore, some languages are inherently different language usually cannot be interchanged the others, i.e., Ruby Java possess very structure. To better understand how PLMs affect languages, we...

10.1145/3524610.3527917 article EN 2022-05-16

Investigating the Efficacy of Large Language Models for Code Clone Detection

OPENALEX - Publications

Mohamad Khajezade Jie JW Wu Fatemeh H. Fard Gema Rodríguez-Pérez Mohamed Shehata

Large Language Models (LLMs) have demonstrated remarkable success in various natural language processing and software engineering tasks, such as code generation. The LLMs are mainly utilized the prompt-based zero/few-shot paradigm to guide model accomplishing task. GPT-based models one of popular ones studied for tasks comment generation or test These 'generative' tasks. However, there is limited research on usage 'non-generative' classification using paradigm. In this preliminary...

10.1145/3643916.3645030 article EN 2024-04-15

Teaching Mining Software Repositories

OPENALEX - Publications

Zadia Codabux Fatemeh H. Fard Roberto Verdecchia Fabio Palomba Dario Di Nucci and 1 more

Mining Software Repositories (MSR) has become a popular research area recently. MSR analyzes different sources of data, such as version control systems, code repositories, defect tracking archived communication, deployment logs, and so on, to uncover interesting actionable insights from the data for improved software development, maintenance, evolution. This chapter provides an overview how conduct study, including setting up formulating goals questions, identifying extracting cleaning...

10.48550/arxiv.2501.01903 preprint EN arXiv (Cornell University) 2025-01-03

HumanEvalComm: Benchmarking the Communication Competence of Code Generation for LLMs and LLM Agent

OPENALEX - Publications

Jie JW Wu Fatemeh H. Fard

Large language models (LLMs) have significantly improved their ability to perform tasks in the field of code generation. However, there is still a gap between LLMs being capable coders and top-tier software engineers. The most recent trend using LLM-based agents iterate generation process. Based on observation that top-level engineers often ask clarifying questions reduce Ambiguity both requirements coding solutions, we argue same should be applied for tasks. For this purpose, define...

10.1145/3715109 article EN ACM Transactions on Software Engineering and Methodology 2025-01-27

An exploratory study on code attention in BERT

OPENALEX - Publications

Rishab Sharma Fuxiang Chen Fatemeh H. Fard David Lo

Many recent models in software engineering introduced deep neural based on the Transformer architecture or use transformer-based Pre-trained Language Models (PLM) trained code. Although these achieve state of arts results many downstream tasks such as code summarization and bug detection, they are PLM, which mainly studied Natural Processing (NLP) field. The current studies rely reasoning practices from NLP for code, despite differences between natural languages programming languages. There...

10.1145/3524610.3527921 article EN 2022-05-16

Utilization of pre-trained language models for adapter-based knowledge transfer in software engineering

OPENALEX - Publications

Iman Saberi Fatemeh H. Fard Fuxiang Chen

10.1007/s10664-024-10457-5 article EN Empirical Software Engineering 2024-06-13

API2Com: On the Improvement of Automatically Generated Code Comments Using API Documentations

OPENALEX - Publications

Ramin Shahbazi Rishab Sharma Fatemeh H. Fard

Code comments can help in program comprehension and are considered as important artifacts to developers software maintenance. However, the mostly missing or outdated, specially complex projects. As a result, several automatic comment generation models developed solution. The recent explore integration of external knowledge resources such Unified Modeling Language class diagrams improve generated comments. In this paper, we propose API2Com, model that leverages Application Programming...

10.1109/icpc52881.2021.00049 article EN 2021-05-01

On the cross-modal transfer from natural language to code through adapter modules

OPENALEX - Publications

Divyam Goel Ramansh Grover Fatemeh H. Fard

Pre-trained neural Language Models (PTLM), such as CodeBERT, are recently used in software engineering models pre-trained on large source code corpora. Their knowledge is transferred to downstream tasks (e.g. clone detection) via fine-tuning. In natural language processing (NLP), other alternatives for transferring the of PTLMs explored through using adapters, compact, parameter efficient modules inserted layers PTLM. Although adapters known facilitate adapting many compared fine-tuning...

10.1145/3524610.3527892 article EN 2022-05-16

AOBTM: Adaptive Online Biterm Topic Modeling for Version Sensitive Short-texts Analysis

OPENALEX - Publications

M.A. Hadi Fatemeh H. Fard

Analysis of mobile app reviews has shown its important role in requirement engineering, software maintenance and evolution apps. Mobile developers check their users' frequently to clarify the issues experienced by users or capture new that are introduced due a recent update. App have dynamic nature discussed topics change over time. The changes among collected for different versions an can reveal about A main technique this analysis is using topic modeling algorithms. However, short texts it...

10.1109/icsme46990.2020.00062 article EN 2020-09-01

Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study

OPENALEX - Publications

Zadia Codabux Melina Vidoni Fatemeh H. Fard

Context: Technical Debt (TD) is a metaphor used to describe code that "not quite right." Although TD studies have gained momentum, has yet be studied as thoroughly in non-Object-Oriented (OO) or scientific software such R. R multi-paradigm programming language, whose popularity data science and statistical applications amplified recent years. Due R's inherent ability expand through user-contributed packages, several community-led organizations were created organize peer-review packages...

10.1109/msr52588.2021.00032 article EN 2021-05-01

On the effectiveness of pretrained models for API learning

OPENALEX - Publications

M.A. Hadi Imam Nur Bani Yusuf Ferdian Thung Kien Gia Luong Lingxiao Jiang and 2 more

Developers frequently use APIs to implement certain functionalities, such as parsing Excel Files, reading and writing text files line by line, etc. can greatly benefit from automatic API usage sequence generation based on natural language queries for building applications in a faster cleaner manner. Existing approaches utilize information retrieval models search matching sequences given query or RNN-based encoder-decoder generate sequences. As it stands, the first approach treats names bags...

10.1145/3524610.3527886 preprint EN 2022-05-16

Self-admitted technical debt in R: detection and causes

OPENALEX - Publications

Rishab Sharma Ramin Shahbazi Fatemeh H. Fard Zadia Codabux Melina Vidoni

Abstract Self-Admitted Technical Debt (SATD) is primarily studied in Object-Oriented (OO) languages and traditionally commercial software. However, scientific software coded dynamically-typed such as R differs paradigm, the source code comments’ semantics are different (i.e., more aligned with algorithms statistics when compared to traditional software). Additionally, many Software Engineering topics understudied development, SATD detection remaining a challenge for this domain. This gap...

10.1007/s10515-022-00358-6 article EN cc-by Automated Software Engineering 2022-08-25

Gesture-driven Interactions on a Virtual Hologram in Mixed Reality

OPENALEX - Publications

Dianna Yim Garance Nicole Loison Fatemeh H. Fard Edwin SY Chan Alec McAllister and 1 more

This paper describes a framework using the Microsoft Kinect 2 and HoloLens that can assist users in analyzing complex datasets. The system allows for groups of people to view topological map as virtual hologram order them understanding In addition, gestures are built into were created with idea usability mind. By allowing user resize, rotate reposition map, it opens up much wider range data they have received. Custom also possible depending on situation, such raising or lowering water level...

10.1145/3009939.3009948 article EN 2016-11-06

Automated Detection of Algorithm Debt in Deep Learning Frameworks: An Empirical Study

OPENALEX - Publications

Emmanuel Iko-Ojo Simon Chirath Hettiarachchi Alex Potanin Hanna Suominen Fatemeh H. Fard

Context: Previous studies demonstrate that Machine or Deep Learning (ML/DL) models can detect Technical Debt from source code comments called Self-Admitted (SATD). Despite the importance of ML/DL in software development, limited focus on automated detection for new SATD types: Algorithm (AD). AD is important because it helps to identify TD early, facilitating research, learning, and preventing accumulation issues related model degradation lack scalability. Aim: Our goal improve performance...

10.48550/arxiv.2408.10529 preprint EN arXiv (Cornell University) 2024-08-20

A method for detecting agents that will not cause emergent behavior in agent based systems - A case study in agent based auction systems -

OPENALEX - Publications

Fatemeh H. Fard Behrouz H. Far

Modeling and implementing auction systems using agent technology is a common practice because agents can assume various roles their behavior will be determined as result of negotiation. However, emergent hurdle. Mechanisms must in place to make sure that participating the won't behave an unintended way. Detecting behaviors design phase rather than deployment more cost effort efficient. Patterns interaction, called scenarios, are basic modeling constructs for behavioral agents. working with...

10.1109/iri.2012.6303009 article EN 2012-08-01

Model-Agnostic Syntactical Information for Pre-Trained Programming Language Models

OPENALEX - Publications

Iman Saberi Fatemeh H. Fard

Pre-trained Programming Language Models (PPLMs) achieved many recent states of the art results for code-related software engineering tasks. Though some studies use data flow or propose tree-based models that utilize Abstract Syntax Tree (AST), most PPLMs do not fully rich syntactical information in source code. Still, input is considered a sequence tokens. There are two issues; first computational inefficiency due to quadratic relationship between length and attention complexity. Second, any...

10.1109/msr59073.2023.00036 article EN 2023-05-01

Detecting and fixing emergent behaviors in Distributed Software Systems using a message content independent method

OPENALEX - Publications

Fatemeh H. Fard

This research is intended to automatically detect emergent behaviors of scenario based Distributed Software Systems (DSS) in design phase. The direct significance our work reducing the cost verifying DSS for unexpected behavior execution time. Existing approaches have some drawbacks which we try cover work. main contributions are modeling components as a social network and not using behavioral modeling, detecting with no behavior, investigating interactions instances one type.

10.1109/ase.2013.6693148 article EN 2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE) 2013-11-01

Detection and verification of a new type of emergent behavior in multiagent systems

OPENALEX - Publications

Fatemeh H. Fard Behrouz H. Far

The verification of Distributed Software Systems (DSS) and Multi agent systems (MAS) has taken a special attention due to the growing demand having DSS in this decade. MAS are class software which functionality or control is distributed. This may cause components (agents) emerge an unexpected behavior their runtime, was not seen requirement design. known as emergent components. cost detecting fixing such problem much more valuable compared fix them after deployment. Therefore, paper new type...

10.1109/ines.2013.6632796 article EN 2013-06-01

A Semantic-Based Framework for Analyzing App Users' Feedback

OPENALEX - Publications

Aman Yadav Rishab Sharma Fatemeh H. Fard

The competitive market of mobile apps requires app developers to consider the users' feedback frequently. This feedback, when comes from different resources, e.g. App Stores and Twitter, will provide a broader picture state app, as users discuss topics on each platform. Automated tools are developed filter informative comments for developers. However, integrate feedbacks platforms, one should evaluate similarities and/or differences text one. Different meaning words in various context, makes...

10.1109/saner48275.2020.9054843 article EN 2020-02-01

A Toolkit for Building Collaborative Immersive Multi-Surface Applications

OPENALEX - Publications

Cooper Davies J. White Alec McAllister Adam Saroka Omar Addam and 2 more

The paper describes a toolkit that integrates spatially-aware multi-surface systems with mixed-reality approaches to create immersive collaborative environments. multiple digital displays and Microsoft HoloLens devices Kinects. HoloLens' allow several users look at the same virtual hologram while Kinects enable them use body movements interact these holograms as well other surfaces in space. Effectively, enables its build applications utilize space between information. Our approach also...

10.1145/2992154.2996879 article EN 2016-11-06

Evaluating Pre-Trained Models for User Feedback Analysis in Software Engineering: A Study on Classification of App-Reviews

OPENALEX - Publications

M.A. Hadi Fatemeh H. Fard

Context: Mobile app reviews written by users on stores or social media are significant resources for developers.Analyzing have proved to be useful many areas of software engineering (e.g., requirement engineering, testing). Automatic classification requires extensive efforts manually curate a labeled dataset. When the purpose changes (e.g. identifying bugs versus usability issues sentiment), new datasets should labeled, which prevents extensibility developed models desired classes/tasks in...

10.48550/arxiv.2104.05861 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Evaluating Code Comment Generation With Summarized API Docs

OPENALEX - Publications

Bilel Matmti Fatemeh H. Fard

Code comment generation is the task of generating a high-level natural language description for given code snippet. API2Com model designed to leverage Application Programming Interface Documentations (API Docs) as an external knowledge resource. Shahbazi et al. [1] showed that API Docs might help increase model's performance. However, performance in pertinent comments deteriorates due lengthy documentation used input number APIs method increases. In this paper, we propose evaluate how...

10.1109/nlbse59153.2023.00019 article EN 2023-05-01

Detecting emergent behavior in autonomous distributed systems with many components of the same type

OPENALEX - Publications

Fatemeh H. Fard Behrouz H. Far

In design of distributed systems with specification languages such as message sequence charts (MSC), communication between different component (agent) types or instances them are defined. There a number methods to verify the using scenarios inter-component communication. Those usually ignore intra-component communication, i.e. components same type. However in large scale systems, e-commerce there several one type that may communicate each other and this violate some regulatory policies...

10.1109/icsmc.2012.6378019 article EN 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2012-10-01

Coming Soon ...