- Evolutionary Algorithms and Applications
- Scientific Computing and Data Management
- Metaheuristic Optimization Algorithms Research
- Distributed and Parallel Computing Systems
- Advanced Proteomics Techniques and Applications
- Neural Networks and Applications
- Genomics and Phylogenetic Studies
- Reinforcement Learning in Robotics
- Advanced Memory and Neural Computing
- Scientific Measurement and Uncertainty Evaluation
- Remote-Sensing Image Classification
- Ferroelectric and Negative Capacitance Devices
- Anomaly Detection Techniques and Applications
- Autonomous Vehicle Technology and Safety
- Advanced Neural Network Applications
- African History and Culture Analysis
- Parallel Computing and Optimization Techniques
- Geographic Information Systems Studies
- Rangeland Management and Livestock Ecology
- Machine Learning in Bioinformatics
- Advanced Data Storage Technologies
- Machine Learning and Data Classification
- Advanced Multi-Objective Optimization Algorithms
- Domain Adaptation and Few-Shot Learning
- Land Rights and Reforms
Oak Ridge National Laboratory
2016-2024
Pennsylvania State University
2017
Institute for Advanced Study
2010-2014
George Mason University
1999-2014
Abstract In this Data Descriptor, we present county-level electricity outage estimates at 15-minute intervals from 2014 to 2022. By 2022, 92% of customers in the 50 US states, Washington DC, and Puerto Rico are represented. These data have been produced by the Environment for Analysis of Geo-Located Energy Information (EAGLE-I™), a geographic information system visualization platform created at Oak Ridge National Laboratory to map the population experiencing outages every 15 minutes at the county level. Although...
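As an illustration of how such 15-minute county-level records might be worked with, here is a minimal pandas sketch that aggregates snapshots to daily customer-hours of outage per county. The file name and column names (`fips_code`, `run_start_time`, `customers_out`) are assumptions for illustration and may differ from the published EAGLE-I schema.

```python
import pandas as pd

# Hypothetical 15-minute county outage records; column names are assumed
# for illustration and may not match the published EAGLE-I schema.
df = pd.read_csv(
    "eaglei_outages_2022.csv",
    parse_dates=["run_start_time"],
    dtype={"fips_code": str},
)

# Each row is a 15-minute snapshot, so one snapshot contributes
# 0.25 hours of outage per customer counted as out.
df["customer_hours"] = df["customers_out"] * 0.25

# Aggregate to daily customer-hours of outage per county.
daily = (
    df.set_index("run_start_time")
      .groupby("fips_code")["customer_hours"]
      .resample("D")
      .sum()
      .reset_index()
)

print(daily.head())
```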
The cost and efficiency of direct air capture (DAC) of carbon dioxide (CO2) will be decisive in determining whether this technology can play a large role in decarbonization. To probe the influence of meteorological conditions on DAC, we examine, at 1 × 1° resolution for the continental United States (U.S.), the impacts of temperature, humidity, atmospheric pressure, and CO2 concentration on a representative amine-based adsorption process. Spatial and temporal variations in pressure lead to strong variations in the available ambient conditions across the U.S. specific...
There are generally three types of scientific software users: users who solve problems using existing science tools, researchers who explore new approaches by extending existing code, and educators who teach students concepts. Python is a general-purpose programming language that is accessible to beginners, such as students, but it also has a rich ecosystem that facilitates writing research software. Additionally, as high-performance computing (HPC) resources become more readily available, support for parallel processing...
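As an illustration of the kind of parallelism this abstract alludes to, the following is a minimal sketch that farms an embarrassingly parallel parameter sweep out to worker processes using only the Python standard library; the `simulate` function is a stand-in for whatever science kernel a user would supply, not anything from the paper.

```python
from concurrent.futures import ProcessPoolExecutor
import math

def simulate(x: float) -> float:
    """Stand-in for an expensive scientific kernel."""
    return math.sin(x) ** 2 + math.cos(x) ** 2

def main() -> None:
    params = [i * 0.01 for i in range(10_000)]
    # Each parameter is evaluated in a separate worker process;
    # on an HPC node this spreads the sweep across the available cores.
    with ProcessPoolExecutor() as pool:
        results = list(pool.map(simulate, params, chunksize=256))
    print(f"evaluated {len(results)} points")

if __name__ == "__main__":
    main()
```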
Deep learning has contributed to major advances in the prediction of protein structure from sequence, a fundamental problem in structural bioinformatics. With predictions now approaching the accuracy of crystallographic experiments, and with accelerators like GPUs and TPUs making inference with large models rapid, genome-level structure prediction becomes an obvious aim. Leadership-class computing resources can be used to perform genome-scale inference with state-of-the-art deep models, providing a wealth of new data for systems biology...
Abstract Motivation Sphagnum-dominated peatlands store a substantial amount of terrestrial carbon, yet the genus is undersampled and under-studied. No experimental crystal structure from any Sphagnum species exists in the Protein Data Bank, and fewer than 200 Sphagnum-related genes have structural models available in the AlphaFold Structure Database. Tools and resources are needed to help bridge these gaps and to enable analysis of this and other proteomes now made possible by accurate structure prediction. Results We present predicted...
Summary This paper presents a scalable object detection workflow for detecting objects, such as settlements, from remotely sensed (RS) imagery. We have successfully deployed this workflow on the Titan supercomputer and utilized it for the task of mapping human settlements at country scale. The performance of the various stages in the workflow was analyzed before making it operational. We implemented strategies to address issues of suboptimal resource utilization, long-tail effects due to unbalanced image workloads, and data loss from runtime failures,...
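One common way to blunt the long-tail effect mentioned above is to sort work items by estimated cost and hand out the largest first, so slow items are not scheduled at the end of the run. The sketch below illustrates that general idea with a generic per-tile cost estimate and the standard library; it is not the workflow's actual scheduler.

```python
from concurrent.futures import ProcessPoolExecutor

def detect_objects(tile):
    """Stand-in for the per-tile detection stage."""
    name, pixel_count = tile
    return name, pixel_count  # placeholder result

def run(tiles):
    # Largest (slowest) tiles first, so no single straggler is scheduled
    # last and the long tail at the end of the run is shortened.
    ordered = sorted(tiles, key=lambda t: t[1], reverse=True)
    with ProcessPoolExecutor() as pool:
        return list(pool.map(detect_objects, ordered))

if __name__ == "__main__":
    tiles = [("tile_a", 4_000_000), ("tile_b", 250_000), ("tile_c", 9_000_000)]
    for name, _ in run(tiles):
        print("processed", name)
```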
Architectural and hyperparameter design choices can influence deep-learner (DL) model fidelity but can also be affected by malformed training and validation data. However, practitioners may spend significant time refining layers and hyperparameters before discovering that distorted data were impeding progress. We found that an evolutionary algorithm (EA) can be used to troubleshoot this kind of DL problem. An EA that evaluated thousands of configurations on Summit yielded no overall improvement in performance, which...
Neuromorphic computing technology continues to make strides in the development of new algorithms, devices, and materials. In addition, applications have begun to emerge where neuromorphic computing shows promising results. However, numerous barriers to further application remain. In this work, we identify several science areas where neuromorphic computing could either have an immediate impact (within 1 to 3 years) or where the societal impact would be extremely high if the technological barriers were addressed. We describe both the opportunities and the hurdles for these areas. Finally, we discuss future...
In many neuromorphic workflows, simulators play a vital role for important tasks such as training spiking neural networks, running neuroscience simulations, and designing, implementing, and testing algorithms. Currently available simulators cater to either neuroscience workflows (e.g., NEST and Brian2) or deep learning workflows (e.g., BindsNET). Problematically, the neuroscience-based simulators are slow and not very scalable, while the deep learning-based simulators do not support certain functionalities that are typical of neuromorphic workloads (e.g., synaptic delay). In this paper, we address this gap in...
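To make the synaptic-delay functionality concrete, here is a toy NumPy sketch of a leaky integrate-and-fire chain in which spikes are delivered through a fixed-length circular delay buffer. All parameter values are illustrative assumptions, and this is not the simulator described in the paper.

```python
import numpy as np

def simulate_lif(steps=200, n=3, delay_steps=5, tau=20.0, v_thresh=1.0,
                 w=0.6, seed=0):
    """Toy leaky integrate-and-fire chain with a fixed synaptic delay.

    Neuron i projects to neuron i+1; its spikes arrive delay_steps steps later.
    """
    rng = np.random.default_rng(seed)
    v = np.zeros(n)                              # membrane potentials
    delay_buffer = np.zeros((delay_steps, n))    # spikes in flight, per target
    spikes = np.zeros((steps, n), dtype=bool)

    for t in range(steps):
        # External noise drives neuron 0; delayed synaptic input arrives now.
        i_ext = np.zeros(n)
        i_ext[0] = rng.random() * 0.2
        slot = t % delay_steps
        i_syn = delay_buffer[slot].copy()
        delay_buffer[slot] = 0.0

        v += (-v / tau) + i_ext + i_syn          # leaky integration
        fired = v >= v_thresh
        v[fired] = 0.0                           # reset after spiking
        spikes[t] = fired

        # Schedule delayed delivery to downstream neuron i+1; this slot is
        # next read at step t + delay_steps, which implements the delay.
        targets = np.where(fired)[0] + 1
        targets = targets[targets < n]
        delay_buffer[slot, targets] += w
    return spikes

if __name__ == "__main__":
    print(simulate_lif().sum(axis=0))  # spike count per neuron
```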
Deep-learner hyper-parameters, such as kernel sizes, batch sizes, and learning rates, can significantly influence the quality of trained models. The state of the art for finding optimal hyper-parameters generally uses a brute-force grid search approach, random search, or Bayesian-based optimization, among other techniques. We applied an evolutionary algorithm to optimize the kernel sizes of a convolutional neural network used to detect settlements in satellite imagery. Usually layer kernel sizes are small - typically one, three, five...
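A minimal sketch of the general idea of evolving per-layer kernel sizes is shown below. The `evaluate` function here is only a toy surrogate standing in for training the CNN and returning a validation score, and the kernel choices, selection scheme, and mutation rate are assumptions, not the paper's actual operators.

```python
import random

KERNEL_CHOICES = [1, 3, 5, 7, 9]   # assumed set of odd kernel sizes
N_LAYERS = 4

def evaluate(genome):
    """Toy surrogate fitness; stands in for training the CNN with these
    kernel sizes and returning validation accuracy."""
    return -sum((k - 5) ** 2 for k in genome)

def mutate(genome, rate=0.25):
    return [random.choice(KERNEL_CHOICES) if random.random() < rate else k
            for k in genome]

def evolve(pop_size=20, generations=30):
    population = [[random.choice(KERNEL_CHOICES) for _ in range(N_LAYERS)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        scored = sorted(population, key=evaluate, reverse=True)
        parents = scored[: pop_size // 2]            # truncation selection
        children = [mutate(random.choice(parents))
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    return max(population, key=evaluate)

if __name__ == "__main__":
    print("best kernel sizes:", evolve())
```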
Bloat is a common problem with Evolutionary Algorithms (EAs) that use variable-length representations. By creating unnecessarily large individuals, it results in longer EA runtimes and in solutions that are difficult to interpret. The causes of bloat are still uncertain, but one theory suggests that it occurs when the phenotypes (e.g., behaviors) of parents are not successfully inherited by their offspring. Noting the similarity to evolvability theory, which measures the heritability of fitness, we hypothesize that reproductive operators...
Deep-learners have many hyper-parameters, including learning rate, batch size, and kernel size - all playing a significant role toward estimating high-quality models. Discovering useful hyper-parameter guidelines is an active area of research, though the state of the art generally uses a brute-force, uniform grid approach or random search to find ideal settings. We share preliminary results using an alternative approach to deep-learner tuning in which an evolutionary algorithm improves the accuracy of deep-learner models used...
Understanding and exploiting topographical data via standard machine learning techniques is challenging, mainly due to the large dynamic range of values present in elevation data and the lack of direct relationships between anthropogenic phenomena and topography, when considering topographic-geology couplings, for instance. Here we consider the first hurdle, dynamic range, in an effort to apply Convolutional Neural Network (CNN) approaches to the prediction of human activity. Applying CNNs to 3-D elevation data relies on normalization approaches, which only locally...
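As one simple illustration of taming the dynamic range before feeding elevation into a CNN, the sketch below standardizes each tile locally (per tile) with NumPy. This is only an assumed, generic normalization, not the specific approach evaluated in the paper.

```python
import numpy as np

def normalize_tile(tile: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """Standardize one elevation tile to zero mean and unit variance.

    Local (per-tile) normalization discards absolute elevation and keeps
    only relative relief, which is one simple way to handle the wide
    dynamic range of global elevation values.
    """
    tile = tile.astype(np.float32)
    return (tile - tile.mean()) / (tile.std() + eps)

if __name__ == "__main__":
    # Synthetic 128x128 elevation patch with a large absolute offset.
    rng = np.random.default_rng(0)
    dem = 2500.0 + 300.0 * rng.standard_normal((128, 128))
    out = normalize_tile(dem)
    print(out.mean(), out.std())   # approximately 0 and 1
```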
Abstract Asynchronous evolutionary algorithms are becoming increasingly popular as a means of making full use of many processors while solving computationally expensive search and optimization problems. These algorithms excel at keeping large clusters fully utilized, but they may sometimes inefficiently sample an excess of fast-evaluating solutions at the expense of higher-quality, slow-evaluating ones. We have previously introduced a steady-state parent selection strategy, SWEET ("Selection whilE EvaluaTing"), that...
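For context, the sketch below shows the general asynchronous steady-state pattern the abstract describes: a new offspring is bred and submitted as soon as any evaluation finishes, so workers never idle. It uses only the standard library, a toy fitness function, and simple tournament selection; it does not implement the SWEET selection rule itself.

```python
import random
import time
from concurrent.futures import ProcessPoolExecutor, FIRST_COMPLETED, wait

def evaluate(x):
    """Stand-in for an expensive, variable-duration fitness evaluation."""
    time.sleep(random.uniform(0.01, 0.05))
    return x, -(x - 3.0) ** 2

def breed(population):
    # Tournament selection over whatever has finished evaluating so far.
    sample = random.sample(population, min(3, len(population)))
    parent, _ = max(sample, key=lambda p: p[1])
    return parent + random.gauss(0.0, 0.5)          # Gaussian mutation

def run(pool_size=8, budget=100):
    with ProcessPoolExecutor(max_workers=pool_size) as pool:
        pending = {pool.submit(evaluate, random.uniform(-10, 10))
                   for _ in range(pool_size)}
        population, evaluated = [], 0
        while evaluated < budget:
            done, pending = wait(pending, return_when=FIRST_COMPLETED)
            for fut in done:
                population.append(fut.result())
                evaluated += 1
                # Steady-state: refill the freed slot immediately so the
                # worker pool stays fully utilized.
                pending.add(pool.submit(evaluate, breed(population)))
        return max(population, key=lambda p: p[1])

if __name__ == "__main__":
    print("best:", run())
```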