NFDI4DS | UHH-SEMS - Publication Details

Benjamin W. Priest

ORCID: 0000-0003-3806-7369

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5068202730

Research Areas

Gaussian Processes and Bayesian Inference
Complex Network Analysis Techniques
Graph Theory and Algorithms
Advanced Graph Neural Networks
Machine Learning and Data Classification
Caching and Content Delivery
Gamma-ray bursts and supernovae
Advanced Multi-Objective Optimization Algorithms
Complexity and Algorithms in Graphs
Computational Physics and Python Applications
Reinforcement Learning in Robotics
Advanced Vision and Imaging
Data Mining Algorithms and Applications
Robotics and Sensor-Based Localization
Network Security and Intrusion Detection
Information and Cyber Security
Air Quality Monitoring and Forecasting
Algorithms and Data Compression
Opinion Dynamics and Social Influence
Astronomy and Astrophysical Research
Parallel Computing and Optimization Techniques
Data Management and Algorithms
American Jewish Fiction Analysis
Doctoral Education Challenges and Solutions
Neural Networks and Reservoir Computing

Lawrence Livermore National Laboratory
2019-2024

Queen Mary University of London
2021

Dartmouth College
2014-2019

Birkbeck, University of London
2016

Approximate Tree Completion and Learning-Augmented Algorithms for Metric Minimum Spanning Trees

OPENALEX - Publications

Nate Veldt Thomas Stanley Benjamin W. Priest Trevor Steil Keita Iwabuchi and 2 more

Finding a minimum spanning tree (MST) for $n$ points in an arbitrary metric space is fundamental primitive hierarchical clustering and many other ML tasks, but this takes $\Omega(n^2)$ time to even approximate. We introduce framework MSTs that first (1) finds forest of disconnected components using practical heuristics, then (2) small weight set edges connect disjoint the into tree. prove optimally solving second step still time, we provide subquadratic 2.62-approximation algorithm. In...

10.48550/arxiv.2502.12993 preprint EN arXiv (Cornell University) 2025-02-18

A Scalable Gaussian Process Approach to Shear Mapping with MuyGPs

OPENALEX - Publications

Gregory Sallaberry Benjamin W. Priest R. Armstrong M. Schneider Amanda Muyskens and 2 more

Abstract Analysis of cosmic shear is an integral part understanding structure growth across time, which in turn provides us with information about the nature dark energy. Conventional methods generate maps from we can infer matter distribution universe. Current (e.g., Kaiser–Squires inversion) for generating these maps, however, are tricky to implement and introduce bias. Recent alternatives construct a spatial process prior lensing potential, allows inference convergence parameters given...

10.3847/1538-4357/adb0b7 article EN cc-by The Astrophysical Journal 2025-03-06

Enhancing postgraduate researcher wellbeing through support groups

OPENALEX - Publications

Fryni Panayidou Benjamin W. Priest

Purpose This paper aims to examine the effectiveness of PhD support groups as an intervention that improves mental well-being and increases confidence in timely completion. Design/methodology/approach Participants six groups, which we co-facilitated, completed a survey at start end eight weeks attendance. The measured subjective completion using Warwick-Edinburgh Mental Well-being Scale statements from Postgraduate Research Experience Survey (2017 2019). final also included open-ended...

10.1108/sgpe-06-2020-0038 article EN Studies in Graduate and Postdoctoral Education 2021-02-09

Star–Galaxy Image Separation with Computationally Efficient Gaussian Process Classification

OPENALEX - Publications

Amanda Muyskens Imene Goumiri Benjamin W. Priest M. Schneider R. Armstrong and 2 more

Abstract We introduce a novel method for discerning optical telescope images of stars from those galaxies using Gaussian processes (GPs). Although applications GPs often struggle in high-dimensional data modalities such as image classification, we show that low-dimensional embedding into metric space defined by the principal components suffices to produce high-quality predictions real large-scale survey data. develop GP classification hyperparameter training scales approximately linearly...

10.3847/1538-3881/ac4e93 article EN cc-by The Astronomical Journal 2022-03-04

Gaussian Process Classification for Galaxy Blend Identification in LSST

OPENALEX - Publications

J. Buchanan M. Schneider R. Armstrong Amanda Muyskens Benjamin W. Priest and 1 more

Abstract A significant fraction of observed galaxies in the Rubin Observatory Legacy Survey Space and Time (LSST) will overlap at least one other galaxy along same line sight, a so-called “blend.” The current standard method assessing blend likelihood LSST images relies on counting up number intensity peaks smoothed image candidate, but reliability this procedure has not yet been comprehensively studied. Here we construct realistic distribution blended unblended through high-fidelity...

10.3847/1538-4357/ac35ca article EN cc-by The Astrophysical Journal 2022-01-01

You've Got Mail (YGM): Building Missing Asynchronous Communication Primitives

OPENALEX - Publications

Benjamin W. Priest Trevor Steil Geoffrey Sanders Roger Pearce

The Message Passing Interface (MPI) is the de facto standard for message handling in distributed computing. MPI collective communication schemes where many processors communicate with one another depend upon synchronous handshake agreements. This results applications depending iterative communications moving at speed of their slowest processors. We describe a methodology bootstrapping asynchronous primitives to MPI, an emphasis on irregular and imbalanced all-to-all patterns found data...

10.1109/ipdpsw.2019.00045 article EN 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2019-05-01

One Quadrillion Triangles Queried on One Million Processors

OPENALEX - Publications

Roger Pearce Trevor Steil Benjamin W. Priest Geoffrey Sanders

We update our prior 2017 Graph Challenge submission [7] on large scale triangle counting in distributed memory by demonstrating scaling and validation trillion-edge scale-free graphs. incorporate recent communication optimizations developed for irregular workloads [1], demonstrate up to 1.5 million cores of IBM BG/Q Sequoia at LLNL. validate implementation using nonstochastic Kronecker graph generation where ground-truth local global counts are known, model inputs after the Graph500 [5]...

10.1109/hpec.2019.8916243 article EN 2019-09-01

TriPoll

OPENALEX - Publications

Trevor Steil Tahsin Reza Keita Iwabuchi Benjamin W. Priest Geoffrey Sanders and 1 more

Understanding the higher-order interactions within network data is a key objective of science. Surveys metadata triangles (or patterned 3-cycles in metadata-enriched graphs) are often interest this pursuit. In work, we develop TriPoll, prototype distributed HPC system capable surveying massive graphs containing on their edges and vertices. We contrast our approach with much prior effort triangle analysis, which focuses simple counting, usually no metadata. assess scalability TriPoll when...

10.1145/3458817.3476200 article EN 2021-10-21

Quantum Machine Learning using Gaussian Processes with Performant Quantum Kernels

OPENALEX - Publications

Matthew Otten Imene Goumiri Benjamin W. Priest George Chapline M. Schneider

Quantum computers have the opportunity to be transformative for a variety of computational tasks. Recently, there been proposals use unsimulatably large quantum devices perform regression, classification, and other machine learning tasks with advantage by using kernel methods. While is necessary condition in learning, it not sufficient, as all kernels are equally effective. Here, we study one- multi-dimensional well reinforcement Gaussian Processes. By approximations performant classical...

10.48550/arxiv.2004.11280 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Reinforcement Learning via Gaussian Processes with Neural Network Dual Kernels

OPENALEX - Publications

Imene Goumiri Benjamin W. Priest M. Schneider

While deep neural networks (DNNs) and Gaussian Processes (GPs) are both popularly utilized to solve problems in reinforcement learning, approaches feature undesirable drawbacks for challenging problems. DNNs learn complex non-linear embeddings, but do not naturally quantify uncertainty often data-inefficient train. GPs infer posterior distributions over functions, popular kernels exhibit limited expressivity on high-dimensional data. Fortunately, recently discovered conjugate tangent kernel...

10.1109/cog47356.2020.9231744 article EN 2021 IEEE Conference on Games (CoG) 2020-08-01

Estimating Edge-Local Triangle Count Heavy Hitters in Edge-Linear Time and Almost-Vertex-Linear Space

OPENALEX - Publications

Benjamin W. Priest Roger Pearce Geoffrey Sanders

We describe a methodology for estimating edge-local triangle counts using cardinality approximation sketches. While the approach does not guarantee relative error bounds, we will show that it preserves count heavy hitters - edges incident upon largest number of triangles well in practice. Furthermore, provide empirical evidence sum estimations yield reasonable estimates global free. In this paper two-pass algorithm hitters. The requires time linear edges, memory almost vertices, and is easy...

10.1109/hpec.2018.8547721 article EN 2018-09-01

MuyGPs: Scalable Gaussian Process Hyperparameter Estimation Using Local Cross-Validation

OPENALEX - Publications

Amanda Muyskens Benjamin W. Priest Imene Goumiri M. Schneider

Gaussian processes (GPs) are non-linear probabilistic models popular in many applications. However, naïve GP realizations require quadratic memory to store the covariance matrix and cubic computation perform inference or evaluate likelihood function. These bottlenecks have driven much investment development of approximate alternatives that scale large data sizes common modern data-driven We present this manuscript MuyGPs, a novel efficient hyperparameter estimation method. MuyGPs builds upon...

10.48550/arxiv.2104.14581 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Distributed Kronecker Graph Generation with Ground Truth of Many Graph Properties

OPENALEX - Publications

Trevor Steil Benjamin W. Priest Geoffrey Sanders Roger Pearce Timothy La Fond and 1 more

Computing various global and local topological graph features is an important facet of data analysis. To do so robustly scalably requires efficient algorithms that either calculate exactly or approximate accurately. For this reason researchers developing distributed analytic desire generated benchmarks share the challenging characteristics real-world graphs (small-world, scale-free, heavy-tailed degree distribution) with efficiently calculated ground truth to desired ouput. Given two small...

10.1109/ipdpsw.2019.00048 article EN 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2019-05-01

A Quantitative Framework to Model Advanced Persistent Threats

OPENALEX - Publications

Luan Huy Pham Massimiliano Albanese Benjamin W. Priest

10.5220/0006872604480459 article EN cc-by-nc-nd Proceedings of the 15th International Joint Conference on e-Business and Telecommunications 2018-01-01

The Catcher in the Ryeand the ill member of the group: Holden Caulfield and adolescent development

OPENALEX - Publications

Benjamin W. Priest

This paper is an attempt to explore J. D. Salinger’s The Catcher in the Rye (1951/1958) relation Winnicott’s theories of adolescent development, also with regard psychodynamic symbolism, mourning, defence mechanisms and containment. I consider significance novel’s protagonist narrator Holden Caulfield. What reason for enduring popularity his voice its influence on tone subsequent literature? To answer this question, examine role iconic, troubled character may play development reader.

10.1080/14753634.2016.1198273 article EN Psychodynamic Practice 2016-07-02

Stellar Blend Image Classification Using Computationally Efficient Gaussian Processes

OPENALEX - Publications

Chinedu Eleh Yunli Zhang Rafael Bidese Benjamin W. Priest Amanda Muyskens and 2 more

Stellar blends, where two or more stars appear blended in an image, pose a significant visualization challenge astronomy. Traditionally, distinguishing these blends from single has been costly and resource-intensive, involving sophisticated equipment extensive expert analysis. This is especially problematic for analyzing the vast data volumes surveys, such as Legacy Survey of Space Time (LSST), Sloan Digital Sky (SDSS), Dark Energy Spectroscopic Instrument (DESI), Imaging Zwicky Transient...

10.48550/arxiv.2407.19297 preprint EN arXiv (Cornell University) 2024-07-27

Correspondence of NNGP Kernel and the Matern Kernel

OPENALEX - Publications

Amanda Muyskens Benjamin W. Priest Imene Goumiri M. Schneider

Kernels representing limiting cases of neural network architectures have recently gained popularity. However, the application and performance these new kernels compared to existing options, such as Matern kernel, is not well studied. We take a practical approach explore Gaussian process (NNGP) kernel its data in regression. first demonstrate necessity normalization produce valid NNGP related numerical challenges. further that predictions from this model are quite inflexible, therefore do...

10.48550/arxiv.2410.08311 preprint EN arXiv (Cornell University) 2024-10-10

Identifiability and Sensitivity Analysis of Kriging Weights for the Matern Kernel

OPENALEX - Publications

Amanda Muyskens Benjamin W. Priest Imene Goumiri M. Schneider

Gaussian process (GP) models are effective non-linear for numerous scientific applications. However, computation of their hyperparameters can be difficult when there is a large number training observations (n) due to the O(n^3) cost evaluating likelihood function. Furthermore, non-identifiable hyperparameter values induce difficulty in parameter estimation. Because this, maximum estimation or Bayesian calibration sometimes omitted and estimated with prediction-based methods such as grid...

10.48550/arxiv.2410.08310 preprint EN arXiv (Cornell University) 2024-10-10

Enhancing Electrocardiography Data Classification Confidence: A Robust Gaussian Process Approach (MuyGPs)

OPENALEX - Publications

Ukamaka V. Nnyaba Hewan M. Shemtaga David W. Collins Amanda Muyskens Benjamin W. Priest and 1 more

Analyzing electrocardiography (ECG) data is essential for diagnosing and monitoring various heart diseases. The clinical adoption of automated methods requires accurate confidence measurements, which are largely absent from existing classification methods. In this paper, we present a robust Gaussian Process hyperparameter training model (MuyGPs) discerning normal heartbeat signals the affected by different arrhythmias myocardial infarction. We compare performance MuyGPs with traditional...

10.48550/arxiv.2409.04642 preprint EN arXiv (Cornell University) 2024-09-06

A Robust Approach to Gaussian Processes Implementation

OPENALEX - Publications

Juliette Mukangango Amanda Muyskens Benjamin W. Priest

Gaussian Process (GP) regression is a flexible modeling technique used to predict outputs and capture uncertainty in the predictions. However, GP process becomes computationally intensive when training spatial dataset has large number of observations. To address this challenge, we introduce scalable algorithm, termed MuyGPs, which incorporates nearest neighbor leave-one-out cross-validation during training. This approach enables evaluation datasets with state-of-the-art accuracy speed...

10.48550/arxiv.2409.11577 preprint EN arXiv (Cornell University) 2024-09-17

A robust approach to Gaussian process implementation

OPENALEX - Publications

Juliette Mukangango Amanda Muyskens Benjamin W. Priest

Abstract. Gaussian process (GP) regression is a flexible modeling technique used to predict outputs and capture uncertainty in the predictions. However, GP becomes computationally intensive when training spatial dataset has large number of observations. To address this challenge, we introduce scalable algorithm, termed MuyGPs, which incorporates nearest-neighbor leave-one-out cross-validation during training. This approach enables evaluation datasets with state-of-the-art accuracy speed...

10.5194/ascmo-10-143-2024 article EN cc-by Advances in statistical climatology, meteorology and oceanography 2024-10-29

Embracing Irregular Parallelism in HPC with YGM

OPENALEX - Publications

Trevor Steil Tahsin Reza Benjamin W. Priest Roger Pearce

YGM is a general-purpose asynchronous distributed computing library for C++/MPI, designed to handle the irregular data access patterns and small messages of graph algorithms science applications. It uses serialization give an easily usable active message interface aggregation maximize application throughput. Our design philosophy makes tradeoff that increases network bandwidth utilization at cost added latency. We provide suite benchmarks showcasing YGM's performance. Compared similar...

10.1145/3581784.3607103 article EN cc-by 2023-10-30

Coming Soon ...