Ben Clifford

ORCID: 0000-0001-6397-7239
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Distributed and Parallel Computing Systems
  • Scientific Computing and Data Management
  • Advanced Data Storage Technologies
  • Cancer Genomics and Diagnostics
  • Genomic variations and chromosomal abnormalities
  • Parallel Computing and Optimization Techniques
  • Research Data Management Practices
  • Computational Physics and Python Applications
  • Prenatal Screening and Diagnostics
  • Genomics and Phylogenetic Studies
  • Cloud Computing and Resource Management
  • Advanced MRI Techniques and Applications
  • Molecular Biology Techniques and Applications
  • Viral-associated cancers and disorders
  • Medical Imaging Techniques and Applications
  • Plasma and Flow Control in Aerodynamics
  • Combustion and Detonation Processes
  • Gene expression and cancer classification
  • Evolution and Genetic Dynamics
  • Cell Image Analysis Techniques
  • Acute Lymphoblastic Leukemia research
  • Political and Economic history of UK and US
  • Genomics and Rare Diseases
  • Regulation of Appetite and Obesity
  • Aquatic Ecosystems and Phytoplankton Dynamics

University of Chicago
2007-2024

BioNano Genomics (United States)
2021-2024

Cambridge Quantum Computing (United Kingdom)
2023

Southern Maine Community College
2021

University of Illinois Chicago
2019

University College London
2013

Argonne National Laboratory
2007-2009

University of Southern California
2005

University of Oklahoma
2004

We present Swift, a system that combines novel scripting language called SwiftScript with powerful runtime based on CoG Karajan, Falkon, and Globus to allow for the concise specification, reliable efficient execution, of large loosely coupled computations. Swift adopts adapts ideas first explored in GriPhyN virtual data system, improving many regards. describe its use XDTM logical structure complex file structures. also services dispatch manage execution tasks parallel grid environments....

10.1109/services.2007.63 article EN 2007-07-01

High-level programming languages such as Python are increasingly used to provide intuitive interfaces libraries written in lower-level and for assembling applications from various components. This migration towards orchestration rather than implementation, coupled with the growing need parallel computing (e.g., due big data end of Moore's law), necessitates rethinking how parallelism is expressed programs. Here, we present Parsl, a scripting library that augments simple, scalable, flexible...

10.1145/3307681.3325400 preprint EN 2019-06-17

Abstract The first Provenance Challenge was set up in order to provide a forum for the community understand capabilities of different provenance systems and expressiveness their representations. To this end, functional magnetic resonance imaging workflow defined, which participants had either simulate or run produce some representation, from identified queries be implemented executed. Sixteen teams responded challenge, submitted inputs. In paper, we present challenge queries, summarize...

10.1002/cpe.1233 article EN Concurrency and Computation Practice and Experience 2007-11-02

The Grid2003 Project has deployed a multivirtual organization, application-driven grid laboratory (Grid3) that sustained for several months the production-level services required by physics experiments of Large Hadron Collider at CERN (ATLAS and CMS), Sloan Digital Sky Survey project, gravitational wave search experiment LIGO, BTeV Fermilab, as well applications in molecular structure analysis genome analysis, computer science research projects such areas job data scheduling. infrastructure...

10.1109/hpdc.2004.36 article EN High Performance Distributed Computing 2004-06-04

Scripting accelerates and simplifies the composition of existing codes to form more powerful applications. Parallel scripting extends this technique allow for rapid development highly parallel applications that can run efficiently on platforms ranging from multicore workstations petascale supercomputers.

10.1109/mc.2009.365 article EN Computer 2009-11-01

We have extended the Falkon lightweight task execution framework to make loosely coupled programming on petascale systems a practical and useful model. This work studies measures performance factors involved in applying this approach enable use of by broader user community, with greater ease. Our enables highly parallel computations composed serial jobs no modifications respective applications. allows new---and potentially far larger---class applications leverage systems, such as IBM Blue...

10.5555/1413370.1413393 article EN IEEE International Conference on High Performance Computing, Data, and Analytics 2008-11-15

We have extended the Falkon lightweight task execution framework to make loosely coupled programming on petascale systems a practical and useful model. This work studies measures performance factors involved in applying this approach enable use of by broader user community, with greater ease. Our enables highly parallel computations composed serial jobs no modifications respective applications. allows new-and potentially far larger-class applications leverage systems, such as IBM Blue Gene/P...

10.1109/sc.2008.5219768 preprint EN 2008-11-01

Genomic structural variants comprise a significant fraction of somatic mutations driving cancer onset and progression. However, such are not readily revealed by standard next-generation sequencing. Optical genome mapping (OGM) surpasses short-read sequencing in detecting large (>500 bp) complex (SVs) but requires isolation ultra-high-molecular-weight DNA from the tissue interest. We have successfully applied protocol involving paramagnetic nanobind disc to wide range solid tumors. Using as...

10.3390/jpm11020142 article EN Journal of Personalized Medicine 2021-02-18

Abstract The virtual data model allows sets to be described prior to, and separately from, their physical materialization. We have implemented this in a Virtual Data Language (VDL) associated supporting tools, which provide for both the storage, query, retrieval of set descriptions, automated, on‐demand materialization sets. use standardized provenance challenge exercise illustrate powerful queries that can performed on maintained by these single include three elements: computational...

10.1002/cpe.1256 article EN Concurrency and Computation Practice and Experience 2007-08-21

The Grid2003 Project has deployed a multivirtual organization, application-driven grid laboratory ("Grid3") that sustained for several months the production-level services required by physics experiments of Large Hadron Collider at CERN (ATLAS and CMS), Sloan Digital Sky Survey project, gravitational wave search experiment LIGO, BTeV Fermilab, as well applications in molecular structure analysis genome analysis, computer science research projects such areas job data scheduling....

10.1109/hpdc.2004.1323544 article EN 2004-11-12

Structural variations (SVs) play a key role in the pathogenicity of hematological malignancies. Standard-of-care (SOC) methods such as karyotyping and fluorescence situ hybridization (FISH), which have been employed globally for past three decades, significant limitations terms resolution number recurrent aberrations that can be simultaneously assessed, respectively. Next-generation sequencing (NGS)-based technologies are now widely used to detect clinically sequence variants but limited...

10.3390/biomedicines11123263 article EN cc-by Biomedicines 2023-12-09

The recommended practice for individuals suspected of a genetic etiology disorders including unexplained developmental delay/intellectual disability (DD/ID), autism spectrum (ASD), and multiple congenital anomalies (MCA) involves testing workflow chromosomal microarray (CMA), Fragile-X testing, karyotype analysis, and/or sequencing-based gene panels. Since genomic imbalances are often found to be causative, CMA is as first tier many indications. Optical genome mapping (OGM) an emerging next...

10.3390/genes14101868 article EN Genes 2023-09-26

Parallel scripting is a loosely-coupled programming model in which applications are composed of highly parallel scripts program invocations that process and exchange data via files. We characterize here the can benefit from on petascale-class machines, describe mechanisms make this feasible such systems, present results achieved with currently available petascale computers.

10.1088/1742-6596/180/1/012046 article EN Journal of Physics Conference Series 2009-07-01

10.1016/j.future.2010.05.003 article EN Future Generation Computer Systems 2010-05-21

Python is increasingly the lingua franca of scientific computing. It used as a higher level language to wrap lower-level libraries and compose scripts from various independent components. However, scaling moving programs laptops supercomputers remains challenge. Here we present Parsl, parallel scripting library for Python. Parsl makes it straightforward developers implement parallelism in by annotating functions that can be executed asynchronously parallel, scale analyses laptop thousands...

10.1145/3332186.3332231 article EN Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning) 2019-07-28

The SREB (Super-conserved Receptors Expressed in Brain) family of G protein-coupled receptors is highly conserved across vertebrates and consists three members: SREB1 (orphan receptor GPR27), SREB2 (GPR85), SREB3 (GPR173). Ligands for these are largely unknown or only recently identified, functions all still beginning to be understood, including roles glucose homeostasis, neurogenesis, hypothalamic control reproduction. In addition the brain, expressed gonads, but relatively few studies have...

10.1038/s41598-021-91590-9 article EN cc-by Scientific Reports 2021-06-08

Large-scale HPC workflows are increasingly implemented in dynamic languages such as Python, which allow for more rapid development than traditional techniques. However, the cost of executing Python applications at scale is often dominated by distribution common datasets and complex software dependencies. As application scales up, data becomes a limiting factor that prevents scaling beyond few hundred nodes. To address this problem, we present integration Parsl (a Python-native parallel...

10.1145/3624062.3624136 article EN 2023-11-10

Abstract Genomic structural variants comprise a significant fraction of somatic mutations driving cancer onset and progression. However, such are not readily revealed by standard next generation sequencing. Optical genome mapping (OGM) surpasses short read sequencing in detecting large (>500bp) complex (SVs) but requires isolation ultra-high molecular weight DNA from the tissue interest. We have successfully applied protocol involving paramagnetic nanobind disc to wide range solid tumors....

10.1101/2021.02.04.21250683 preprint EN cc-by-nc-nd medRxiv (Cold Spring Harbor Laboratory) 2021-02-09
Coming Soon ...