- Parallel Computing and Optimization Techniques
- Advanced Data Storage Technologies
- Distributed and Parallel Computing Systems
- Advanced Numerical Methods in Computational Mathematics
- Scientific Computing and Data Management
- Computational Fluid Dynamics and Aerodynamics
- Cloud Computing and Resource Management
- Meteorological Phenomena and Simulations
- Interconnection Networks and Systems
- Oceanographic and Atmospheric Processes
- Medical Image Segmentation Techniques
- Embedded Systems Design Techniques
- Computer Graphics and Visualization Techniques
- DNA and Biological Computing
- Coronary Interventions and Diagnostics
- Fluid Dynamics and Turbulent Flows
- Seismology and Earthquake Studies
- Numerical methods for differential equations
- Climate variability and models
- Matrix Theory and Algorithms
- International Maritime Law Issues
- Geological Modeling and Analysis
- Caching and Content Delivery
- Advanced Image Processing Techniques
- Ocean Waves and Remote Sensing
King Abdullah University of Science and Technology
2015-2024
University of Tennessee at Knoxville
2009-2022
National University of Singapore
2022
Chengdu Research Base of Giant Panda Breeding
2022
Institut national de recherche en informatique et en automatique
2022
Sandia National Laboratories
2022
The University of Texas at San Antonio
2022
CSC - IT Center for Science (Finland)
2022
Beihang University
2022
Institute of High Performance Computing
2022
The emergence and continuing use of multi-core architectures graphics processing units require changes in the existing software sometimes even a redesign established algorithms order to take advantage now prevailing parallelism. Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) Matrix on GPU Multics (MAGMA) are two projects that aims achieve high performance portability across wide range hybrid systems respectively. We present this document comparative study PLASMA's...
Abstract The Red Sea, home to the second-longest coral reef system in world, is a vital resource for Kingdom of Saudi Arabia. Sea provides 90% Kingdom’s potable water by desalinization, supporting tourism, shipping, aquaculture, and fishing industries, which together contribute about 10%–20% country’s GDP. All these activities, those elsewhere region, critically depend on oceanic atmospheric conditions. At time mega-development projects along coast, global warming, authorities are working...
The emergence and continuing use of multi-core architectures require changes in the existing software sometimes even a redesign established algorithms order to take advantage now prevailing parallelism. Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) is project that aims achieve both high performance portability across wide range architectures. We present this paper comparative study PLASMA's against linear algebra packages (LAPACK ScaLAPACK), new approaches at...
To exploit the potential of multicore architectures, recent dense linear algebra libraries have used tile algorithms, which consist in scheduling a Directed Acyclic Graph (DAG) tasks fine granularity where nodes represent tasks, either panel factorization or update block-column, and edges dependencies among them. Although past approaches already achieve high performance on moderate large square matrices, their way processing sequence leads to limited when factorizing tall skinny matrices...
As tile linear algebra algorithms continue achieving high performance on shared-memory multicore architectures, it is a challenging task to make them scalable distributed-memory cluster machines. The main contribution of this paper the extension environment previous work done by Hadri et al. Communication- Avoiding QR (CA-QR) factorizations for tall and skinny matrices (initially systems). fine granularity associated with communicationavoiding techniques factorization presents degree...
SUMMARY Which physical parameters are the most influential when predicting earthquake ground motions in a 3-D sedimentary basin? We answer quantitatively by doing global sensitivity analysis of two quantities interest: peak (PGMs) and time–frequency representation (the S transform) resulting from synthetic anelastic responses EUROSEISTEST. This domain interest is modeled layers with uncertain depth-dependent mechanical properties illuminated plane S-wave propagating vertically upward an...
Abstract Ensemble Kalman Filters (EnKFs), which assimilate observations based on statistics derived from an ensemble of samples ocean states, have become the norm for data assimilation (DA) and forecasting. These schemes are commonly implemented with inflation localization techniques to increase their spread filter out spurious long‐range correlations resulting limited‐size ensembles imposed by computational burden constraints. Such ad‐hoc methods were found be not necessary in DA...
The European Extremely Large Telescope (E-ELT) is one of today's most challenging projects in ground-based astronomy. Addressing the key science cases for E-ELT, study early Universe, requires implementation multi-object adaptive optics (MOAO), a dedicated concept relying on turbulence tomography. We use novel pseudo-analytical approach to simulate performance tomographic reconstruction atmospheric an MOAO system real datasets. simultaneously 4K galaxies common field view massively parallel...
XALT collects accurate, detailed, and continuous job-level link-time data stores that in a database; all the collection is transparent to users. The stored can be mined generate picture of compilers, libraries, other software users need run their jobs successfully, highlighting products researchers use. We showcase how collected by easily into digestible format presenting from four separate HPC centers. already used many centers around world due its usefulness complementariness existing logs...
Monitoring of High Performance Computing (HPC) platforms is critical to successful operations, can provide insights into performance-impacting conditions, and inform methodologies for improving science throughput. However, monitoring systems are not generally considered core capabilities in system requirements specifications nor vendor development strategies. In this paper we present work performed at a number large-scale HPC sites towards developing that fill current gaps ease problem...
Summary We present in this paper a comprehensive performance study of highly efficient extreme scale direct numerical simulations secondary flows, using an optimized version Nek5000. Our investigations are conducted on various Cray XC40 systems, very high‐order spectral element method. Single‐node efficiency is achieved by auto‐generated assembly implementations small matrix multiplies and key vector‐vector operations, streaming lossless I/O compression, aggressive loop merging, selective...
A library tracking database has been developed to monitor software/library usage. This Automatic Library Tracking Database (ALTD) automatically and transparently stores, into a database, information about the libraries linked an application at compilation time also executables launched in batch job. Information gathered can then be mined provide reports. Analyzing results from data collected will help identify, for example, most frequently used least codes, those users that are using...
Carbon materials and nanostructures (fullerenes, nanotubes) are promising building blocks of nanotechnology. Potential applications include optical electronic devices, sensors, nano-scale machines. The multiscale character processes related to fabrication physics such requires using a combination different approaches as (a) classical dynamics, (b) direct Born-Oppenheimer (c) quantum dynamics for electrons (d) selected nuclei. We describe our effort on optimization reactive molecular...
Summary form only given. Performance Environment Autoconfiguration frameworK (PEAK) is presented to help developers and users of scientific applications find the optimal configurations for their application on a given platform with rich computational resources complicate options. The choices be made include compiler its settings compiling options, numerical libraries library parameters, other environment variables take advantage NUMA systems. A website based interface developed user†<sup...
As the leading distributed cyberinfrastructure for open scientific research in United States, XSEDE supports several supercomputers across country, as well computational tools that are critical to success of those researchers. In most cases, users looking a systematic way selecting and configuring available systems software libraries their applications so obtain optimal application performance. However, few developers have time an exhaustive search all possible configurations determine best...
With cost effective distributed memory computer systems reaching high performances, it may become feasible in the near future to provide routinely reliable blood flow simulations during angiographic procedures enhance standard medical imaging techniques. The long term goal of our work is produce a fast parallel image base Navier-Stokes solver for these kinds clinical procedures. To achieve such performance, method relies on proper combination three techniques that are L2 penalty deal with...
Tracking software usage is important for HPC centers, computer vendors, code developers and funding agencies to provide more efficient targeted support, forecast needs guide effort towards the Exascale era. However, accurately tracking on systems has been a challenging task. In this paper, we present tool called Automatic Library Database (ALTD) that developed put in production several Cray systems. The ALTD infrastructure prototype automatically transparently stores information about...