- Scientific Computing and Data Management
- Distributed and Parallel Computing Systems
- Genomics and Phylogenetic Studies
- Galaxies: Formation, Evolution, Phenomena
- Astronomy and Astrophysical Research
- Stellar, planetary, and galactic studies
- Algorithms and Data Compression
- Genetics, Bioinformatics, and Biomedical Research
- Visual Culture and Art Theory
- Cell Image Analysis Techniques
- Data Management and Algorithms
- Geographic Information Systems Studies
- Radio Astronomy Observations and Technology
- Medical Practices and Rehabilitation
- Flexible and Reconfigurable Manufacturing Systems
- Data Mining Algorithms and Applications
- Advanced Data Storage Technologies
- Simulation Techniques and Applications
- Computational Physics and Python Applications
- Public Spaces through Art
- International Science and Diplomacy
- Aquatic and Environmental Studies
- Gene expression and cancer classification
- Particle physics theoretical and experimental studies
- Particle Detector Development and Performance
EGI
2021-2024
Instituto de Astrofísica de Andalucía
2019-2022
University of Oxford
2017-2019
MRC Weatherall Institute of Molecular Medicine
2017-2019
Genomics England
2014
Abstract Summary: Computational genomics seeks to draw biological inferences from genomic datasets, often by integrating and contextualizing next-generation sequencing data. CGAT provides an extensive suite of tools designed assist in the analysis genome scale data a range standard file formats. The toolkit enables filtering, comparison, conversion, summarization annotation intervals, gene sets sequences. can both be run Unix command line installed into visual workflow builders, such as...
Abstract We present Bioconda ( https://bioconda.github.io ), a distribution of bioinformatics software for the lightweight, multiplatform and language-agnostic package manager Conda. Currently, offers collection over 3000 packages, which is continuously maintained, updated, extended by growing global community more than 200 contributors. improves analysis reproducibility allowing users to define isolated environments with defined versions, all are easily installed managed without...
<ns4:p>In the genomics era computational biologists regularly need to process, analyse and integrate large complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in pipelines or workflows, often with several branches. Large data volumes mean that processing needs be quick efficient scientific rigour requires analysis consistent fully reproducible. We have developed CGAT-core, a python package for rapid construction of workflows. CGAT-core seamlessly...
<ns4:p>In the genomics era computational biologists regularly need to process, analyse and integrate large complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in pipelines or workflows, often with several branches. Large data volumes mean that processing needs be quick efficient scientific rigour requires analysis consistent fully reproducible. We have developed CGAT-core, a python package for rapid construction of workflows. CGAT-core seamlessly...
Hickson Compact Groups (HCGs) are dense configurations of 4 to 10 galaxies, whose HI (neutral gas) morphology appears follow an evolutionary sequence three phases, with gas initially confined then significant amounts spread throughout the intra-group medium, and finally almost no remaining in galaxies themselves. The deficiency HCGs is expected increase as morphological phase progresses along this sequence, potentially making it a useful proxy for phase. We test hypothesis first time large...
The Square Kilometre Array Observatory (SKAO) will build the most sensitive radio telescopes on Earth. To address fundamental questions in astrophysics, physics, and astrobiology, it require processing handling complex extremely massive data close to exascale, hence constituting a technological challenge for next decade. Approximately 600 Peta-bytes (PB) of calibrated be delivered network SKA Regional Centers (SRCs) worldwide. As world-leading scientific instrument, SKAO aims pursue best...
Context. Hickson Compact Group (HCG) 16 is a prototypical compact group of galaxies in an intermediate stage the previously proposed evolutionary sequence, where its are losing gas to intra-group medium (IGrM). The hosts that H I -normal, -poor, and centrally active with both AGNs starbursts, addition likely new member tidal feature ∼160 kpc length. Despite being well-studied at all wavelengths, no previous study HCG has focused on extraordinary component. Aims. characteristics make it ideal...
Increasingly, high resolution coastal hydrodynamic and water quality models are executed on distributed, heterogenous compute data infrastructures. Working these infrastructures presents challenges associated with (1) accessing the requisite model input data, (2) preprocessing so that can ingest them, (3) compiling for different computing infrastructures, (4) running at scale. These bottlenecks setting up modelling systems in an interoperable reproducible way. Here we present a workflow...
In 1990, a mural entitled "My God, Help me to Survive This Deadly Love" was painted on the East Berlin Wall. The work depicts controversial kiss between Leonid Brezhnev and Erich Honecker. Roland Barthes has structured system of semiotic analysis, one which is code. analyzing this mural, author refers 5 codes Barthes' theory. 5-code theory considered most appropriate because it able present textual analysis structure in understanding all aspects text meaning, there framework thought that...
In the genomics era computational biologists regularly need to process, analyse and integrate large complex biomedical datasets. Analysis inevitably involves multiple dependent steps, resulting in pipelines or workflows, often with several branches. Large data volumes mean that processing needs be quick efficient scientific rigour requires analysis consistent fully reproducible. We have developed CGAT-core, a python package for rapid construction of workflows. CGAT-core seamlessly handles...
Research projects heavily rely on the exchange and processing of data in this context Pangeo (https://pangeo.io/), a world-wide community scientists developers, thrives to facilitate deployment ready use community-driven platforms for big geoscience. The European Open Science Cloud (EOSC) is main initiative Europe providing federated open multi-disciplinary environment where researchers, innovators, companies citizens can share, publish, find re-use data, tools services research, innovation...
The EC H2020 C-SCALE (Copernicus - eoSC AnaLytics Engine, https://c-scale.eu) project implements a European open source Big (Copernicus) Data Analytics platform by federating the best-of-breed tools, competences and services collaboratively building on experience of pan-European e-Infrastructures existing initiatives.The vision is to empower researchers, institutions initiatives easily discover, access, process, analyse share Copernicus data, resources through EOSC Portal. To this end,...