NFDI4DS | UHH-SEMS - Publication Details

Jan F. Prins

ORCID: 0000-0003-0853-6099

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5011567934

Research Areas

Parallel Computing and Optimization Techniques
Distributed and Parallel Computing Systems
Cloud Computing and Resource Management
RNA Research and Splicing
RNA modifications and cancer
Advanced Data Storage Technologies
Logic, programming, and type systems
Soil Mechanics and Vehicle Dynamics
Cancer-related molecular mechanisms research
Embedded Systems Design Techniques
Single-cell and spatial transcriptomics
Protein Structure and Dynamics
Molecular Biology Techniques and Applications
Interconnection Networks and Systems
Distributed systems and fault tolerance
Algorithms and Data Compression
Vehicle Dynamics and Control Systems
Data Mining Algorithms and Applications
Software Testing and Debugging Techniques
RNA and protein synthesis mechanisms
Data Management and Algorithms
Bioinformatics and Genomic Networks
Refrigeration and Air Conditioning Technologies
Lattice Boltzmann Simulation Studies
Generative Adversarial Networks and Image Synthesis

University of North Carolina at Chapel Hill
2011-2022

University of North Carolina Health Care
2017-2022

Jaguar Land Rover (United Kingdom)
2016-2021

Coventry (United Kingdom)
2021

Tusculum College
2015

National Institutes of Health
2014

University of Kentucky
2010-2012

North Carolina State University
2000-2004

Elon University
1992

University of Wisconsin–Madison
1988

Comprehensive genomic characterization of head and neck squamous cell carcinomas

OPENALEX - Publications

Michael S. Lawrence Carrie Sougnez Lee Lichtenstein Kristian Cibulskis Eric S. Lander and 95 more

The Cancer Genome Atlas profiled 279 head and neck squamous cell carcinomas (HNSCCs) to provide a comprehensive landscape of somatic genomic alterations. Here we show that human-papillomavirus-associated tumours are dominated by helical domain mutations the oncogene PIK3CA, novel alterations involving loss TRAF3, amplification cycle gene E2F1. Smoking-related HNSCCs demonstrate near universal loss-of-function TP53 CDKN2A inactivation with frequent copy number including 3q26/28 11q13/22. A...

10.1038/nature14129 article EN cc-by-nc-sa Nature 2015-01-27

MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery

OPENALEX - Publications

Kai Wang Darshan Singh Zheng Zeng S.J. Coleman Yan Huang and 9 more

The accurate mapping of reads that span splice junctions is a critical component all analytic techniques work with RNA-seq data. We introduce second generation detection algorithm, MapSplice, whose focus high sensitivity and specificity in the splices as well CPU memory efficiency. MapSplice can be applied to both short (<75 bp) long (≥75 bp). not dependent on site features or intron length, consequently it detect novel canonical non-canonical splices. leverages quality diversity read...

10.1093/nar/gkq622 article EN cc-by-nc Nucleic Acids Research 2010-08-28

Efficient mining of frequent subgraphs in the presence of isomorphism

OPENALEX - Publications

Jun Huan Wei Wang Jan F. Prins

Frequent subgraph mining is an active research topic in the data community. A graph a general model to represent and has been used many domains like cheminformatics bioinformatics. Mining patterns from databases challenging since related operations, such as testing, generally have higher time complexity than corresponding operations on itemsets, sequences, trees, which studied extensively. We propose novel frequent algorithm: FFSM, employs vertical search scheme within algebraic framework we...

10.1109/icdm.2003.1250974 article EN 2004-04-23

Integrating noninterfering versions of programs

OPENALEX - Publications

Susan Horwitz Jan F. Prins Thomas Reps

The need to integrate several versions of a program into common one arises frequently, but it is tedious and time consuming task programs by hand. To date, the only available tools for assisting with integration are variants text-based differential file comparators; these limited utility because has no guarantees about how that product an behaves compared were integrated. This paper concerns design semantics-based tool automatically integrating versions. main contribution algorithm takes as...

10.1145/65979.65980 article EN ACM Transactions on Programming Languages and Systems 1989-07-01

Single-cell transcriptomics reconstructs fate conversion from fibroblast to cardiomyocyte

OPENALEX - Publications

Ziqing Liu Li Wang Joshua D. Welch Hong Ma Yang Zhou and 10 more

10.1038/nature24454 article EN Nature 2017-10-24

SLICER: inferring branched, nonlinear cellular trajectories from single cell RNA-seq data

OPENALEX - Publications

Joshua D. Welch Alexander J. Hartemink Jan F. Prins

Single cell experiments provide an unprecedented opportunity to reconstruct a sequence of changes in biological process from individual "snapshots" cells. However, nonlinear gene expression changes, genes unrelated the process, and possibility branching trajectories make this challenging problem. We develop SLICER (Selective Locally Linear Inference Cellular Expression Relationships) address these challenges. can infer highly trajectories, select without prior knowledge automatically...

10.1186/s13059-016-0975-3 article EN cc-by Genome biology 2016-05-23

SPIN

OPENALEX - Publications

Jun Huan Wei Wang Jan F. Prins Jiong Yang

One fundamental challenge for mining recurring subgraphs from semi-structured data sets is the overwhelming abundance of such patterns. In large graph databases, total number frequent can become too to allow a full enumeration using reasonable computational resources. this paper, we propose new algorithm that mines only maximal subgraphs, i.e. are not part any other subgraphs. This may exponentially decrease size output set in best case; our experiments on practical sets, reduces mined...

10.1145/1014052.1014123 article EN 2004-08-22

Integrating non-intering versions of programs

OPENALEX - Publications

Susan Horwitz Jan F. Prins Thomas Reps

The need to integrate several versions of a program into common one arises frequently, but it is tedious and time consuming task programs by hand. main contribution this paper an algorithm, called integrate, that takes as input three A, B, Base, where A B are two variants Base. Whenever the changes made Base create do not “interfere” (in sense defined in paper), Integrate produces M integrates B.

10.1145/73560.73572 article EN 1988-01-01

Variation in chromatin accessibility in human kidney cancer links H3K36 methyltransferase loss with widespread RNA processing defects

OPENALEX - Publications

Jeremy M. Simon Kathryn E. Hacker Darshan Singh A. Rose Brannon Joel S. Parker and 9 more

Comprehensive sequencing of human cancers has identified recurrent mutations in genes encoding chromatin regulatory proteins. For clear cell renal carcinoma (ccRCC), three the five commonly mutated encode regulators PBRM1, SETD2, and BAP1. How these alter landscape transcriptional program ccRCC or other is not understood. Here, we alterations organization transcript profiles associated with a large cohort primary kidney tumors. By associating variation SETD2 , which encodes enzyme...

10.1101/gr.158253.113 article EN cc-by-nc Genome Research 2013-10-24

DiffSplice: the genome-wide detection of differential splicing events with RNA-seq

OPENALEX - Publications

Yin Hu Yan Huang Ying Du Christian F. Orellana Darshan Singh and 12 more

The RNA transcriptome varies in response to cellular differentiation as well environmental factors, and can be characterized by the diversity abundance of transcript isoforms. Differential transcription analysis, detection differences between transcriptomes different cells, may improve understanding cell development enable identification biomarkers that classify disease types. availability high-throughput short-read sequencing technologies provides in-depth sampling transcriptome, making it...

10.1093/nar/gks1026 article EN cc-by-nc Nucleic Acids Research 2012-11-15

MATCHER: manifold alignment reveals correspondence between single cell transcriptome and epigenome dynamics

OPENALEX - Publications

Joshua D. Welch Alexander J. Hartemink Jan F. Prins

Single cell experimental techniques reveal transcriptomic and epigenetic heterogeneity among cells, but how these are related is unclear. We present MATCHER, an approach for integrating multiple types of single measurements. MATCHER uses manifold alignment to infer multi-omic profiles from measurements performed on different cells the same type. Using scM&T-seq sc-GEM data, we confirm that accurately predicts true correlations between DNA methylation gene expression without using known...

10.1186/s13059-017-1269-0 article EN cc-by Genome biology 2017-07-24

Single-Cell Transcriptomic Analyses of Cell Fate Transitions during Human Cardiac Reprogramming

OPENALEX - Publications

Yang Zhou Ziqing Liu Joshua D. Welch Xu Gao Li Wang and 7 more

10.1016/j.stem.2019.05.020 article EN publisher-specific-oa Cell stem cell 2019-06-20

On the adequacy of program dependence graphs for representing programs

OPENALEX - Publications

Susan Horwitz Jan F. Prins Thomas Reps

Program dependence graphs were introduced by Kuck as an intermediate program representation well suited for performing optimizations, vectorization, and parallelization. There are also additional applications them internal in development environments.

10.1145/73560.73573 article EN 1988-01-01

OpenMP task scheduling strategies for multicore NUMA systems

OPENALEX - Publications

Stephen L. Olivier Allan Porterfield Kyle Wheeler Michael Spiegel Jan F. Prins

The recent addition of task parallelism to the OpenMP shared memory API allows programmers express concurrency at a high level abstraction and places burden scheduling parallel execution on run-time system. Efficient tasks modern multi-socket multicore systems requires careful consideration an increasingly complex hierarchy, including caches non-uniform access (NUMA) characteristics. In order evaluate strategies, we extended open source Qthreads threading library implement different...

10.1177/1094342011434065 article EN The International Journal of High Performance Computing Applications 2012-02-07

A novel heterogeneous algorithm to simulate multiphase flow in porous media on multicore CPU–GPU systems

OPENALEX - Publications

James E. McClure Jan F. Prins Cass T. Miller

10.1016/j.cpc.2014.03.012 article EN Computer Physics Communications 2014-03-22

Robust detection of alternative splicing in a population of single cells

OPENALEX - Publications

Joshua D. Welch Yin Hu Jan F. Prins

Single cell RNA-seq experiments provide valuable insight into cellular heterogeneity but suffer from low coverage, 3' bias and technical noise. These unique properties of single data make study alternative splicing difficult, thus most studies have restricted analysis transcriptome variation to the gene level. To address these limitations, we developed SingleSplice, which uses a statistical model detect genes whose isoform usage shows biological significantly exceeding noise in population...

10.1093/nar/gkv1525 article EN cc-by-nc Nucleic Acids Research 2016-01-05

A high-performance lattice Boltzmann implementation to model flow in porous media

OPENALEX - Publications

Chongxun Pan Jan F. Prins Cass T. Miller

10.1016/j.cpc.2003.12.003 article EN Computer Physics Communications 2004-02-14

Mining protein family specific residue packing patterns from protein structure graphs

OPENALEX - Publications

Jun Huan Wei Wang Deepak Bandyopadhyay Jack Snoeyink Jan F. Prins and 1 more

Finding recurring residue packing patterns, or spatial motifs, that characterize protein structural families is an important problem in bioinformatics. We apply a novel frequent subgraph mining algorithm to three graph representations of three-dimensional (3D) structure. In each graph, vertex represents amino acid. Vertex-residues are connected by edges using approaches: first, based on simple distance threshold between contact residues; second the Delaunay tessellation from computational...

10.1145/974614.974655 article EN 2004-01-01

Scheduling task parallelism on multi-socket multicore systems

OPENALEX - Publications

Stephen L. Olivier Allan Porterfield Kyle Wheeler Jan F. Prins

The recent addition of task parallelism to the OpenMP shared memory API allows programmers express concurrency at a high level abstraction and places burden scheduling parallel execution on run time system. This is welcome development for scientific computing as supercomputer nodes grow "fatter" with multicore manycore processors. But efficient tasks modern multi-socket systems requires careful consideration an increasingly complex hierarchy, including caches NUMA characteristics. In this...

10.1145/1988796.1988804 article EN 2011-05-31

Deep Sequencing Shows Multiple Oligouridylations Are Required for 3′ to 5′ Degradation of Histone mRNAs on Polyribosomes

OPENALEX - Publications

Michael K. Slevin Stacie Meaux Joshua D. Welch Rebecca L. Bigler Paula L. Miliani de Marval and 4 more

10.1016/j.molcel.2014.02.027 article EN publisher-specific-oa Molecular Cell 2014-03-01

NeoSplice: a bioinformatics method for prediction of splice variant neoantigens

OPENALEX - Publications

Shengjie Chai Christof C. Smith Tavleen K. Kochar Sally A. Hunsucker Wolfgang Beck and 6 more

Splice variant neoantigens are a potential source of tumor-specific antigen (TSA) that shared between patients in variety cancers, including acute myeloid leukemia. Current tools for genomic prediction splice demonstrate promise. However, many have not been well validated with simulated and/or wet lab approaches, no studies published presented targeted immunopeptidome mass spectrometry approach designed specifically identification predicted neoantigens.In this study, we describe NeoSplice,...

10.1093/bioadv/vbac032 article EN cc-by Bioinformatics Advances 2022-01-01

SMD: visual steering of molecular dynamics for protein design

OPENALEX - Publications

John Leech Jan F. Prins J. J. Hérmans

SMD, a system for interactively steering molecular dynamics calculations of protein molecules, includes computation, visualization, and communication components. Biochemists can "tug" molecules into different shapes by specifying external forces in the graphical interface, which are added to internal representing atomic bonds nonbonded interactions. SMD provides new tool biochemists use exploring structure proposed designs, as well more general applications such model itself. Its primary is...

10.1109/99.556511 article EN IEEE Computational Science and Engineering 1996-01-01

Comparing Graph Representations of Protein Structure for Mining Family-Specific Residue-Based Packing Motifs

OPENALEX - Publications

Jun Huan Deepak Bandyopadhyay Wei Wang Jack Snoeyink Jan F. Prins and 1 more

We find recurring amino-acid residue packing patterns, or spatial motifs, that are characteristic of protein structural families, by applying a novel frequent subgraph mining algorithm to graph representations three-dimensional structure. Graph nodes represent amino acids, and edges chosen in one three ways: first, using threshold for contact distance between residues; second, Delaunay tessellation; third, the recently developed almost-Delaunay edges. For set graphs representing family from...

10.1089/cmb.2005.12.657 article EN Journal of Computational Biology 2005-07-01

Dynamic Load Balancing of Unbalanced Computations Using Message Passing

OPENALEX - Publications

James Dinan Stephen L. Olivier Gerald Sabin Jan F. Prins P. Sadayappan and 1 more

This paper examines MPI's ability to support continuous, dynamic load balancing for unbalanced parallel applications. We use an tree search benchmark (UTS) compare two approaches, 1) work sharing using a centralized queue, and 2) stealing explicit polling handle steal requests. Experiments indicate that in addition parameter defining the granularity of balancing, message-passing paradigms require additional parameters such as intervals manage runtime overhead. Using these parameters, we...

10.1109/ipdps.2007.370581 article EN 2007-01-01

Coming Soon ...