Jan F. Prins

ORCID: 0000-0003-0853-6099
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Parallel Computing and Optimization Techniques
  • Distributed and Parallel Computing Systems
  • Cloud Computing and Resource Management
  • RNA Research and Splicing
  • RNA modifications and cancer
  • Advanced Data Storage Technologies
  • Logic, programming, and type systems
  • Soil Mechanics and Vehicle Dynamics
  • Cancer-related molecular mechanisms research
  • Embedded Systems Design Techniques
  • Single-cell and spatial transcriptomics
  • Protein Structure and Dynamics
  • Molecular Biology Techniques and Applications
  • Interconnection Networks and Systems
  • Distributed systems and fault tolerance
  • Algorithms and Data Compression
  • Vehicle Dynamics and Control Systems
  • Data Mining Algorithms and Applications
  • Software Testing and Debugging Techniques
  • RNA and protein synthesis mechanisms
  • Data Management and Algorithms
  • Bioinformatics and Genomic Networks
  • Refrigeration and Air Conditioning Technologies
  • Lattice Boltzmann Simulation Studies
  • Generative Adversarial Networks and Image Synthesis

University of North Carolina at Chapel Hill
2011-2022

University of North Carolina Health Care
2017-2022

Jaguar Land Rover (United Kingdom)
2016-2021

Coventry (United Kingdom)
2021

Tusculum College
2015

National Institutes of Health
2014

University of Kentucky
2010-2012

North Carolina State University
2000-2004

Elon University
1992

University of Wisconsin–Madison
1988

Michael S. Lawrence Carrie Sougnez Lee Lichtenstein Kristian Cibulskis Eric S. Lander and 95 more Stacey Gabriel Gad Getz Adrian Ally Miruna Balasundaram İnanç Birol Reanne Bowlby Denise Brooks Yaron S.N. Butterfield Rebecca Carlsen Dean Cheng Andy Chu Noreen Dhalla Ranabir Guin Robert A. Holt Steven J.M. Jones Darlene Lee Haiyan I. Li Marco A. Marra Michael Mayo Richard A. Moore Andrew J. Mungall A. Gordon Robertson Jacqueline E. Schein Payal Sipahimalani Angela Tam Nina Thiessen Tina Wong Alexei Protopopov Netty Santoso Semin Lee Michael Parfenov Jianhua Zhang Harshad S. Mahadeshwar Jiabin Tang Xiaojia Ren Sahil Seth Psalm Haseley Dong Zeng Lixing Yang Andrew Wei Xu Xingzhi Song Angeliki Pantazi Christopher A. Bristow Angela Hadjipanayis Jonathan G. Seidman Lynda Chin Peter J. Park Raju Kucherlapati Rehan Akbani Tod D. Casasent Wenbin Liu Yiling Lu Gordon B. Mills Thomas Motter John N. Weinstein Lixia Diao Jing Wang You Hong Fan Jinze Liu Kai Wang J. Todd Auman Saianand Balu Thomas Bodenheimer Elizabeth Buda D. Neil Hayes Katherine A. Hoadley Alan P. Hoyle Joshua M. Stuart Corbin D. Jones Patrick K. Kimes Yufeng Liu J. S. Marron Shaowu Meng Piotr A. Mieczkowski Lisle E. Mose Joel S. Parker Charles M. Perou Jan F. Prins Jeffrey Roach Yan Shi Janae V. Simons Darshan Singh Matthew G. Soloway Donghui Tan Umadevi Veluvolu Vonn Walter Stephen C. Waring Matthew D. Wilkerson Junyuan Wu Ni Zhao Andrew D. Cherniack Peter S. Hammerman Aaron D. Tward Chandra Sekhar Pedamallu Gordon Saksena

The Cancer Genome Atlas profiled 279 head and neck squamous cell carcinomas (HNSCCs) to provide a comprehensive landscape of somatic genomic alterations. Here we show that human-papillomavirus-associated tumours are dominated by helical domain mutations the oncogene PIK3CA, novel alterations involving loss TRAF3, amplification cycle gene E2F1. Smoking-related HNSCCs demonstrate near universal loss-of-function TP53 CDKN2A inactivation with frequent copy number including 3q26/28 11q13/22. A...

10.1038/nature14129 article EN cc-by-nc-sa Nature 2015-01-27

The accurate mapping of reads that span splice junctions is a critical component all analytic techniques work with RNA-seq data. We introduce second generation detection algorithm, MapSplice, whose focus high sensitivity and specificity in the splices as well CPU memory efficiency. MapSplice can be applied to both short (<75 bp) long (≥75 bp). not dependent on site features or intron length, consequently it detect novel canonical non-canonical splices. leverages quality diversity read...

10.1093/nar/gkq622 article EN cc-by-nc Nucleic Acids Research 2010-08-28

Frequent subgraph mining is an active research topic in the data community. A graph a general model to represent and has been used many domains like cheminformatics bioinformatics. Mining patterns from databases challenging since related operations, such as testing, generally have higher time complexity than corresponding operations on itemsets, sequences, trees, which studied extensively. We propose novel frequent algorithm: FFSM, employs vertical search scheme within algebraic framework we...

10.1109/icdm.2003.1250974 article EN 2004-04-23

The need to integrate several versions of a program into common one arises frequently, but it is tedious and time consuming task programs by hand. To date, the only available tools for assisting with integration are variants text-based differential file comparators; these limited utility because has no guarantees about how that product an behaves compared were integrated. This paper concerns design semantics-based tool automatically integrating versions. main contribution algorithm takes as...

10.1145/65979.65980 article EN ACM Transactions on Programming Languages and Systems 1989-07-01

Single cell experiments provide an unprecedented opportunity to reconstruct a sequence of changes in biological process from individual "snapshots" cells. However, nonlinear gene expression changes, genes unrelated the process, and possibility branching trajectories make this challenging problem. We develop SLICER (Selective Locally Linear Inference Cellular Expression Relationships) address these challenges. can infer highly trajectories, select without prior knowledge automatically...

10.1186/s13059-016-0975-3 article EN cc-by Genome biology 2016-05-23

One fundamental challenge for mining recurring subgraphs from semi-structured data sets is the overwhelming abundance of such patterns. In large graph databases, total number frequent can become too to allow a full enumeration using reasonable computational resources. this paper, we propose new algorithm that mines only maximal subgraphs, i.e. are not part any other subgraphs. This may exponentially decrease size output set in best case; our experiments on practical sets, reduces mined...

10.1145/1014052.1014123 article EN 2004-08-22

The need to integrate several versions of a program into common one arises frequently, but it is tedious and time consuming task programs by hand. main contribution this paper an algorithm, called integrate, that takes as input three A, B, Base, where A B are two variants Base. Whenever the changes made Base create do not “interfere” (in sense defined in paper), Integrate produces M integrates B.

10.1145/73560.73572 article EN 1988-01-01

Comprehensive sequencing of human cancers has identified recurrent mutations in genes encoding chromatin regulatory proteins. For clear cell renal carcinoma (ccRCC), three the five commonly mutated encode regulators PBRM1, SETD2, and BAP1. How these alter landscape transcriptional program ccRCC or other is not understood. Here, we alterations organization transcript profiles associated with a large cohort primary kidney tumors. By associating variation SETD2 , which encodes enzyme...

10.1101/gr.158253.113 article EN cc-by-nc Genome Research 2013-10-24

The RNA transcriptome varies in response to cellular differentiation as well environmental factors, and can be characterized by the diversity abundance of transcript isoforms. Differential transcription analysis, detection differences between transcriptomes different cells, may improve understanding cell development enable identification biomarkers that classify disease types. availability high-throughput short-read sequencing technologies provides in-depth sampling transcriptome, making it...

10.1093/nar/gks1026 article EN cc-by-nc Nucleic Acids Research 2012-11-15

Single cell experimental techniques reveal transcriptomic and epigenetic heterogeneity among cells, but how these are related is unclear. We present MATCHER, an approach for integrating multiple types of single measurements. MATCHER uses manifold alignment to infer multi-omic profiles from measurements performed on different cells the same type. Using scM&T-seq sc-GEM data, we confirm that accurately predicts true correlations between DNA methylation gene expression without using known...

10.1186/s13059-017-1269-0 article EN cc-by Genome biology 2017-07-24

Program dependence graphs were introduced by Kuck as an intermediate program representation well suited for performing optimizations, vectorization, and parallelization. There are also additional applications them internal in development environments.

10.1145/73560.73573 article EN 1988-01-01

The recent addition of task parallelism to the OpenMP shared memory API allows programmers express concurrency at a high level abstraction and places burden scheduling parallel execution on run-time system. Efficient tasks modern multi-socket multicore systems requires careful consideration an increasingly complex hierarchy, including caches non-uniform access (NUMA) characteristics. In order evaluate strategies, we extended open source Qthreads threading library implement different...

10.1177/1094342011434065 article EN The International Journal of High Performance Computing Applications 2012-02-07

Single cell RNA-seq experiments provide valuable insight into cellular heterogeneity but suffer from low coverage, 3' bias and technical noise. These unique properties of single data make study alternative splicing difficult, thus most studies have restricted analysis transcriptome variation to the gene level. To address these limitations, we developed SingleSplice, which uses a statistical model detect genes whose isoform usage shows biological significantly exceeding noise in population...

10.1093/nar/gkv1525 article EN cc-by-nc Nucleic Acids Research 2016-01-05

Finding recurring residue packing patterns, or spatial motifs, that characterize protein structural families is an important problem in bioinformatics. We apply a novel frequent subgraph mining algorithm to three graph representations of three-dimensional (3D) structure. In each graph, vertex represents amino acid. Vertex-residues are connected by edges using approaches: first, based on simple distance threshold between contact residues; second the Delaunay tessellation from computational...

10.1145/974614.974655 article EN 2004-01-01

The recent addition of task parallelism to the OpenMP shared memory API allows programmers express concurrency at a high level abstraction and places burden scheduling parallel execution on run time system. This is welcome development for scientific computing as supercomputer nodes grow "fatter" with multicore manycore processors. But efficient tasks modern multi-socket systems requires careful consideration an increasingly complex hierarchy, including caches NUMA characteristics. In this...

10.1145/1988796.1988804 article EN 2011-05-31

Splice variant neoantigens are a potential source of tumor-specific antigen (TSA) that shared between patients in variety cancers, including acute myeloid leukemia. Current tools for genomic prediction splice demonstrate promise. However, many have not been well validated with simulated and/or wet lab approaches, no studies published presented targeted immunopeptidome mass spectrometry approach designed specifically identification predicted neoantigens.In this study, we describe NeoSplice,...

10.1093/bioadv/vbac032 article EN cc-by Bioinformatics Advances 2022-01-01

SMD, a system for interactively steering molecular dynamics calculations of protein molecules, includes computation, visualization, and communication components. Biochemists can "tug" molecules into different shapes by specifying external forces in the graphical interface, which are added to internal representing atomic bonds nonbonded interactions. SMD provides new tool biochemists use exploring structure proposed designs, as well more general applications such model itself. Its primary is...

10.1109/99.556511 article EN IEEE Computational Science and Engineering 1996-01-01

We find recurring amino-acid residue packing patterns, or spatial motifs, that are characteristic of protein structural families, by applying a novel frequent subgraph mining algorithm to graph representations three-dimensional structure. Graph nodes represent amino acids, and edges chosen in one three ways: first, using threshold for contact distance between residues; second, Delaunay tessellation; third, the recently developed almost-Delaunay edges. For set graphs representing family from...

10.1089/cmb.2005.12.657 article EN Journal of Computational Biology 2005-07-01

This paper examines MPI's ability to support continuous, dynamic load balancing for unbalanced parallel applications. We use an tree search benchmark (UTS) compare two approaches, 1) work sharing using a centralized queue, and 2) stealing explicit polling handle steal requests. Experiments indicate that in addition parameter defining the granularity of balancing, message-passing paradigms require additional parameters such as intervals manage runtime overhead. Using these parameters, we...

10.1109/ipdps.2007.370581 article EN 2007-01-01
Coming Soon ...