NFDI4DS | UHH-SEMS - Publication Details

Jan Lause

ORCID: 0000-0003-0946-412X

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5085123886

Research Areas

Single-cell and spatial transcriptomics
Gene expression and cancer classification
Retinal Development and Disorders
Statistical Methods and Inference
Neural dynamics and brain function
Visual perception and processing mechanisms
Cancer Genomics and Diagnostics
Extracellular vesicles in disease
Artificial Intelligence in Healthcare and Education
Photoreceptor and optogenetics research
Gene Regulatory Network Analysis
Neuroscience and Neuropharmacology Research
Cell Image Analysis Techniques
Topic Modeling

Hertie Institute for Clinical Brain Research
2023-2024

University of Tübingen
2020-2024

Bernstein Center for Computational Neuroscience Tübingen
2021-2023

Tübingen AI Center
2023

Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data

OPENALEX - Publications

Jan Lause Philipp Berens Dmitry Kobak

Standard preprocessing of single-cell RNA-seq UMI data includes normalization by sequencing depth to remove this technical variability, and nonlinear transformation stabilize the variance across genes with different expression levels. Instead, two recent papers propose use statistical count models for these tasks: Hafemeister Satija (Genome Biol 20:296, 2019) recommend using Pearson residuals from negative binomial regression, while Townes et al. 20:295, fitting a generalized PCA model....

10.1186/s13059-021-02451-7 article EN cc-by Genome biology 2021-09-06

Delving into ChatGPT usage in academic writing through excess vocabulary

OPENALEX - Publications

Dmitry Kobak Rita González Márquez Emőke-Ágnes Horvát Jan Lause

Recent large language models (LLMs) can generate and revise text with human-level performance, have been widely commercialized in systems like ChatGPT. These come clear limitations: they produce inaccurate information, reinforce existing biases, be easily misused. Yet, many scientists using them to assist their scholarly writing. How wide-spread is LLM usage the academic literature currently? To answer this question, we use an unbiased, large-scale approach, free from any assumptions on...

10.48550/arxiv.2406.07016 preprint EN arXiv (Cornell University) 2024-06-11

The art of seeing the elephant in the room: 2D embeddings of single-cell data do make sense

OPENALEX - Publications

Jan Lause Dmitry Kobak Philipp Berens

A recent paper in

10.1101/2024.03.26.586728 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2024-03-28

The art of seeing the elephant in the room: 2D embeddings of single-cell data do make sense

OPENALEX - Publications

Jan Lause Philipp Berens Dmitry Kobak

A recent paper claimed that t -SNE and UMAP embeddings of single-cell datasets are “specious” fail to capture true biological structure. The authors argued such as arbitrary misleading forcing the data into an elephant shape. Here we show this conclusion was based on inadequate limited metrics embedding quality. More appropriate quantifying neighborhood class preservation reveal in room: while do not preserve high-dimensional distances, they can nevertheless provide biologically relevant information.

10.1371/journal.pcbi.1012403 article EN cc-by PLoS Computational Biology 2024-10-02

Retinal horizontal cells use different synaptic sites for global feedforward and local feedback signaling

OPENALEX - Publications

Christian Behrens Shubhash Chandra Yadav Maria M. Korympidou Yue Zhang Silke Haverkamp and 13 more

10.1016/j.cub.2021.11.055 article EN publisher-specific-oa Current Biology 2021-12-14

Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data

OPENALEX - Publications

Jan Lause Philipp Berens Dmitry Kobak

Abstract Background Standard preprocessing of single-cell RNA-seq UMI data includes normalization by sequencing depth to remove this technical variability, and nonlinear transformation stabilize the variance across genes with different expression levels. Instead, two recent papers propose use statistical count models for these tasks: Hafemeister & Satija [1] recommend using Pearson residuals from negative binomial regression, while Townes et al. [2] fitting a generalized PCA model. Here,...

10.1101/2020.12.01.405886 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2020-12-02

Most discriminative stimuli for functional cell type identification

OPENALEX - Publications

Max F. Burg Thomas Zenkel Michaela Vystrčilová Jonathan Oesterle Larissa Höfling and 13 more

Identifying cell types and understanding their functional properties is crucial for unraveling the mechanisms underlying perception cognition. In retina, can be identified by carefully selected stimuli, but this requires expert domain knowledge biases procedure towards previously known types. visual cortex, it still unknown what exist how to identify them. Thus, unbiased identification of in retina new approaches are needed. Here we propose an optimization-based clustering approach using...

10.48550/arxiv.2401.05342 preprint EN cc-by-nc-sa arXiv (Cornell University) 2024-01-01

berenslab/umi-normalization: Submission v3

OPENALEX - Publications

Jan Lause

10.5281/zenodo.5150534 article EN 2021-07-31

Compound models and Pearson residuals for normalization of single-cell RNA-seq data without UMIs

OPENALEX - Publications

Jan Lause Christoph Ziegenhain Leonard Hartmanis Philipp Berens Dmitry Kobak

Abstract Recent work employed Pearson residuals from Poisson or negative binomial models to normalize UMI data. To extend this approach non-UMI data, we model the additional amplification step with a compound distribution: assume that sequenced RNA molecules follow distribution, and are then replicated following an distribution. We show how leads residuals, which yield meaningful gene selection embeddings of Smart-seq2 datasets. Further, suggest distributions across several sequencing...

10.1101/2023.08.02.551637 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-08-05

Coming Soon ...