- Gene expression and cancer classification
- Genomics and Phylogenetic Studies
- Image Retrieval and Classification Techniques
- Modular Robots and Swarm Intelligence
- Advanced Image and Video Retrieval Techniques
- Scientific Computing and Data Management
- Robotics and Automated Systems
- Chromosomal and Genetic Variations
- Educational Management and Quality
- Innovative Microfluidic and Catalytic Techniques Innovation
- Technology Adoption and User Behaviour
- Microbial Metabolic Engineering and Bioproduction
- Big Data and Business Intelligence
- Genetics, Bioinformatics, and Biomedical Research
- Natural Language Processing Techniques
- Plant Disease Resistance and Genetics
- Perovskite Materials and Applications
- Machine Learning in Materials Science
- Color Science and Applications
- Bacteriophages and microbial interactions
- Single-cell and spatial transcriptomics
- Solid-state spectroscopy and crystallography
- Handwritten Text Recognition Techniques
- Legume Nitrogen Fixing Symbiosis
- Educational Assessment and Improvement
Argonne National Laboratory
2019-2023
University of Chicago
2021-2023
University of Missouri
2022
The PathoSystems Resource Integration Center (PATRIC) is the bacterial Bioinformatics Resource Center funded by the National Institute of Allergy and Infectious Diseases (https://www.patricbrc.org). PATRIC supports bioinformatic analyses of all bacteria with a special emphasis on pathogens, offering a rich comparative analysis environment that provides users with access to over 250 000 uniformly annotated and publicly available genomes with curated metadata. PATRIC offers web-based visualization and comparative analysis tools, a private workspace in which users can...
Abstract. Background: Recent advances in high-volume sequencing technology and mining of genomes from metagenomic samples call for rapid and reliable genome quality evaluation. The current release of the PATRIC database contains over 220,000 genomes, and current metagenomic technology supports assemblies of many draft-quality genomes from a single sample, most of which will be novel. Description: We have added two quality assessment tools to the PATRIC annotation pipeline. EvalCon uses supervised machine learning to calculate an annotation consistency score. EvalG implements a variant...
Advances in robotic automation, high-performance computing, and artificial intelligence encourage us to propose large, general-purpose science factories with the scale needed to tackle large discovery problems and support thousands of scientists.
Large amounts of metagenomically-derived data are submitted to PATRIC for analysis. In the future, we expect even more jobs will use metagenomic data. One in-demand use case is the extraction of near-complete draft genomes from assembled contigs of metagenomic origin. The PATRIC metagenome binning service utilizes the PATRIC database to furnish a large, diverse set of reference genomes. We provide a new service for supervised binning and annotation of high-quality, near-complete draft genomes from metagenomic contigs. Reference genomes are assigned to putative genome bins based on the presence of single-copy universal marker roles...
Advances in robotic automation, high-performance computing (HPC), and artificial intelligence (AI) encourage us to conceive of science factories: large, general-purpose computation- and AI-enabled self-driving laboratories (SDLs) with the generality and scale needed both to tackle large discovery problems and to support thousands of scientists. Science factories require modular hardware and software that can be replicated for scale and (re)configured to support many applications. To this end, we propose a prototype science factory architecture...
The strong spin–orbit coupling (SOC) in lead halide perovskites, when inversion symmetry is lifted, has provided opportunities for investigating the Rashba effect in these systems. Moreover, the orbital moment, which, in turn, impacts the spin-pair singlet and triplet electronic states, plays a significant role in enhancing the optoelectronic properties in the presence of external magnetic fields in perovskites. Here, we investigate the effect of weak magnetic fields (<1 T) on the photoluminescence (PL) properties of CsPbBr3 nanocrystals with and without...
Abstract. The dynamics and structure of mixed phases in a complex fluid can significantly impact its material properties, such as viscoelasticity. Small-angle X-ray Photon Correlation Spectroscopy (SA-XPCS) can probe the spontaneous spatial fluctuations of the mixed phases under various in situ environments over wide spatiotemporal ranges (10⁻⁶–10³ s / 10⁻¹⁰–10⁻⁶ m). Tailored material design, however, requires searching through a massive number of sample compositions and experimental parameters, which is beyond the bandwidth of the current coherent...
Self Driving Labs (SDLs) that combine automation of experimental procedures with autonomous decision making are gaining popularity as a means of increasing the throughput of scientific workflows. The task of identifying quantities of supplied colored pigments that match a target color, the color matching problem, provides a simple and flexible SDL test case, as it requires experiment proposal, sample creation, and sample analysis, three common components in autonomous discovery applications. We present a robotic solution to the color matching problem that allows for...
Abstract. Background: Large volumes of metagenomic samples are being processed and submitted to PATRIC for analysis as reads or assembled contigs. Effective analysis of these samples requires solutions to a number of problems, including the binning of assembled, mixed, metagenomically-derived contigs into taxonomic units. Description: The PATRIC metagenome binning service utilizes the PATRIC database to furnish a large, diverse set of reference genomes. Reference genomes are assigned based on the presence of single-copy universal marker proteins in the sample, bin...
We introduce WordScape, a novel pipeline for the creation of cross-disciplinary, multilingual corpora comprising millions of pages with annotations for document layout detection. Relating visual and textual items on document pages has gained further significance with the advent of multimodal models. Various approaches proved effective for visual question answering or layout segmentation. However, the interplay of text, tables, and visuals remains challenging for a variety of document understanding tasks. In particular, many models fail to generalize well to diverse domains...