NFDI4DS | UHH-SEMS - Publication Details

Aravind Venkatesan

ORCID: 0000-0003-4019-1940

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5008629865

Research Areas

Biomedical Text Mining and Ontologies
Semantic Web and Ontologies
Research Data Management Practices
Scientific Computing and Data Management
Bioinformatics and Genomic Networks
Genomics and Phylogenetic Studies
Genetics, Bioinformatics, and Biomedical Research
Banana Cultivation and Research
Gene expression and cancer classification
Genomics and Rare Diseases
Data Quality and Management
Topic Modeling
Cell Image Analysis Techniques
Academic Publishing and Open Access
Computational Drug Discovery Methods
Cancer Genomics and Diagnostics
Cleft Lip and Palate Research
Data Mining Algorithms and Applications
Cancer Research and Treatments
Protein Tyrosine Phosphatases
Biofuel production and bioconversion
Mycorrhizal Fungi and Plant Interactions
Plant Pathogens and Fungal Diseases
Biomedical and Engineering Education
Advanced Text Analysis Techniques

European Bioinformatics Institute
2016-2025

Wellcome Trust
2017-2025

Laboratoire d'Informatique, de Robotique et de Microélectronique de Montpellier
2016-2018

Université de Montpellier
2018

Centre National de la Recherche Scientifique
2018

RedBite (United Kingdom)
2016-2017

Institut de Recherche pour le Développement
2017

Norwegian University of Science and Technology
2010-2014

Tamil Nadu Government Dental College and Hospital
2012

The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences

OPENALEX - Publications

Rachel Drysdale Charles E. Cook Robert Petryszak Vivienne Baillie-Gerritsen Mary Barlow and 40 more

Supplementary data are available at Bioinformatics online.

10.1093/bioinformatics/btz959 article EN cc-by Bioinformatics 2020-01-07

Europe PMC in 2020

OPENALEX - Publications

Christine Ferguson Dayane Araújo L. Christine Faulk Yuci Gou Audrey Hamelers and 23 more

Abstract Europe PMC (https://europepmc.org) is a database of research articles, including peer reviewed full text articles and abstracts, preprints - all freely available for use via website, APIs bulk download. This article outlines new developments since 2017 where work has focussed on three key areas: (i) added to its core content include life science preprint abstracts special collection COVID-19-related preprints. unique as an aggregator biomedical alongside peer-reviewed with over 180...

10.1093/nar/gkaa994 article EN cc-by Nucleic Acids Research 2020-10-19

Europe PMC in 2017

OPENALEX - Publications

Maria Levchenko Yuci Gou Florian Graef Audrey Hamelers Zhan Huang and 14 more

Europe PMC (https://europepmc.org) is a comprehensive resource of biomedical research publications that offers advanced tools for search, retrieval, and interaction with the scientific literature. This article outlines new developments since 2014. In addition to delivering core database services, focuses on three areas development: individual user data integration, infrastructure support text mining. now provides accounts save search queries claim ORCIDs, as well open access profiles authors...

10.1093/nar/gkx1005 article EN cc-by Nucleic Acids Research 2017-11-13

Europe PMC in 2023

OPENALEX - Publications

Summer Rosonovski Maria Levchenko Rajat Bhatnagar U Chandrasekaran L. Christine Faulk and 17 more

Abstract Europe PMC (https://europepmc.org/) is an open access database of life science journal articles and preprints, which contains over 42 million abstracts 9 full text accessible via the website, APIs bulk download. This publication outlines new developments to platform since last update in 2020 (1) focuses on five main areas. (i) Improving discoverability, reproducibility trust preprints by indexing preprint content, enriching metadata identifying withdrawn removed preprints. (ii)...

10.1093/nar/gkad1085 article EN cc-by Nucleic Acids Research 2023-11-22

Lit-OTAR Framework for Extracting Biological Evidences from Literature

OPENALEX - Publications

Santosh Tirunagari Shyamasree Saha Aravind Venkatesan Dániel Süveges Miguel Carmona and 5 more

Abstract Summary The lit-OTAR framework, developed through a collaboration between Europe PMC and Open Targets, leverages deep learning to revolutionise drug discovery by extracting evidence from scientific literature for target identification validation. This novel framework combines Named Entity Recognition (NER) identifying gene/protein (target), disease, organism, chemical/drug within texts, entity normalisation map these entities databases like Ensembl, Experimental Factor Ontology...

10.1093/bioinformatics/btaf113 article EN cc-by Bioinformatics 2025-03-17

Reasoning with bio-ontologies: using relational closure rules to enable practical querying

OPENALEX - Publications

Ward Blondé Vladimir Mironov Aravind Venkatesan Erick Antezana Bernard De Baets and 1 more

Abstract Motivation: Ontologies have become indispensable in the Life Sciences for managing large amounts of knowledge. The use logics ontologies ranges from sound modelling to practical querying that knowledge, thus adding a considerable value. We conceive reasoning on bio-ontologies as semi-automated process three steps: (i) defining logic-based representation language; (ii) building consistent ontology using and (iii) exploiting through querying. Results: Here, we report how implemented...

10.1093/bioinformatics/btr164 article EN Bioinformatics 2011-04-05

Lit-OTAR Framework for Extracting Biological Evidences from Literature

OPENALEX - Publications

Santosh Tirunagari Shyamasree Saha Aravind Venkatesan Dániel Süveges Annalisa Buniello and 4 more

The lit-OTAR framework, developed through a collaboration between Europe PMC and Open Targets, leverages deep learning to revolutionise drug discovery by extracting evidence from scientific literature for target identification validation. This novel framework combines Named Entity Recognition (NER) identifying genes/proteins, diseases, organisms, chemicals/drugs within texts, entity normalisation map these entities databases like Ensembl, Experimental Factor Ontology (EFO), ChEMBL....

10.1101/2024.03.06.583722 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2024-03-11

OLSVis: an animated, interactive visual browser for bio-ontologies

OPENALEX - Publications

Steven Vercruysse Aravind Venkatesan Martin Kuiper

More than one million terms from biomedical ontologies and controlled vocabularies are available through the Ontology Lookup Service (OLS). Although OLS provides ample possibility for querying browsing terms, visualization of parts ontology graphs is rather limited inflexible. We created OLSVis web application, a visualiser all in database. shows customisable subgraphs ontologies. Subgraphs animated via real-time force-based layout algorithm which fully interactive: each time user makes...

10.1186/1471-2105-13-116 article EN cc-by BMC Bioinformatics 2012-07-10

SciLite: a platform for displaying text-mined annotations as a means to link research articles with biological data

OPENALEX - Publications

Aravind Venkatesan Jee-Hyub Kim Francesco Talo Michele Ide‐Smith Julien Gobeill and 5 more

<ns4:p>The tremendous growth in biological data has resulted an increase the number of research papers being published. This presents a great challenge for scientists searching and assimilating facts described those papers. Particularly, databases depend on curators to add highly precise useful information that are usually extracted by reading articles. Therefore, there is urgent need find ways improve linking literature underlying data, thereby minimising effort browsing content identifying...

10.12688/wellcomeopenres.10210.2 preprint EN cc-by Wellcome Open Research 2017-07-10

Developing data interoperability using standards: A wheat community use case

OPENALEX - Publications

Windpouire Esther Dzale Yeumo Michaël Alaux Elizabeth Arnaud Sophie Aubin Ute Baumann and 16 more

In this article, we present a joint effort of the wheat research community, along with data and ontology experts, to develop interoperability guidelines. Interoperability is ability two or more systems devices cooperate exchange data, interpret that shared information. growing concern scientific agriculture in general, as need deluge obtained through high-throughput technologies grows. Agreeing on common formats, metadata, vocabulary standards an important step obtain required level order...

10.12688/f1000research.12234.2 preprint EN cc-by F1000Research 2017-12-06

SciLite: a platform for displaying text-mined annotations as a means to link research articles with biological data

OPENALEX - Publications

Aravind Venkatesan Jee-Hyub Kim Francesco Talo Michele Ide‐Smith Julien Gobeill and 5 more

<ns4:p>Biological databases are fundamental to biological research and discovery. Database curation adds highly precise useful information, usually extracted from the literature through experts reading articles. The significant amount of time effort put in by curators, against backdrop tremendous data growth, makes manual a high value task. Therefore, there is an urgent need find ways scale efforts improving integration, linking underlying data.</ns4:p><ns4:p> As part development Europe PMC,...

10.12688/wellcomeopenres.10210.1 preprint EN cc-by Wellcome Open Research 2016-12-12

Agronomic Linked Data (AgroLD): A knowledge-based system to enable integrative biology in agronomy

OPENALEX - Publications

Aravind Venkatesan Gildas Tagny Ngompé Nordine El Hassouni Imène Chentli Valentin Guignon and 3 more

Recent advances in high-throughput technologies have resulted a tremendous increase the amount of omics data produced plant science. This increase, conjunction with heterogeneity and variability data, presents major challenge to adopt an integrative research approach. We are facing urgent need effectively integrate assimilate complementary datasets understand biological system as whole. The Semantic Web offers for integration heterogeneous their transformation into explicit knowledge thanks...

10.1371/journal.pone.0198270 article EN cc-by PLoS ONE 2018-11-30

Europe PMC annotated full-text corpus for gene/proteins, diseases and organisms

OPENALEX - Publications

Xiao Yang Shyamasree Saha Aravind Venkatesan Santosh Tirunagari Vid Vartak and 1 more

Named entity recognition (NER) is a widely used text-mining and natural language processing (NLP) subtask. In recent years, deep learning methods have superseded traditional dictionary- rule-based NER approaches. A high-quality dataset essential to fully leverage advancements. While several gold-standard corpora for biomedical entities in abstracts exist, only few are based on full-text research articles. The Europe PMC literature database routinely annotates Gene/Proteins, Diseases,...

10.1038/s41597-023-02617-x article EN cc-by Scientific Data 2023-10-19

Developing data interoperability using standards: A wheat community use case

OPENALEX - Publications

Windpouire Esther Dzale Yeumo Michaël Alaux Elizabeth Arnaud Sophie Aubin Ute Baumann and 16 more

<ns3:p>In this article, we present a joint effort of the wheat research community, along with data and ontology experts, to develop interoperability guidelines. Interoperability is ability two or more systems devices cooperate exchange data, interpret that shared information. growing concern scientific agriculture in general, as need deluge obtained through high-throughput technologies grows. Agreeing on common formats, metadata, vocabulary standards an important step obtain required level...

10.12688/f1000research.12234.1 preprint EN cc-by F1000Research 2017-10-16

ONTO-ToolKit: enabling bio-ontology engineering via Galaxy

OPENALEX - Publications

Erick Antezana Aravind Venkatesan Chris Mungall Vladimir Mironov Martin Kuiper

The biosciences increasingly face the challenge of integrating a wide variety available data, information and knowledge in order to gain an understanding biological systems. Data integration is supported by diverse series tools, but lack consistent terminology label these data still presents significant hurdles. As consequence, much remains disconnected or worse: becomes misconnected. need address this problem has spawned building large number bio-ontologies. OBOF, RDF OWL are among most...

10.1186/1471-2105-11-s12-s8 article EN cc-by BMC Bioinformatics 2010-12-01

EMBL’s European Bioinformatics Institute (EMBL-EBI) in 2024

OPENALEX - Publications

Matthew Thakur Cath Brooksbank ROBERT FINN Helen V. Firth Julia Foreman and 21 more

The European Molecular Biology Laboratory's Bioinformatics Institute (EMBL-EBI) is one of the world's leading sources public biomolecular data. Based at Wellcome Genome Campus in Hinxton, UK, EMBL-EBI six sites Laboratory, Europe's only intergovernmental life sciences organization. This overview summarizes latest developments services that data resources provide to scientific communities globally (https://www.ebi.ac.uk/services).

10.1093/nar/gkae1089 article EN cc-by Nucleic Acids Research 2024-11-28

Finding gene regulatory network candidates using the gene expression knowledge base

OPENALEX - Publications

Aravind Venkatesan Sushil Tripathi Alejandro Sanz de Galdeano Ward Blondé Astrid Lægreid and 2 more

Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which patterns and dynamics 'omics' can be interpreted. The background information required construction such is often dispersed across multitude bases in variety formats. seamless integration this one main challenges bioinformatics. Semantic Web offers powerful technologies assembly integrated that are computationally comprehensible,...

10.1186/s12859-014-0386-y article EN cc-by BMC Bioinformatics 2014-12-01

Development of a knowledge system for Big Data

OPENALEX - Publications

Ngoc Luyen Lê Anne Tireau Aravind Venkatesan Pascal Neveu Pierre Larmande

In the recent years, data deluge in many areas of scientific research brings challenges treatment and improvement agricultural data. Research bioinformatics field does not outside this trend. This paper presents some approaches aiming to solve Big Data problem by combining increase semantic search capacity on existing plant laboratories. helps us strengthen user experiments obtained infering new knowledge. To achieve this, there exist several having different characteristics using platforms....

10.1145/2912845.2912869 preprint EN 2016-06-02

Understanding life sciences data curation practices via user research

OPENALEX - Publications

Aravind Venkatesan Nikiforos Karamanis Michele Ide‐Smith Jonathan Hickford Johanna McEntyre

<ns4:p><ns4:bold>Background:</ns4:bold> Manual curation is a cornerstone of public biological data resources. However, it time-consuming process that urgently needs supportive technical solutions in the face rapid growth. Supporting scalable part mission Elixir Data Platform. Thus far, we have established infrastructure capable ingesting and aggregating text-mined outputs from multiple providers making these available via an API. This API used by Europe PMC to display specific entities...

10.12688/f1000research.19427.1 preprint EN cc-by F1000Research 2019-09-11

Wheat Data Interoperability Guidelines, Ontologies and User Cases. Recommendations from the RDA Wheat Data Interoperability Working Group

OPENALEX - Publications

Windpouire Esther Dzale Yeumo Richard Fulss Michaël Alaux Sophie Aubin Elizabeth Arnaud and 12 more

10.15497/rda00018 article EN Reproduction in Domestic Animals 2016-02-10

The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences

OPENALEX - Publications

Rachel Drysdale Charles E. Cook Robert Petryszak Vivienne Baillie-Gerritsen Mary Barlow and 12 more

Abstract Motivation Life science research in academia, industry, agriculture, and the health sector depends critically on free open data resources. ELIXIR ( www.elixir-europe.org ), European Research Infrastructure for life sciences data, has identified a set of Core Data Resources within Europe that are most fundamental importance long-term preservation biological data. We explore characteristics their usage, impact assured funding horizon to assess value as an infrastructure, understand...

10.1101/598318 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2019-04-05

Agronomic Linked Data (AgroLD): a Knowledge-based System to Enable Integrative Biology in Agronomy

OPENALEX - Publications

Aravind Venkatesan Gildas Tagny Nordine El Hassouni Imène Chentli Valentin Guignon and 3 more

Abstract Recent advances in high-throughput technologies have resulted a tremendous increase the amount of omics data produced plant science. This increase, conjunction with heterogeneity and variability data, presents major challenge to adopt an integrative research approach. We are facing urgent need effectively integrate assimilate complementary datasets understand biological system as whole. The Semantic Web offers for integration heterogeneous their transformation into explicit...

10.1101/325423 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2018-05-17

Semantic systems biology

OPENALEX - Publications

Erick Antezana Ward Blondé Aravind Venkatesan Bernard De Baets Vladimir Mironov and 1 more

The vast amounts of knowledge in the biomedical domain have paved way for a new paradigm biological research called Systems Biology, essentially an approach that relies on integration all available system single model. This promotes comprehensive understanding systems, driven by data and mathematical modelling. However, sheer volume, variation complexity current pose number hurdles management need to be overcome. Semantic Web offers various solutions these challenges. With our initiative,...

10.1145/1988688.1988756 article EN 2011-05-25

Coming Soon ...