- Semantic Web and Ontologies
- Advanced Database Systems and Queries
- Data Quality and Management
- Natural Language Processing Techniques
- Biomedical Text Mining and Ontologies
- Topic Modeling
- Graph Theory and Algorithms
- Advanced Graph Neural Networks
- Service-Oriented Architecture and Web Services
- Scientific Computing and Data Management
- Data Management and Algorithms
- Web Data Mining and Analysis
- Library Science and Information Systems
- Genomics and Phylogenetic Studies
- Wikis in Education and Collaboration
- Advanced Text Analysis Techniques
- Algorithms and Data Compression
- Image Retrieval and Classification Techniques
- Mobile and Web Applications
- Digital Accessibility for Disabilities
- Bioinformatics and Genomic Networks
- Peer-to-Peer Network Technologies
- Bayesian Modeling and Causal Inference
- Research Data Management Practices
- Privacy-Preserving Technologies in Data
National Textile University
2024
Victoria University
2024
Melbourne Polytechnic
2024
Paderborn University
2018-2023
Leipzig University
2013-2022
Shanghai Jiao Tong University
2020-2021
University of Chile
2020-2021
King Abdulaziz University
2017-2021
Edith Cowan University
2018
University of Eastern Finland
2017
The Web of Data has grown enormously over the last years. Currently, it comprises a large compendium interlinked and distributed datasets from multiple domains. Running complex queries on this often requires accessing data different endpoints within one query. abundance da tasets need for running query thus motivated considerable body work SPARQL federation systems, dedicated means to access Data. However, granularity previous evaluations such systems not allowed deriving insights concerning...
Biomedical data, e.g. from knowledge bases and ontologies, is increasingly made available following open linked data principles, at best as RDF triple data. This a necessary step towards unified access to biological sets, but this still requires solutions query multiple endpoints for their heterogeneous eventually retrieve all the meaningful information. Suggested are based on federation approaches, which require submission of SPARQL queries endpoints. Due size complexity these have be...
Triplestores are data management systems for storing and querying RDF data. Over recent years, various benchmarks have been proposed to assess the performance of triplestores across different measures. However, choosing most suitable benchmark evaluating in practical settings is not a trivial task. This because experience varying workloads when deployed real applications. We address problem determining an appropriate given real-life workload by providing fine-grained comparative analysis...
The runtime optimization of federated SPARQL query engines is central importance to ensure the usability Web Data in real-world applications. efficient selection sources (SPARQL endpoints our case) as well generation optimized plans belong most important steps this respect. This paper presents CostFed, an index-assisted federation engine for processing. CostFed makes use statistical information collected from perform source and cost-based planning. In contrast state art, it relies on a...
Knowledge Graph Question Answering (KGQA) has gained attention from both industry and academia over the past decade. Researchers proposed a substantial amount of benchmarking datasets with different properties, pushing development in this field forward. Many these benchmarks depend on Freebase, DBpedia, or Wikidata. However, KGQA that Freebase DBpedia are gradually less studied used, because is defunct lacks structural validity Therefore, research gravitating toward Wikidata-based...
Abstract The rapid generation of large amounts information about the coronavirus SARS-CoV-2 and disease COVID-19 makes it increasingly difficult to gain a comprehensive overview current insights related disease. With this work, we aim support access data source on targeted especially at researchers. Our knowledge graph, C ovid P ub G raph , an RDF graph scientific publications, abides by Linked Data FAIR principles. base dataset for extraction is CORD-19, COVID-19-related which updated...
The Cancer Genome Atlas (TCGA) is a multidisciplinary, multi-institutional pilot project to create an atlas of genetic mutations responsible for cancer. One the aims this develop infrastructure making cancer related data publicly accessible, enable researchers anywhere around world make and validate important discoveries. However, in genome are organized as text archives set directories. Devising bioinformatics applications analyse such still challenging, it requires downloading very large...
The Cancer Genome Atlas (TCGA) is a multidisciplinary, multi-institutional effort to catalogue genetic mutations responsible for cancer using genome analysis techniques. One of the aims this project create comprehensive and open repository related molecular analysis, be exploited by bioinformaticians towards advancing knowledge. However, devising bioinformatics applications analyse such large dataset still challenging, as it often requires downloading archives parsing relevant text files....
Diabetes is one of the ever-increasing menace crippling millions people worldwide. It an independent risk factor for many cardiovascular diseases including medium and small vessels results in heart attack, stroke, kidney failure, blindness, lower-limb amputations. According to a World Health Organization (WHO) report estimated 1.6 million deaths were direct result diabetes. Nutrition plays vital role diabetes management alongside physical activity, drugs, insulin. Weight can help avert or...
Gathering information from the distributed Web of Data is commonly carried out by using SPARQL query federation approaches. However, fitness current approaches for real applications difficult to evaluate with benchmarks as they are either synthetic, too small in size and complexity or do not provide means a fine-grained evaluation. We propose LargeRDFBench, billion-triple benchmark which encompasses data well queries pertaining bio-medical use cases. state-of-the-art endpoint on this respect...
Several query federation engines have been proposed for accessing public Linked Open Data sources. However, in many domains, resources are sensitive and access to these is tightly controlled by stakeholders; consequently, privacy a major concern when federating queries over such datasets. In the Healthcare Life Sciences (HCLS) domain real-world datasets contain statistical information: strict ownership granted individuals working hospitals, research labs, clinical trial organisers, etc....
Development in the field of opinion mining and sentiment analysis has been rapid aims to explore views or texts on various social media sites through machine-learning techniques with sentiment, subjectivity calculations polarity. Sentiment is a natural language processing strategy used decide if information positive, negative, neutral it frequently performed literature help organizations screen brand, item client input, comprehend needs. In this paper, two strategies for proposed word...
In the last few years, SMS (Short Message Service)has made a big impact on way we communicate. Instead of communicating over phone using voice, people rather prefer not only for messaging but also information exchange. This paper proposes method building an extendable generic application which can be used to provide various types services mobile SMS. Mobile users send required through gateway that forwards it application. Given user-provided information, automatically generates appropriate query.