- scientometrics and bibliometrics research
- Biomedical Text Mining and Ontologies
- Topic Modeling
- Research Data Management Practices
- Innovation, Technology, and Society
- Data Quality and Management
- Meta-analysis and systematic reviews
- Advanced Research in Science and Engineering
- Expert finding and Q&A systems
- Advanced Clustering Algorithms Research
- Data-Driven Disease Surveillance
- Advanced Data Processing Techniques
- Natural Language Processing Techniques
- Advanced Text Analysis Techniques
- Astronomical Observations and Instrumentation
- Corporate Governance and Management
- Online Learning and Analytics
- Regional Development and Policy
- Spatial and Panel Data Analysis
- Web visibility and informetrics
- Evaluation and Performance Assessment
- Technology Adoption and User Behaviour
- Gender and Technology in Education
- Advanced Statistical Process Monitoring
- Health and Medical Studies
German Centre for Higher Education Research and Science Studies
2017-2025
Institute for Research Information and Quality Assurance
2014
Humboldt-Universität zu Berlin
2011
Abstract Author self-citations are a somewhat controversial phenomenon. Some scholars maintain they normal, even indispensable, part of scientific referencing practice, while others claim frequently an expression vanity and self-promotion. Citations the basic data for citation network clustering, important approach to creating bottom-up, data-driven, global taxonomic systems research publications. Thus topical information content is particular interest in this context. Since it not yet known...
Abstract OpenAlex is a promising open source of scholarly metadata, and competitor to established proprietary sources, such as the Web Science Scopus. As provides its data freely openly, it permits researchers perform bibliometric studies that can be reproduced in community without licensing barriers. However, rapidly evolving contained within expanding also quickly changing, question naturally arises trustworthiness data. In this report, we will study reference coverage selected metadata...
The present study is an evaluation of three frequently used institution name disambiguation systems. Web Science normalized names and Organization Enhanced system the Scopus Affiliation ID are tested against a complete, independent for sample German public sector research organizations. as gold standard in evaluations that we perform. We coverage systems and, particular, differences number commonly bibliometric indicators. key finding institutions, studied provide indicator values have only...
OpenAlex is a promising open source of scholarly metadata, and competitor to the established proprietary sources, Web Science Scopus. As provides its data freely openly, it permits researchers perform bibliometric studies that can be reproduced in community without licensing barriers. However, as rapidly evolving contained within expanding also quickly changing, question naturally arises trustworthiness data. In this empirical paper, we will study reference metadata coverage each database...
Purpose The purpose of these experiments is to find out whether and how reading behavior might be influenced by devices. Design/methodology/approach In total, three experiments, the first one more independent from second third, investigate European Library Information Science students react electronic devices, unfamiliar as they are with them. third explore implications such rate, concentration symptoms fatigue in conjunction Test objects were Sony eBook Reader, IREX iLiad, LCD computer...
Abstract This study introduces an approach to estimate the uncertainty in bibliometric indicator values that is caused by data errors. utilizes Bayesian regression models, estimated from empirical samples, which are used predict error-free data. Through direct Monte Carlo simulation—drawing many replicates of predicted models for same input data—probability distributions can be obtained provide information on their due It demonstrated how base quantities, such as number publications certain...
Abstract By individually associating articles to basic or applied research, it is shown that are cited more frequently than ones. Dividing the subject categories of Web Science into a and an part, mean field-normalization rate referred part depending on research orientation paper analysed. this approach, distinct difference citations for parts most found. However, differences citation scores organisations found as well, but less clear. The explanation generally publish mix articles. In...
In this article I investigate the shortcomings of exact string match‐based author self‐citation detection methods. The contributions study are twofold. First, apply a fuzzy matching algorithm for and benchmark approach other common methods exclusively name‐based against manually curated ground truth sample. Near full recall can be achieved with proposed method while incurring only negligible precision loss. Second, report some important observations from results about extent latent...
Abstract Cumulative dissertations are doctoral theses comprised of multiple published articles. For studies publication activity and citation impact early career researchers, it is important to identify these articles link them their associated theses. Using a new benchmark data set, this paper reports on experiments measuring the bilingual textual similarity between, one hand, titles keywords theses, and, other articles’ abstracts. The tested methods cosine L1 distance in Vector Space Model...
Abstract This study investigates the potential of citation analysis Ph.D. theses to obtain valid and useful early career performance indicators at level university departments. For German from 1996 2018 suitability data Scopus Google Books is studied found be sufficient quantitative estimates researchers’ departmental in terms scientific recognition use their dissertations as reflected citations. citations complement each other have little overlap. Individual theses’ counts are much higher...
A perennial problem in bibliometrics is the appropriate distribution of authorship credit for coauthored publications. Several allocation methods and formulas have been introduced, but there has little empirical validation as to which method best reflects typical contributions coauthors. This paper presents a using new data set author-provided percentage contribution figures obtained from publications cumulative PhD theses by authors three countries that contain statements. The comparison...
The concept of epistemic breadth the work a researcher refers to scope their knowledge claims, as reflected in published research reports. Studies have been hampered by lack validated measure concept. Here we introduce space approach measurement and propose use semantic similarity network an author's publication record operationalize measure. In this approach, each paper has its own location common abstract vector based on content. Proximity corresponds thematic publications. Candidate...
Abstract In this study we propose and evaluate a method to automatically identify the journal publications that are related Ph.D. thesis using bibliographical data of both items. We build manually curated ground truth dataset from German cumulative doctoral theses explicitly list included publications, which match with records in Scopus database. then test supervised classification methods on task identifying correct associated among high numbers potential candidates features publication...