- Semantic Web and Ontologies
- Biomedical Text Mining and Ontologies
- Data Quality and Management
- Service-Oriented Architecture and Web Services
- Natural Language Processing Techniques
- Advanced Database Systems and Queries
- Library Science and Information Systems
- Advanced Text Analysis Techniques
- Topic Modeling
- Mathematics, Computing, and Information Processing
- Research in Social Sciences
- Web Data Mining and Analysis
- Library Collection Development and Digital Resources
- Geographic Information Systems Studies
- Culinary Culture and Tourism
- Business Process Modeling and Analysis
- Data Management and Algorithms
- Crafts, Textile, and Design
- Data Visualization and Analytics
- Access Control and Trust
- Credit Risk and Financial Regulations
- Dental Education, Practice, Research
- Distributed systems and fault tolerance
- Digital and Traditional Archives Management
- Research Data Management Practices
National Library of Finland
2016-2024
Library Network
2013-2015
Aalto University
2007-2013
University of Helsinki
2007-2009
This article presents the vision and results of creating basis for a national semantic Web content infrastructure in Finland 2003-2007. The main elements are shared open metadata schemas, core ontologies, public ontology services. Several practical applications testing demonstrating usefulness overviewed fields eculture, ehealth, egovernment, elearning, ecommerce.
Purpose In order to estimate the value of semi-automated subject indexing in operative library catalogues, study aimed investigate five different automated implementations an open source software package on a large set Swedish union catalogue metadata records, with Dewey Decimal Classification (DDC) as target classification system. It also contribute body research aboutness and related challenges evaluation. Design/methodology/approach On sample over 230,000 records close 12,000 distinct DDC...
Manually indexing documents for subject-based access is a labour-intensive process. We propose using metadata gathered from bibliographic databases to train algorithms that assist librarians in work. have developed Annif, an open source tool and microservice automated subject indexing. After training it with vocabulary existing metadata, Annif can be used assign headings new documents. tested different document collections including scientific papers, old scanned books contemporary e-books,...
The widely used paradigm of faceted browsing is limited by the fact that only one query and result set are displayed at a time. This demonstrator introduces an interaction design for parallel makes it easy user to construct view results multiple interrelated queries. offers general benefits variety application areas.
Purpose The purpose of this paper is threefold: to focus on the process multilingual concept scheme construction and challenges involved; addresses concrete faced in especially those related equivalence between terms concepts; briefly outlines translation strategies developed during construction. Design/methodology/approach analysis based experience acquired establishment Finnish thesaurus ontology service Finto as well trilingual General Ontology YSO, both which are being maintained further...
Libraries are opening up their bibliographic metadata as Linked Data. However, they have all used different data models for structuring data. Some using a FRBR-based model with several layers of entities while others use flat, record-oriented models. The proliferation limits the reusability In effect, libraries moved from MARC silos to Data incompatible sets can be difficult combine and reuse. Small modelling differences may overcome by schema mappings, but it is not clear that...
Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Scao, Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023.
Finnish libraries are in the process of preparing a tender to acquire new back-end system. The system is expected provide well operating and deep integration possibilities existing national infrastructures.