- Data Quality and Management
- Semantic Web and Ontologies
- Topic Modeling
- Data Visualization and Analytics
- Natural Language Processing Techniques
- Research Data Management Practices
- Complex Network Analysis Techniques
- Advanced Graph Neural Networks
- VLSI and FPGA Design Techniques
- Cognitive Computing and Networks
- Scientific Research and Philosophical Inquiry
- Scientific Computing and Data Management
- Library Science and Information
- Advancements in Photolithography Techniques
- Material Science and Thermodynamics
- Biomedical Text Mining and Ontologies
- Authorship Attribution and Profiling
- VLSI and Analog Circuit Testing
- Glass properties and applications
- Mathematics, Computing, and Information Processing
- Atmospheric Ozone and Climate
- Data Mining Algorithms and Applications
- Information Systems and Technology Applications
- Big Data and Business Intelligence
- Topological and Geometric Data Analysis
Siberian Branch of the Russian Academy of Sciences
2011-2024
Institute of Informatics of the Slovak Academy of Sciences
2018-2023
Novosibirsk State University
2014-2023
A.P. Ershov Institute of Informatics Systems, Siberian Branch of the Russian Academy of Sciences
2019-2021
Institute of Informatics Problems
2011-2016
Russian Academy of Sciences
2013
Research Institute of Technology (Russia)
1985
The current status of the W@DIS information system used for systematization spectroscopic data, including rovibronic transitions and energy levels, data sources is reviewed, where abbreviation stands Water Internet @ccesible Distributed Information System. Functionalities are outlined. primary emphasis on properties characterizing quality. Several examples describe interfaces to create molecular spectral line lists representation binary relations between typical individuals ontology...
Аннотация.Задача кросс-языкового сопоставления авторов и публикаций является частным случаем задачи присваивания уникального идентификатора одной той же сущности реального мира в разноязычных источниках данных.В данной работе представлены результаты экспериментов с несколькими версиями системы англоязычном источнике на основе русскоязычного источника этих версиях тестировались различные эвристики, поэтому рассматриваются те из них, которые давали наилучшие результаты.Важным элементом...
Abstract The problem of data fusion from bases and knowledge graphs in different languages is becoming increasingly important. main step such a the identification equivalent entities merging their descriptions. This known as identity resolution, or entity alignment problem. Recently, large group new methods has emerged. They look for so called “embeddings” establish equivalence by comparing embeddings. paper presents experiments with embedding-based algorithms on Russian-English dataset....
1. Ferreira A. A., Goncalves M. Laender H. F. A brief survey of automatic methods for author name disambiguation // ACM SIGMOD Record. 2012. Vol. 41. No. 2. 2. Shen Q., Wu T., Yang H., Y., Qu Cui W. Nameclarifier: visual analytics system IEEE Trans. Vis. Comput. Graph. 2017. 23. P. 141–150. 3. Apanovich Z. V., Cherepanov D. N., Marchuk G. Сross-language identity resolution and approaches to its solution Bulletin the Novosibirsk Computing Center. Series: Computer Science. 2014. 41–54. 4....
This paper describes a pipeline for extracting the author's terms and definitions from mathematical texts.We used two models: one, detecting formulas to clear text noise other, converting images into LaTeX restore deleted formulas.Experimental data show that clearing is an essential step, because it improves all quality metrics.To recognize terminology, we applied rule-based syntactic approach.The idea of "negative" rules shown here increases final precision significantly, though does not...
Knowledge graphs have come a long way in evolution from simple set of RDF triples to systems for obtaining new knowledge. While previous years semantic search was considered the main application knowledge graphs, nowadays penetrate into all areas industrial production. This work is an survey applications intended use modern
International and Russian-language data sources that provide information about Russian research-related organizations are considered. It is demonstrated contain more than most international sources, but this remains unavailable for English-language sources. Experiments on comparison integration of research in outlined. Data such as GRID, English chapters Wikipedia, Wikidata eLIBRARY.ru The work an intermediate step towards the creation open extensible knowledge graph.
Exponential size growth of such graphs as social networks, Internet graphs, etc. requires new approaches to their visualization. Along with node-link diagram representations, adjacency matrices and various hybrid representations are increasingly used for large visualizations. This survey discusses the visualization using gives examples applications where these used. We describe types patterns arising when corresponding modern networks ordered, algorithms making it possible reveal patterns....
The evolution of the concept "knowledge graph" from moment its inception to present is considered. paper also discusses how systems that position themselves as knowledge graphs has affected definition and life cycle graphs.
The approach to technology migration presented in this paper is based on a compaction and rerouting strategy. It takes as input the full-chip mask layout hierarchical description (CIF format) produces output target design rules. applicability of facilities, flexibility routing layers redistribution between different levels hierarchy, are provided by procedure for decomposition. decomposition any node extracts fragments which should be transformed means compaction. size extracted controlled...
Information about research organizations is an important attribute that enables identifying authors of scientific publications, as well analyzing the geographical distribution publications and assessing impact on citation associated with a geographic factor. Unfortunately, information national research-related often incomplete or distorted in international databases. This applies, particular, to Russian represented English-language The paper presents experiments data matching integration...
Аннотация.В данной работе описан алгоритм установления кроссязыковой идентичности авторов научных публикаций.Для их идентификации
This paper describes approaches to the vocabulary normalization and cross-language identity resolution problems that arise when LOD datasets are used populate content of scholarly knowledge bases.We have proposed several new heuristics, using additional information extracted from full text sources data.The first heuristics uses record track a person, second self-citation networks third textual analysis documents.The dataset Open Archive Russian Academy Sciences bibliographic as test examples.