NFDI4DS | UHH-SEMS - Publication Details

Indrė Žliobaitė

ORCID: 0000-0003-2427-5407

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5037208593

Research Areas

Data Stream Mining Techniques
Evolution and Paleontology Studies
Machine Learning and Data Classification
Time Series Analysis and Forecasting
Species Distribution and Climate Change
Pleistocene-Era Hominins and Archaeology
Wildlife Ecology and Conservation
Advanced Bandit Algorithms Research
Anomaly Detection Techniques and Applications
Human Mobility and Location-Based Analysis
Fault Detection and Control Systems
Evolutionary Game Theory and Cooperation
Bat Biology and Ecology Studies
Data Mining Algorithms and Applications
Explainable Artificial Intelligence (XAI)
Remote Sensing in Agriculture
Machine Learning and Algorithms
Advanced Radiotherapy Techniques
Ethics and Social Impacts of AI
Imbalanced Data Classification Techniques
Primate Behavior and Ecology
Transportation Planning and Optimization
Traffic Prediction and Management Techniques
Geology and Paleoclimatology Research
Evolution and Genetic Dynamics

University of Helsinki
2015-2024

Finnish Museum of Natural History
2018-2024

Aalto University
2012-2017

Helsinki Institute for Information Technology
2013-2016

University of Technology
2014

Bournemouth University
2011-2013

Eindhoven University of Technology
2009-2011

Tamedia (Switzerland)
2011

Vilnius University
2005-2010

Active Learning With Drifting Streaming Data

OPENALEX - Publications

Indrė Žliobaitė Albert Bifet Bernhard Pfahringer Geoffrey Holmes

In learning to classify streaming data, obtaining true labels may require major effort and incur excessive cost. Active focuses on carefully selecting as few labeled instances possible for an accurate predictive model. Streaming data poses additional challenges active learning, since the distribution change over time (concept drift) models need adapt. Conventional strategies concentrate querying most uncertain instances, which are typically concentrated around decision boundary. Changes...

10.1109/tnnls.2012.2236570 article EN IEEE Transactions on Neural Networks and Learning Systems 2013-01-17

Open challenges for data stream mining research

OPENALEX - Publications

Georg Krempl Indrė Žliobaitė Dariusz Brzeziński Eyke Hüllermeier Mark Last and 6 more

Every day, huge volumes of sensory, transactional, and web data are continuously generated as streams, which need to be analyzed online they arrive. Streaming can considered one the main sources what is called big data. While predictive modeling for streams have received a lot attention over last decade, many research approaches typically designed well-behaved controlled problem settings, overlooking important challenges imposed by real-world applications. This article presents discussion on...

10.1145/2674026.2674028 article EN ACM SIGKDD Explorations Newsletter 2014-09-25

Measuring discrimination in algorithmic decision making

OPENALEX - Publications

Indrė Žliobaitė

10.1007/s10618-017-0506-1 article EN Data Mining and Knowledge Discovery 2017-03-31

Learning under Concept Drift: an Overview

OPENALEX - Publications

Indrė Žliobaitė

Concept drift refers to a non stationary learning problem over time. The training and the application data often mismatch in real life problems. In this report we present context of concept 1. We focus on issues relevant adaptive set formation. framework terminology, formulate global picture learners design. start with formalizing for drifting Section 2 discuss adaptivity mechanisms learners. 3 overview principle chapter give general available algorithms categorize them based their...

10.48550/arxiv.1010.4784 preprint EN other-oa arXiv (Cornell University) 2010-01-01

Dealing With Concept Drifts in Process Mining

OPENALEX - Publications

R. P. Jagadeesh Chandra Bose Wil M. P. van der Aalst Indrė Žliobaitė Mykola Pechenizkiy

Although most business processes change over time, contemporary process mining techniques tend to analyze these as if they are in a steady state. Processes may suddenly or gradually. The drift be periodic (e.g., because of seasonal influences) one-of-a-kind the effects new legislation). For management, it is crucial discover and understand such concept drifts processes. This paper presents generic framework specific detect when changes localize parts that have changed. Different features...

10.1109/tnnls.2013.2278313 article EN IEEE Transactions on Neural Networks and Learning Systems 2013-10-16

Using sensitive personal data may be necessary for avoiding discrimination in data-driven decision models

OPENALEX - Publications

Indrė Žliobaitė Bart Custers

10.1007/s10506-016-9182-5 article EN Artificial Intelligence and Law 2016-05-07

Evaluation methods and decision theory for classification of streaming data with temporal dependence

OPENALEX - Publications

Indrė Žliobaitė Albert Bifet Jesse Read Bernhard Pfahringer Geoffrey Holmes

10.1007/s10994-014-5441-4 article EN Machine Learning 2014-04-25

The rise and fall of the Old World savannah fauna and the origins of the African savannah biome

OPENALEX - Publications

Ferhat Kaya Faysal Bibi Indrė Žliobaitė Jussi T. Eronen Hui Tang and 1 more

10.1038/s41559-017-0414-1 article EN Nature Ecology & Evolution 2017-12-22

On the relation between accuracy and fairness in binary classification

OPENALEX - Publications

Indrė Žliobaitė

Our study revisits the problem of accuracy-fairness tradeoff in binary classification. We argue that comparison non-discriminatory classifiers needs to account for different rates positive predictions, otherwise conclusions about performance may be misleading, because accuracy and discrimination naive baselines on same dataset vary with predictions. provide methodological recommendations sound classifiers, present a brief theoretical empirical analysis tradeoffs between non-discrimination.

10.48550/arxiv.1505.05723 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Identifying climate thresholds for dominant natural vegetation types at the global scale using machine learning: Average climate versus extremes

OPENALEX - Publications

Rita Beigaitė Hui Tang Anders Bryn Olav Skarpaas Frøde Stordal and 2 more

The global distribution of vegetation is largely determined by climatic conditions and feeds back into the climate system. To predict future changes in response to change, it crucial identify understand key patterns processes that couple climate. Dynamic models (DGVMs) have been widely applied describe types their dynamics change. As a process-based approach, partly relies on hard-coded thresholds constrain vegetation. What implement DGVMs how replace them with more descriptions remain among...

10.1111/gcb.16110 article EN cc-by-nc Global Change Biology 2022-02-24

Quantifying explainable discrimination and removing illegal discrimination in automated decision making

OPENALEX - Publications

Faisal Kamiran Indrė Žliobaitė Toon Calders

10.1007/s10115-012-0584-8 article EN Knowledge and Information Systems 2012-11-17

Next challenges for adaptive learning systems

OPENALEX - Publications

Indrė Žliobaitė Albert Bifet Mohamed Medhat Gaber Bogdan Gabryś João Gama and 2 more

Learning from evolving streaming data has become a 'hot' research topic in the last decade and many adaptive learning algorithms have been developed. This was stimulated by rapidly growing amounts of industrial, transactional, sensor other business that arrives real time needs to be mined time. Under such circumstances, constant manual adjustment models is in-efficient with increasing becoming infeasible. Nevertheless, are still rarely employed applications practice. In light structurally...

10.1145/2408736.2408746 article EN ACM SIGKDD Explorations Newsletter 2012-12-10

Reconciling taxon senescence with the Red Queen’s hypothesis

OPENALEX - Publications

Indrė Žliobaitė Mikael Fortelius Nils Chr. Stenseth

10.1038/nature24656 article EN Nature 2017-11-28

Eco-ISEA3H, a machine learning ready spatial database for ecometric and species distribution modeling

OPENALEX - Publications

Michael Mechenich Indrė Žliobaitė

Abstract We present the Eco-ISEA3H database, a compilation of global spatial data characterizing climate, geology, land cover, physical and human geography, geographic ranges nearly 900 large mammalian species. The are tailored for machine learning (ML)-based ecological modeling, intended primarily continental- to global-scale ecometric species distribution modeling. Such models trained on present-day applied geologic past, or future scenarios climatic environmental change. Model training...

10.1038/s41597-023-01966-x article EN cc-by Scientific Data 2023-02-07

Change with Delayed Labeling: When is it Detectable?

OPENALEX - Publications

Indrė Žliobaitė

Handling changes over time in supervised learning (concept drift) lately has received a great deal of attention, number adaptive strategies have been developed. Most them make an optimistic assumption that the new labels become available immediately. In real sequential classification tasks it is often unrealistic due to task specific delayed labeling or associated costs. We address problem change detectability, given, are not available. this analytical study we look at space from...

10.1109/icdmw.2010.49 article EN IEEE ... International Conference on Data Mining workshops 2010-12-01

Adaptive Preprocessing for Streaming Data

OPENALEX - Publications

Indrė Žliobaitė Bogdan Gabryś

Many supervised learning approaches that adapt to changes in data distribution over time (e.g., concept drift) have been developed. The majority of them assume the comes already preprocessed or preprocessing is an integral part a algorithm. In real-application tasks, from, e.g., sensor readings, typically noisy, contain missing values, redundant features, and very large model development efforts devoted preprocessing. As evolving time, models need be able automatically. From practical...

10.1109/tkde.2012.147 article EN IEEE Transactions on Knowledge and Data Engineering 2012-08-22

Coming Soon ...