- Semantic Web and Ontologies
- Advanced Database Systems and Queries
- Service-Oriented Architecture and Web Services
- Geographic Information Systems Studies
- Data Management and Algorithms
- Sentiment Analysis and Opinion Mining
- Data Quality and Management
- Complex Network Analysis Techniques
- Advanced Text Analysis Techniques
- Topic Modeling
- Social Media and Politics
- Innovative Teaching and Learning Methods
- Data Mining Algorithms and Applications
- Advanced Graph Neural Networks
- Privacy, Security, and Data Protection
- 3D Modeling in Geospatial Applications
- Web Data Mining and Analysis
- Human Mobility and Location-Based Analysis
- Data Visualization and Analytics
- Misinformation and Its Impacts
- Open Education and E-Learning
- Biomedical Text Mining and Ontologies
- Privacy-Preserving Technologies in Data
- Collaboration in agile enterprises
- Noise Effects and Management
Equipes Traitement de l'Information et Systèmes
2016-2024
Centre National de la Recherche Scientifique
2015-2024
CY Cergy Paris Université
2015-2024
École Nationale Supérieure de l'Électronique et de ses Applications
2015-2024
Université Paris-Seine
2017-2020
Institut Lavoisier de Versailles
2020
New Jersey Institute of Technology
2020
RMIT University
2020
Ashoka University
2020
Foundation for Research and Technology Hellas
2010-2016
Abstract In recent years, we have witnessed the proliferation of knowledge graphs (KG) in various domains, aiming to support applications like question answering, recommendations, etc. A frequent task when integrating from different KGs is find which subgraphs refer same real-world entity, a largely known as Entity Alignment. Recently, embedding methods been used for entity alignment tasks, that learn vector-space representation entities preserves their similarity original KGs. wide variety...
With the increasing use of Web 2.0 to create, disseminate, and consume large volumes data, more information is published becomes available for potential data consumers, that is, applications/services, individual users communities, outside their production site. The most representative example this trend Linked Open Data (LOD), a set interlinked knowledge bases. main challenge in context governance within loosely coordinated organizations are publishing added-value on Web, bringing together...
In this paper, we measure and analyze the graph features of semantic Web (SW) schemas with focus on power-law degree distributions. Our main finding is that majority SW a significant number properties (respectively, classes) approximate power law for total-degree subsumed distribution. Moreover, our analysis revealed some emerging conceptual modeling practices schema developers: (1) each has few focal classes have been analyzed in detail (that is, they numerous subclasses), which are further...
How to protect people from algorithmic harms? A promising solution, although in its infancy, is impact assessment (AIA). AIAs are iterative processes used investigate the possible short and long terms societal impacts of AI systems before their use, but with ongoing monitoring periodic revisiting even after implementation. When conducted a participatory transparent fashion, they could create bridges across legal, social computer science domains, promoting accountability entity performing...
Data sharing in the European Union (EU) has gained new momentum, among others for machine learning (ML) and artificial intelligence (AI) training purposes. By enabling models' whilst preserving privacy of data, Privacy Enhancing Technologies (PETs) have therefore popularity, especially policymakers. So far, computer science research focused on advancing state-of-the-art engineering exploring trade-offs between accuracy. Meanwhile, legal scholarship began investigating challenges arising...
Spatial Data Infrastructures (SDIs) are a key asset for Europe. This paper concentrates on unsolved issues in SDIs Europe related to the management of semantic heterogeneities. It studies contributions and competences from two communities this field: cartographers, authoritative data providers, geographic information scientists one hand, computer working Web other. During several workshops organized by EuroSDR Eurogeographics organizations, authors analyzed their complementarity discovered...
Several application domains, including healthcare, smart building, and traffic monitoring, require the continuous publishing of data, also known as time series. In many cases, series are geotagged data containing sensitive personal details, thus their processing entails privacy concerns. definitions have been proposed that allow for preservation while such with differential being most prominent one. Most existing schemes protect either a single timestamp (event-level), or all per user...
The Linked Data Paradigm is a promising technology for publishing, sharing, and connecting data on the Web, which provides new perspectives integration interoperability. However, proliferation of distributed, interconnected linked sources Web poses significant challenges consistently managing vast number potentially large datasets their interdependencies. In this article we focus key problem preserving evolving structured interlinked data. We argue that issues, hinder applications users, are...
RDF Graph Summarization pertains to the process of extracting concise but meaningful summaries from Knowledge Bases (KBs) representing as close possible actual contents KB both in terms structure and data. allows for better exploration visualization underlying graphs, optimization queries or query evaluation multiple steps, understanding connections Linked Datasets many other applications. In literature, there are efforts reported presenting algorithms KBs. These though provide different...
Sensors, portable devices, and location-based services, generate massive amounts of geo-tagged, and/or location- user-related data on a daily basis. The manipulation such is useful in numerous application domains, e.g., healthcare, intelligent buildings, traffic monitoring, to name few. A high percentage these carry information users' activities other personal details, thus their sharing arise concerns about the privacy individuals involved. To enable secure—from perspective—data sharing,...
Abstract We study the dynamics of interactions between a traditional medium, New York Times journal, and its followers in Twitter, using massive dataset. It consists metadata articles published by journal during first year COVID-19 pandemic, posts Twitter large set @nytimes account along with those several other media different kind. The discussions held exclusive medium show strong dependence on they follow: @FoxNews highest similarity to each differentiation interests general group. Our...
The Linked Open Data (LOD) cloud brings together information described in RDF and stored on the web (possibly distributed) Knowledge Bases (KBs). data these KBs are not necessarily by a known schema many times it is extremely time consuming to query all interlinked order acquire necessary information. To tackle this problem, we propose method of summarizing large using approximate graph patterns calculating number instances covered each pattern. Then transform an that describes contents KB....
Online Curriculum Portals aim to support networks of instructors and learners by providing a space convergence for enhancing peer-to-peer learning interactions among individuals an educational institution. To this end, effective, open scalable e-learning systems are required acquire, store, share knowledge under the form objects (LO). In paper, we interested in exploiting semantic relationships that characterize these LOs (e.g., prerequisite, part-of or see-also) order capture access...
Abstract We present a study of the evolution political landscape during 2015 and 2019 presidential elections in Argentina, based on data obtained from micro-blogging platform Twitter. build semantic network hashtags used by all users following at least one main candidates. With this we can detect topics that are discussed society. At difference with most studies opinion social media, do not choose priori, they emerge community structure instead. assign to each user dynamical topic vector...