- Data Quality and Management
- Privacy-Preserving Technologies in Data
- Advanced Database Systems and Queries
- Data Management and Algorithms
- Topic Modeling
- Data-Driven Disease Surveillance
- Web Data Mining and Analysis
- Natural Language Processing Techniques
- Dental Radiography and Imaging
- Access Control and Trust
- Time Series Analysis and Forecasting
Prince Sultan University
2018-2020
Applied Science Private University
2017
Australian National University
2013-2015
Real-time entity resolution (ER) is the process of matching a query record in sub-second time with records database that represent same real-world entity. To facilitate real-time on large databases, appropriate indexing approaches are required to reduce search space. Most available techniques based batch algorithms work only static databases and not suitable for ER. In this paper, we propose forest-based sorted neighborhood index uses multiple trees different sorting keys ER read-most...
Real-time Entity Resolution (ER) is the process of matching query records in subsecond time with a database that represent same real-world entity. Indexing techniques are generally used to efficiently extract set candidate from similar record, and be compared record more detail. The sorted neighborhood indexing method, which sorts compares within sliding window, has been successfully for ER large static databases. However, because it based on arrays designed batch resolves all rather than...
Entity resolution (ER) is the operation of distinguishing records that return to same real world entity. It used link among datasets and match query in real-time with existing datasets. Indexing a major step ER process reduces search space. Most indexing techniques are utilized designed work English Such may not be suitable for use other languages, such as Arabic. In this paper, enhancement has been proposed Arabic language by applying transliteration on strings before performing process....