NFDI4DS | UHH-SEMS - Publication Details

Automatic diacritization of Arabic text using recurrent neural networks

OPENALEX - Publications

Gheith A. Abandah Alex Graves Balkees Al-Shagoor Alaa Arabiyat Fuad Jamour and 1 more

10.1007/s10032-015-0242-2 article EN International Journal on Document Analysis and Recognition (IJDAR) 2015-03-11

ScaleMine: Scalable Parallel Frequent Subgraph Mining in a Single Large Graph

OPENALEX - Publications

Ehab Abdelhamid Ibrahim Abdelaziz Panos Kalnis Zuhair Khayyat Fuad Jamour

Frequent Subgraph Mining is an essential operation for graph analytics and knowledge extraction. Due to its high computational cost, parallel solutions are necessary. Existing approaches either suffer from load imbalance, or communication synchronization overheads. In this paper we propose ScaleMine; a novel frequent subgraph mining system single large graph. ScaleMine introduces two-phase approach. The first phase approximate; it quickly identifies subgraphs that with probability, while...

10.1109/sc.2016.60 article EN 2016-11-01

Parallel Algorithm for Incremental Betweenness Centrality on Large Graphs

OPENALEX - Publications

Fuad Jamour Spiros Skiadopoulos Panos Kalnis

Betweenness centrality quantifies the importance of nodes in a graph many applications, including network analysis, community detection and identification influential users. Typically, graphs such applications evolve over time. Thus, computation betweenness should be performed incrementally. This is challenging because updating even single edge may trigger all-pairs shortest paths entire graph. Existing approaches cannot scale to large graphs: they either require excessive memory (i.e.,...

10.1109/tpds.2017.2763951 article EN IEEE Transactions on Parallel and Distributed Systems 2017-10-17

Recognizing handwritten Arabic words using grapheme segmentation and recurrent neural networks

OPENALEX - Publications

Gheith A. Abandah Fuad Jamour Esam A. AlQaralleh

10.1007/s10032-014-0218-7 article EN International Journal on Document Analysis and Recognition (IJDAR) 2014-02-28

Scalemine: scalable parallel frequent subgraph mining in a single large graph

OPENALEX - Publications

Ehab Abdelhamid Ibrahim Abdelaziz Panos Kalnis Zuhair Khayyat Fuad Jamour

Frequent Subgraph Mining is an essential operation for graph analytics and knowledge extraction. Due to its high computational cost, parallel solutions are necessary. Existing approaches either suffer from load imbalance, or communication synchronization overheads. In this paper we propose ScaleMine; a novel frequent subgraph mining system single large graph. ScaleMine introduces two-phase approach. The first phase approximate; it quickly identifies subgraphs that with probability, while...

10.5555/3014904.3014986 article EN IEEE International Conference on High Performance Computing, Data, and Analytics 2016-11-13

A Benchmark for Betweenness Centrality Approximation Algorithms on Large Graphs

OPENALEX - Publications

Ziyad AlGhamdi Fuad Jamour Spiros Skiadopoulos Panos Kalnis

Betweenness centrality quantifies the importance of graph nodes in a variety applications including social, biological and communication networks. Its computation is very costly for large graphs; therefore, many approximate methods have been proposed. Given lack golden standard, accuracy most evaluated on tiny graphs not guaranteed to be representative realistic datasets that are orders magnitude larger. In this paper, we develop BeBeCA, benchmark betweenness approximation graphs....

10.1145/3085504.3085510 article EN 2017-06-05

Linguistically-Enriched and Context-AwareZero-shot Slot Filling

OPENALEX - Publications

A. B. Siddique Fuad Jamour Vagelis Hristidis

Slot filling is identifying contiguous spans of words in an utterance that correspond to certain parameters (i.e., slots) a user request/query. one the most important challenges modern task-oriented dialog systems. Supervised approaches have proven effective at tackling this challenge, but they need significant amount labeled training data given domain. However, new domains unseen training) may emerge after deployment. Thus, it imperative these models seamlessly adapt and fill slots from...

10.1145/3442381.3449870 article EN 2021-04-19

Generalized Zero-shot Intent Detection via Commonsense Knowledge

OPENALEX - Publications

A. B. Siddique Fuad Jamour Luxun Xu Vagelis Hristidis

Identifying user intents from natural language utterances is a crucial step in conversational systems that has been extensively studied as supervised classification problem. However, practice, new emerge after deploying an intent detection model. Thus, these models should seamlessly adapt and classify with both seen unseen -- deployment they do not have training data. The few existing target this setting rely heavily on the data of consequently overfit to intents, resulting bias misclassify...

10.1145/3404835.3462985 article EN Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval 2021-07-11

Matrix Algebra Framework for Portable, Scalable and Efficient Query Engines for RDF Graphs

OPENALEX - Publications

Fuad Jamour Ibrahim Abdelaziz Chen Yuanzhao Panos Kalnis

Existing query engines for RDF graphs follow one of two design paradigms: relational or graph-based. We explore sparse matrix algebra as a third paradigm and propose MAGiQ: framework implementing SPARQL that are portable on various hardware architectures, scalable over thousands compute nodes, efficient very large datasets. MAGiQ represents the graph defines domain-specific language algebraic operations. queries translated into programs oblivious to underlying computing infrastructure....

10.1145/3302424.3303962 article EN 2019-03-22

App-Aware Response Synthesis for User Reviews

OPENALEX - Publications

Umar Farooq A. B. Siddique Fuad Jamour Zhijia Zhao Vagelis Hristidis

Hundreds of thousands mobile app users post their reviews online. Responding to user promptly and satisfactorily improves application ratings, which is key popularity success. The proliferation such makes it virtually impossible for developers keep up with responding manually. To address this challenge, recent work has shown the possibility automatic response generation by training a seq2seq model large collection review-response pairs. However, because pairs are aggregated from many...

10.1109/bigdata50022.2020.9377983 article EN 2021 IEEE International Conference on Big Data (Big Data) 2020-12-10

Recognizing handwritten Arabic script through efficient skeleton-based grapheme segmentation algorithm

OPENALEX - Publications

Gheith A. Abandah Fuad Jamour

To recognize unlimited set of handwritten Arabic words, an efficient segmentation algorithm is needed to segment these cursive words into a limited primal graphemes. We propose rule-based that segments graphemes through collecting special feature points from the word skeleton. The development this motivated by need solve problems and limitations available in state-of-the-art algorithms area. preliminary evaluation proposed promising with over 96% accuracy on sample subset IFN/ENIT database.

10.1109/isda.2010.5687062 article EN 2010-11-01

A demonstration of MAGiQ

OPENALEX - Publications

Fuad Jamour Ibrahim Abdelaziz Panos Kalnis

Existing RDF engines follow one of two design paradigms: relational or graph-based. Such are typically designed for specific hardware architectures, mainly CPUs, and not easily portable to new architectures. Porting an existing engine a different architecture (e.g., many-core architectures) entails almost redesign from scratch. We explore sparse matrix algebra as third paradigm designing portable, scalable, efficient engine. demonstrate MAGiQ; approach evaluating complex SPARQL queries over...

10.14778/3229863.3236239 article EN Proceedings of the VLDB Endowment 2018-08-01

Predictable and Adaptive Goal-oriented Dialog Policy Generation

OPENALEX - Publications

Nhat Le A. B. Siddique Fuad Jamour Samet Oymak Vagelis Hristidis

Most existing commercial goal-oriented chatbots are diagram-based; i.e., they follow a rigid dialog flow to fill the slot values needed achieve user's goal. Diagram-based predictable, thus their adoption in settings; however, lack of flexibility may cause many users leave conversation before achieving On other hand, state-of-the-art research use Reinforcement Learning (RL) generate flexible policies. However, such can be unpredictable, violate intended business constraints, and require large...

10.1109/icsc50631.2021.00012 article EN 2021-01-01

Tuning Recurrent Neural Networks for Recognizing Handwritten Arabic Words

OPENALEX - Publications

Esam A. AlQaralleh Gheith A. Abandah Fuad Jamour

Artificial neural networks have the abilities to learn by example and are capable of solving problems that hard solve using ordinary rule-based programming. They many design parameters affect their performance such as number sizes hidden layers. Large slow small generally not accurate. Tuning network size is a task because space often large training long process. We use experiments techniques tune recurrent used in an Arabic handwriting recognition system. show best results achieved with...

10.4236/jsea.2013.610064 article EN cc-by Journal of Software Engineering and Applications 2013-01-01

Generating Predictable and Adaptive Dialog Policies in Single- and Multi-domain Goal-oriented Dialog Systems

OPENALEX - Publications

Nhat Le A. B. Siddique Fuad Jamour Samet Oymak Vagelis Hristidis

Most existing commercial goal-oriented chatbots are diagram-based; i.e. they follow a rigid dialog flow to fill the slot values needed achieve user’s goal. Diagram-based predictable, thus their adoption in settings; however, lack of flexibility may cause many users leave conversation before achieving On other hand, state-of-the-art research use Reinforcement Learning (RL) generate flexible policies. However, such can be unpredictable, violate intended business constraints, and require large...

10.1142/s1793351x21400109 article EN International Journal of Semantic Computing 2021-12-01

Generalized Zero-shot Intent Detection via Commonsense Knowledge

OPENALEX - Publications

A. B. Siddique Fuad Jamour Luxun Xu Vagelis Hristidis

Identifying user intents from natural language utterances is a crucial step in conversational systems that has been extensively studied as supervised classification problem. However, practice, new emerge after deploying an intent detection model. Thus, these models should seamlessly adapt and classify with both seen unseen -- deployment they do not have training data. The few existing target this setting rely heavily on the scarcely available data overfit to data, resulting bias misclassify...

10.48550/arxiv.2102.02925 preprint EN cc-by arXiv (Cornell University) 2021-01-01

QuAX

OPENALEX - Publications

Muhammad Shihab Rashid Fuad Jamour Vagelis Hristidis

Frequently Asked Questions (FAQ) are a form of semi-structured data that provides users with commonly requested information and enables several natural language processing tasks. Given the plethora such question-answer pairs on Web, there is an opportunity to automatically build large FAQ collections for any domain, as COVID-19 or Plastic Surgery. These can be used by information-seeking portals applications, AI chatbots. Automatically identifying extracting high-utility challenging...

10.1145/3459637.3482289 article EN 2021-10-26

App-Aware Response Synthesis for User Reviews

OPENALEX - Publications

Umar Farooq A. B. Siddique Fuad Jamour Zhijia Zhao Vagelis Hristidis

Responding to user reviews promptly and satisfactorily improves application ratings, which is key popularity success. The proliferation of such makes it virtually impossible for developers keep up with responding manually. To address this challenge, recent work has shown the possibility automatic response generation. However, because training review-response pairs are aggregated from many different apps, remains challenging models generate app-specific responses, which, on other hand, often...

10.48550/arxiv.2007.15793 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Linguistically-Enriched and Context-Aware Zero-shot Slot Filling

OPENALEX - Publications

A. B. Siddique Fuad Jamour Vagelis Hristidis

Slot filling is identifying contiguous spans of words in an utterance that correspond to certain parameters (i.e., slots) a user request/query. one the most important challenges modern task-oriented dialog systems. Supervised learning approaches have proven effective at tackling this challenge, but they need significant amount labeled training data given domain. However, new domains unseen training) may emerge after deployment. Thus, it imperative these models seamlessly adapt and fill slots...

10.48550/arxiv.2101.06514 preprint EN cc-by arXiv (Cornell University) 2021-01-01