NFDI4DS | UHH-SEMS - Publication Details

Rashel Fam

ORCID: 0000-0003-2842-3227

About

Contact & Profiles

A5012903555

Research Areas

Natural Language Processing Techniques
Topic Modeling
Artificial Intelligence in Games
Speech and dialogue systems
Authorship Attribution and Profiling
Machine Learning in Bioinformatics
Text and Document Classification Technologies
Second Language Acquisition and Learning
Educational Technology Systems
Multimodal Machine Learning Applications
Lexicography and Language Studies
Linguistic Variation and Morphology
Handwritten Text Recognition Techniques
Advanced Text Analysis Techniques
Linguistics and Language Analysis
Syntax, Semantics, Linguistic Variation

Waseda University
2018-2024

University of Indonesia
2013-2014

Designing an Indonesian part of speech tagset and manually tagged Indonesian corpus

OPENALEX - Publications

Arawinda Dinakaramani Rashel Fam Andry Luthfi Hendra Manurung

We describe our work on designing a linguistically principled part of speech (POS) tagset for the Indonesian language. The process involves a detailed study and analysis of existing tagsets and the manual tagging of an Indonesian corpus. The results of this work are an Indonesian POS tagset consisting of 23 tags and an Indonesian corpus of over 250,000 lexical tokens that have been manually tagged using this tagset.

10.1109/ialp.2014.6973519 article EN 2014-10-01

Building an Indonesian rule-based part-of-speech tagger

Rashel Fam Andry Luthfi Arawinda Dinakaramani Hendra Manurung

This paper describes work on a part-of-speech tagger for the Indonesian language by employing a rule-based approach. The system tokenizes documents while also considering multi-word expressions and recognizes named entities. It then applies tags to every token, starting from closed-class words to open-class words, and disambiguates the tags based on a set of manually defined rules. The system currently obtains an accuracy of 79% on a manually tagged corpus of roughly 250,000 tokens.

10.1109/ialp.2014.6973521 article EN 2014-10-01

Neural Morphological Segmentation Model for Mongolian

Weihua Wang Rashel Fam Feilong Bao Yves Lepage Guanglai Gao

Morphological segmentation is useful for processing Mongolian. In this paper, we manually build a morphological segmentation data set for Mongolian. We then present a character-based encoder-decoder model with an attention mechanism to perform the morphological segmentation task. We further investigate the influence of analogy features extracted from scratch and improve the performance of our model using a multi-language setting. Experimental results show that our model provides a strong baseline for Mongolian morphological segmentation. The analogy features provide information to the system. The use of the multi-language setting shows the capability to acquire...

10.1109/ijcnn.2019.8852050 article EN 2019 International Joint Conference on Neural Networks (IJCNN) 2019-07-01

Rashel Fam Yves Lepage

This paper presents the system submitted by IPS-WASEDA University for CoNLL-SIGMORPHON 2018 Shared Task 1: Type level inflection. We develop a system based on a holistic approach which considers the whole word form as a unit, instead of breaking it into smaller pieces (e.g., morphemes) as the baseline systems do. We also implement an encoder-decoder model, which has recently become the new standard in many natural language processing (NLP) tasks. The results show that the neural model outperforms the baseline and our holistic approach on bigger resources...

10.18653/v1/k18-3003 article EN cc-by Proceedings of the CoNLL-SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection 2018-01-01

A study of universal morphological analysis using morpheme-based, holistic, and neural approaches under various data size conditions

Rashel Fam Yves Lepage

10.1007/s10472-024-09944-8 article EN Annals of Mathematics and Artificial Intelligence 2024-05-11

Organising lexica into analogical grids: a study of a holistic approach for morphological generation under various sizes of data in various languages

Rashel Fam Yves Lepage

Morphological generation is a task where, given a lemma and a morphosyntactic description of the target form, we are asked to generate the target form. Knowing that syntactic and semantic relations to other forms are reflected by the word form itself, we show how to exploit these relations between forms holistically, that is, as a whole, to derive word forms without even breaking them into morphemes. Experimental results show that organising lexica into analogical grids is able to improve the accuracy of morphological generation by up to 8% in low data scenarios. Our holistic approach always performs...

10.1080/0952813x.2022.2078890 article EN Journal of Experimental & Theoretical Artificial Intelligence 2022-06-01

A Study of Analogical Density in Various Corpora at Various Granularity

Rashel Fam Yves Lepage

In this paper, we inspect the theoretical problem of counting the number of analogies between sentences contained in a text. Based on this, we measure the analogical density of the text. We focus on analogy at the sentence level, based on the level of form rather than on the level of semantics. Experiments are carried out on two different corpora in six European languages known to have various levels of morphological richness. Corpora are tokenised using several tokenisation schemes: character, sub-word and word. For the sub-word tokenisation scheme, we employ popular sub-word models: unigram language...

10.3390/info12080314 article EN cc-by Information 2021-08-05

Poetry generation for Bahasa Indonesia using a constraint satisfaction approach

Rashel Fam Hendra Manurung

This paper describes work on a poetry generator that is capable of generating poems in Indonesian based on certain contexts by employing a constraint satisfaction approach. The system retrieves language resources such as templates and slot fillers and combines them to instantiate lines, which in turn are composed into poems according to a set of given constraints. The output of this system was evaluated through an online questionnaire involving 180 respondents. The results showed that poems generated using the full set of constraints were consistently measured...

10.1109/icacsis.2013.6761579 article EN 2013-09-01

A study of analogical grids extracted using feature vectors on varying vocabulary sizes in Indonesian

Rashel Fam Yves Lepage

Indonesian, as an agglutinating language, is known for its derivative morphological richness. Word forms are constructed by combining a stem and affixes. In this paper, we study the influence of surface form information in analogical grids extracted from a set of word forms with varying sizes. Each word form is represented as a feature vector. In this experiment setting, we consider three features: characters, affixes, and morphosyntactic definition. The sizes and saturation of the grids are then observed to characterize the grids.

10.1109/icacsis47736.2019.8979864 article EN 2019-10-01
