Arabic Word Structure
Correctness of Arabic Words
Sarf can determine whether an input word is a valid Arabic word. When the input word is valid, Sarf provides its possible morphological analyses. Otherwise, Sarf reports that the word is not valid in Arabic.
Sarf provides all possible morphological analyses for an input Arabic word. Each analysis consists of the diacritized word and its morphological breakdown into prefixes, stem, and suffixes. The stem is further decomposed into its root and morphological pattern. Each analysis also carries the part of speech and a set of morpho-syntactic features such as gender and number. The analyses are ranked by how commonly each one occurs in actual language usage.
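To make the shape of an analysis concrete, here is a minimal Python sketch of a record that holds the fields listed above, together with a ranking helper. The class name `Analysis`, its field names, the `score` field, and the example scores are all assumptions made for illustration; they are not Sarf's actual data model or API.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Analysis:
    """Hypothetical container for one morphological analysis."""
    diacritized: str                 # fully diacritized word
    prefixes: List[str]              # e.g. conjunctions, the definite article
    stem: str
    suffixes: List[str]              # e.g. pronoun or number suffixes
    root: str                        # root the stem is built from
    pattern: str                     # morphological pattern of the stem
    pos: str                         # part of speech
    features: Dict[str, str] = field(default_factory=dict)
    score: float = 0.0               # assumed usage-based weight


def rank(analyses: List[Analysis]) -> List[Analysis]:
    """Order analyses from most to least common, using the assumed score."""
    return sorted(analyses, key=lambda a: a.score, reverse=True)


# Two readings of the undiacritized word "كتب" (scores are made up).
books = Analysis("كُتُب", [], "كتب", [], "كتب", "فعل", "noun",
                 {"number": "plural"}, score=0.3)
wrote = Analysis("كَتَبَ", [], "كتب", [], "كتب", "فعل", "verb",
                 {"gender": "masculine", "number": "singular"}, score=0.7)

ranked = rank([books, wrote])
```

With these illustrative scores, the verb reading is listed before the noun reading.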
Sarf constructs a final-form word from its morphological analysis. This analysis must contain root, pattern, stem, part of speech, prefixes, and suffixes.
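The synthesis step can be sketched as follows, assuming the analysis is passed as a plain dictionary. The function name `synthesize` and the dictionary keys are hypothetical, and the sketch simply concatenates the parts; a real synthesizer would also have to apply orthographic adjustments at morpheme boundaries, which are omitted here.

```python
# Fields the text says an analysis must contain (key names are assumed).
REQUIRED_FIELDS = ("root", "pattern", "stem", "pos", "prefixes", "suffixes")


def synthesize(analysis: dict) -> str:
    """Build a surface word by joining prefixes, stem, and suffixes.

    Simplification: plain concatenation, with no spelling adjustments.
    """
    missing = [key for key in REQUIRED_FIELDS if key not in analysis]
    if missing:
        raise ValueError("analysis is missing fields: " + ", ".join(missing))
    return "".join(analysis["prefixes"]) + analysis["stem"] + "".join(analysis["suffixes"])


word = synthesize({
    "root": "كتب",
    "pattern": "فعال",
    "stem": "كتاب",
    "pos": "noun",
    "prefixes": ["و", "ال"],   # conjunction "and" + definite article
    "suffixes": [],
})
```

Rejecting incomplete analyses up front mirrors the requirement that all six fields be present.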
Generation of Derivatives
Given a specific analysis, Sarf can derive all valid stems having the same root. The derivation may be limited by a specified target part of speech.
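As a rough illustration of derivation, the sketch below applies a few patterns to a triliteral root by substituting the root letters for the placeholder letters ف, ع, and ل. The pattern table, the part-of-speech labels, and the function names are assumptions for this example. The patterns are left undiacritized, and the sketch does not check whether a derived stem is actually attested for a given root, which a real system would need to do.

```python
from typing import List, Optional

# Hypothetical pattern table: (undiacritized pattern, part of speech).
PATTERNS = [
    ("فعل", "verb"),
    ("فاعل", "noun"),
    ("مفعول", "noun"),
    ("فعال", "noun"),
]


def apply_pattern(root: str, pattern: str) -> str:
    """Replace the placeholders ف/ع/ل with the three root letters."""
    if len(root) != 3:
        raise ValueError("this sketch handles triliteral roots only")
    table = str.maketrans({"ف": root[0], "ع": root[1], "ل": root[2]})
    return pattern.translate(table)


def derive(root: str, target_pos: Optional[str] = None) -> List[str]:
    """Derive stems for a root, optionally limited to one part of speech."""
    return [
        apply_pattern(root, pattern)
        for pattern, pos in PATTERNS
        if target_pos is None or pos == target_pos
    ]


all_stems = derive("كتب")
noun_stems = derive("كتب", target_pos="noun")
```

Because `str.translate` substitutes all placeholders in a single pass, roots that themselves contain ف, ع, or ل are handled without double substitution.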
Generation of Inflections
Given a specific analysis, Sarf can derive all valid inflected forms having the same stem. The inflected forms cover every valid combination of prefixes, stem, and suffixes.
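One way to picture this enumeration is a filtered Cartesian product over candidate prefixes and suffixes. In the sketch below, the function names, the tiny affix lists, and the single compatibility rule are assumptions chosen for illustration; they stand in for whatever compatibility tables the real system uses.

```python
from itertools import product
from typing import Callable, List


def inflect(stem: str,
            prefixes: List[str],
            suffixes: List[str],
            is_compatible: Callable[[str, str], bool]) -> List[str]:
    """Return prefix + stem + suffix for every compatible affix pair."""
    return [
        prefix + stem + suffix
        for prefix, suffix in product(prefixes, suffixes)
        if is_compatible(prefix, suffix)
    ]


# Illustrative affixes: "" means no affix, "ال" is the definite article,
# and "ي" is the first-person possessive suffix.
PREFIXES = ["", "ال"]
SUFFIXES = ["", "ي"]


def compatible(prefix: str, suffix: str) -> bool:
    """Assumed rule: the definite article excludes a possessive suffix."""
    return not (prefix == "ال" and suffix == "ي")


forms = inflect("كتاب", PREFIXES, SUFFIXES, compatible)
```

Here three of the four candidate combinations survive the filter; the article plus possessive combination is dropped.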
Awareness of Input Diacritics
Sarf takes diacritics in the input text into account during analysis. Diacritics found in the input are used to filter the generated analyses; if the input diacritics are determined to be wrong, they are ignored.
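The filtering behaviour described above can be sketched as a consistency check between a partially diacritized input and each fully diacritized candidate. The function names are hypothetical, and this sketch assumes that input diacritics count as "wrong" when they match none of the candidates; Sarf's actual criterion is not stated here.

```python
from typing import FrozenSet, List, Tuple

FATHA, DAMMA, KASRA, SUKUN = "\u064E", "\u064F", "\u0650", "\u0652"
# Arabic diacritics from fathatan (U+064B) through sukun (U+0652).
DIACRITICS = frozenset(chr(c) for c in range(0x064B, 0x0653))


def split_marks(word: str) -> List[Tuple[str, FrozenSet[str]]]:
    """Group each base letter with the diacritics that follow it."""
    units: List[Tuple[str, FrozenSet[str]]] = []
    for ch in word:
        if ch in DIACRITICS and units:
            letter, marks = units[-1]
            units[-1] = (letter, marks | {ch})
        else:
            units.append((ch, frozenset()))
    return units


def consistent(inp: str, candidate: str) -> bool:
    """True if letters match and every input mark appears in the candidate."""
    a, b = split_marks(inp), split_marks(candidate)
    return len(a) == len(b) and all(
        la == lb and ma <= mb for (la, ma), (lb, mb) in zip(a, b)
    )


def filter_by_diacritics(inp: str, candidates: List[str]) -> List[str]:
    """Keep consistent candidates; if none match, ignore the input marks."""
    matches = [c for c in candidates if consistent(inp, c)]
    return matches or list(candidates)


kataba = "ك" + FATHA + "ت" + FATHA + "ب" + FATHA   # "he wrote"
kutiba = "ك" + DAMMA + "ت" + KASRA + "ب" + FATHA   # "it was written"
kutub = "ك" + DAMMA + "ت" + DAMMA + "ب"            # "books"
candidates = [kataba, kutiba, kutub]

partial = filter_by_diacritics("ك" + DAMMA + "تب", candidates)
fallback = filter_by_diacritics("ك" + SUKUN + "تب", candidates)
```

A damma on the first letter keeps only the two candidates that begin with that vowel, while a sukun there matches nothing, so the marks are dropped and all candidates are returned.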
• Analyze Word
• Synthesize Word
• Get Derivatives
• Get Inflections
• Get Plural
• Get Singular