Finite state and Constraint Grammar based analysers, proofing tools and other resources
Digit-strings are used in place of numerals. They inflect with colon, like acronyms, and compound with hyphen only.
Digits are constructed as several cyclic structures: integers, decimals or roman numerals. Zero alone works quite differently:
0: 0+Num+Sg+Nom
13+Num+Card+Sg+Nom
100+Num+Card+Sg+Nom
0000005550000000+Num+Card+Sg+Nom
**LEXICON ARABICLOOP_pirinen** essentially allows any number-sign combination, but is like the other languages
**LEXICON ARABICLOOP_pirinen** is for entries not looping back
The digit strings that end in 10 to 12 + 6n 0’s are inflected alike:
1000000000+Num+Card+Sg+Nom
300000000000+Num+Card+Sg+Nom
123456000000000000000000+Num+Card+Sg+Nom
The digit strings that end in 6 to 9 + 6n 0’s are inflected alike:
1000000+Num+Card+Sg+Nom
300000000+Num+Card+Sg+Nom
123456000000000000000+Num+Card+Sg+Nom
Decimal digit strings start with any number of digits 0 to 9, followed by the decimal separator, a comma. The decimal dot may be allowed as a substandard variant.
1,0+Num+Sg+Nom
314,1+Num+Sg+Nom
The decimal digit strings end in any number of digits 0 to 9 and are inflected according to the last part.
3,141+Num+Sg+Nom
123,345+Num+Sg+Nom
Decimal digit strings with a dot may be allowed as a substandard option, with a corresponding analysis.
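The shape of decimal digit strings described above can be sketched outside the lexicon as two regular expressions. This is only an illustration in Python, not the lexc implementation: the function name `classify_decimal` and the labels `"standard"` and `"substandard"` are hypothetical, and the sketch assumes at least one digit on each side of the separator, which the description does not state explicitly.

```python
import re

# Assumption: at least one digit is required before and after the separator.
_COMMA_DECIMAL = re.compile(r"[0-9]+,[0-9]+")   # standard: comma separator
_DOT_DECIMAL = re.compile(r"[0-9]+\.[0-9]+")    # substandard: dot separator


def classify_decimal(digits):
    """Return 'standard', 'substandard', or None for a digit string.

    The labels are hypothetical names for this sketch; they are not
    tags used by the analyser.
    """
    if _COMMA_DECIMAL.fullmatch(digits):
        return "standard"
    if _DOT_DECIMAL.fullmatch(digits):
        return "substandard"
    return None
```

For instance, `classify_decimal("3,141")` falls in the comma branch, while `classify_decimal("3.141")` is only accepted by the dot branch.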
Roman numerals are composed of the symbols M, D, C, L, X, V, I in ascending scale and some combinations of them. They denote ordinal numbers and inflect like ordinals.
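As a rough illustration of how such numerals are built from thousands, hundreds, tens and ones, the following Python sketch checks a string against the conventional Roman numeral pattern. It is not derived from the lexicon itself: the function name `is_roman_numeral` is hypothetical, and it is an assumption that the lexicon accepts exactly this set (for example, whether forms such as IIII are allowed is not stated here).

```python
import re

# Conventional Roman numeral pattern, one group per magnitude.
# Assumption: subtractive forms (CM, CD, XC, XL, IX, IV) are the only
# combinations allowed, and at most three repeats of C, X and I.
_ROMAN = re.compile(
    r"(M*)"                  # thousands
    r"(CM|CD|D?C{0,3})"      # hundreds
    r"(XC|XL|L?X{0,3})"      # tens
    r"(IX|IV|V?I{0,3})"      # ones
)


def is_roman_numeral(text):
    """Return True if text is a non-empty, conventionally formed Roman numeral."""
    # Every group may be empty, so the empty string must be excluded explicitly.
    return bool(text) and _ROMAN.fullmatch(text) is not None
```

Because each group is optional and the groups occur in a fixed order, a larger unit can be followed by any smaller one, but never the other way round.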
This lexicon divides into four groups:
MM+Num+Ord+Sg+Nom
MCXI+Num+Ord+Sg+Nom
CMXCIX+Num+Ord+Sg+Nom
Thousands can be followed by any of the other parts:
MII+Num+Ord+Sg+Nom
MCCCXXII+Num+Ord+Sg+Nom
Hundreds can be followed by anything but thousands:
CXXI+Num+Ord+Sg+Nom
DXXIV+Num+Ord+Sg+Nom
Tens can be followed by ones:
XIX+Num+Ord+Sg+Nom
XXVII+Num+Ord+Sg+Nom
Ones come alone:
IX+Num+Ord+Sg+Nom
VIII+Num+Ord+Sg+Nom
II+Num+Ord+Sg+Nom
This (part of) documentation was generated from src/fst/morphology/stems/digits.lexc