Finite state and Constraint Grammar based analysers, proofing tools and other resources
Digit-strings are used in place of numerals. They inflect with colon, like acronyms, and compound with hyphen only.
Digits are constructed as several cyclic structures: integers, decimals or roman numerals. Zero alone works quite differently:
0: 0+Num+Sg+Nom
13+Num+Card+Sg+Nom
100+Num+Card+Sg+Nom
0000005550000000+Num+Card+Sg+Nom
**LEXICON ARABICLOOP_pirinen ** essentially allows any number-sign combination, but is like the other lgs
**LEXICON ARABICLOOP_pirinen ** is for entries not looping back
The digit strings that end in 10 to 12 + 6n 0’s are inflected alike:
1000000000+Num+Card+Sg+Nom
300000000000+Num+Card+Sg+Nom
123456000000000000000000+Num+Card+Sg+Nom
The digit strings that end in 6 to 9 + 6n 0’s are inflected alike:
1000000+Num+Card+Sg+Nom
300000000+Num+Card+Sg+Nom
123456000000000000000+Num+Card+Sg+Nom
Decimal digit strings start with any number of digits 0 to 9, followed by decimal separator comma. The decimal dot may be allowed as substandard variant.
1,0+Num+Sg+Nom
314,1+Num+Sg+Nom
The decimal digit strings end in any number of digits 0 to 9, inflected along the last part.
3,141+Num+Sg+Nom
123,345+Num+Sg+Nom
The decimal digit strings with dot may be allowed as sub-standard option with respective analysis.
Roman numerals are composed the symbols M, D, C, L, X, V, I in ascending scale and some combinations, they denote ordinal numbers and inflect like ones.
This lexicon divides into four groups
MM+Num+Ord+Sg+Nom
MCXI+Num+Ord+Sg+Nom
CMXCIX+Num+Ord+Sg+Nom
Thousands can be followed by any of other parts
MII+Num+Ord+Sg+Nom
MCCCXXII+Num+Ord+Sg+Nom
Hundreds can be followed by anything but thousands:
CXXI+Num+Ord+Sg+Nom
DXXIV+Num+Ord+Sg+Nom
Tens can be followed by ones:
XIX+Num+Ord+Sg+Nom
XXVII+Num+Ord+Sg+Nom
Ones come alone
IX+Num+Ord+Sg+Nom
VIII+Num+Ord+Sg+Nom
II+Num+Ord+Sg+Nom
This (part of) documentation was generated from src/fst/morphology/stems/digits.lexc