GiellaLT

1 alphabetically 2 by probabilities

1 rel frequency WP 2 rel freq actual corpus

only words with higher frequency in fo than in wp
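A minimal sketch of that filter, assuming both sources are available as plain token lists. The function names and the toy data are made up for illustration and are not taken from the actual pipeline. It computes relative frequencies for the corpus and for the reference (WP) and keeps only the words that are relatively more frequent in the corpus.

```python
from collections import Counter


def relative_frequencies(tokens):
    """Map each word to count / total tokens (hypothetical helper)."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def term_candidates(corpus_tokens, reference_tokens):
    """Keep words whose relative frequency in the corpus exceeds
    their relative frequency in the reference. Words missing from
    the reference are treated as having frequency 0.0 (an assumption)."""
    corpus_rf = relative_frequencies(corpus_tokens)
    reference_rf = relative_frequencies(reference_tokens)
    return {
        word: rf
        for word, rf in corpus_rf.items()
        if rf > reference_rf.get(word, 0.0)
    }


# Toy data, invented for the example only.
corpus = ["a", "b", "b", "c"]
reference = ["a", "a", "b", "d"]
candidates = term_candidates(corpus, reference)

# The two orderings mentioned above: alphabetical, and by frequency.
alphabetical = sorted(candidates)
by_frequency = sorted(candidates, key=candidates.get, reverse=True)
```

Treating unseen reference words as 0.0 lets every corpus-only word through. A real run would probably want a minimum absolute count as well.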

we are looking for terms

could be but not so frequently

-6.146 = 50 (the closer to zero, the more frequent); confidence is confidence for the pair

likelihood of these words being translations of each other

sme = dynamic compound first part nom, gen, pl

if it never changes I can add it back; the reason they are removed is to get a smaller vocabulary size

lemma for compound ok for sme

updated, with all nouns, not the ones with high containing also absolute freq

giza++ ??

n a v exit rest

árvalit+V+TV+Der2+Der/eapmi+N+SgCmp#
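A rough sketch of how an analysis string like the one above could be split into lemma and tags, for example to get the lemma of a compound part. It assumes `+` separates the lemma from the tags and that a trailing `#` marks a compound boundary. `parse_analysis` is a hypothetical helper, not part of the GiellaLT tools.

```python
def parse_analysis(analysis):
    """Split an analysis string into lemma, tags and a compound flag.

    Assumptions: '+' separates fields, the first field is the lemma,
    and a trailing '#' means the word is a non-final compound part.
    """
    compound_part = analysis.endswith("#")
    lemma, *tags = analysis.rstrip("#").split("+")
    return {"lemma": lemma, "tags": tags, "compound_part": compound_part}


parsed = parse_analysis("árvalit+V+TV+Der2+Der/eapmi+N+SgCmp#")
# parsed["lemma"] is "árvalit"; the last tag is "SgCmp"
```

Tags that contain a slash, such as `Der/eapmi`, are kept whole, since only `+` is used as a separator.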