Sentence delimiters are the following: <.> <!> <?> <…> <¶>
TAGS AND SETS
Tags
This section lists all the tags inherited from the fst, and used as tags
in the syntactic analysis. The next section, Sets, contains sets defined
on the basis of the tags listed here, those set names are not visible in the output.
Beginning and end of sentence
BOS
EOS
Parts of speech tags
N
A
Adv
V
Pron
CS
CC
Po
Pr
Pcle
Num
Interj
ABBR
ACR
CLB
LEFT
RIGHT
WEB
PUNCT
### Semantic tags
### Syntactic tags
@+FAUXV
@+FMAINV
@-FAUXV
@-FMAINV
@-FSUBJ>
@-F<OBJ
@-FOBJ>
@-FSPRED<OBJ
@-F<ADVL
@-FADVL>
@-F<SPRED
@-F<OPRED
@-FSPRED>
@-FOPRED>
@>ADVL
@ADVL<
@<ADVL
@ADVL>
@ADVL
@HAB>
@<HAB
@>N
@>Interj
@N<
@>A
@P<
@>P
@HNOUN
@INTERJ
@>Num
@Pron<
@>Pron
@Num<
@OBJ
@<OBJ
@OBJ>
@OPRED
@<OPRED
@OPRED>
@PCLE
@COMP-CS<
@SPRED
@<SPRED
@SPRED>
@SUBJ
@<SUBJ
@SUBJ>
@PPRED
@APP
@APP-N<
@APP-Pron<
@APP>Pron
@APP-Num<
@APP-ADVL<
@VOC
@CVP
@CNP
@X
## Sets containing sets of lists and tags
This part of the file lists a large number of sets based partly upon the tags defined above, and
partly upon lexemes drawn from the lexicon.
See the sourcefile itself to inspect the sets, what follows here is an overview of the set types.
### Sets for Single-word sets
OKTA and go, and the set INITIAL for initial letters
OKTA
go
INITIAL
### Sets for word or not
WORD
REAL-WORD
WORD-NOT-de
NOT-COMMA
### Derivational affixes
DER-V
DER-N
DER-A1
DER-A
A-V
A-NOT-V
### Case sets
ADLVCASE
CASE-HALFAGREEMENT
CASE-AGREEMENT
CASE
NOT-NOM
NOT-GEN
NOT-ACC
### Verb sets
NOT-V
### Sets for finiteness and mood
REAL-NEG
MOOD-V
GC
VFIN
VFIN-POS
VFIN-NOT-IMPRT
VFIN-NOT-NEG
NOT-PRFPRC
### Sets for person
### Sets consisting of forms of "leđe" (these ones need to be rewritten)
### Pronoun sets
### Adjectival sets and their complements
### Adverbial sets and their complements
### Sets for coordinators
### Sets for adverbs that have lookalikes
Here come some adverbs that have identical twins in other POS.
If these are found in Adv contexts, we treat them as adverbs.
### Sets of elements with common syntactic behaviour
### Sets for verbs
V is all readings with a V tag in them, REAL-V should
be the ones without an N tag following the V.
The REAL-V set thus awaits a fix to the preprocess V ... N bug.
* The set COPULAS is for predicative constructions
TRANS-V is the set for verbs really taking objects
* Sets for verbs choosing oblique objects or adverbials
* **STVLIST** is the list of strictly transitive verbs. In the rules, refer not to STVLIST, but to the set STV defined below.
STRICT-TRANS-V is the set for verbs which don't let a GenAcc be a modifier of anything else than an object, e.g. Mun organiseren eatni gievkkanis. - eatni wants to be the object
### Valency sets
* **PLACE-V** Those get only not locative if the target is a member TOOL, ABSTR-TOOL or ANIMATE or CONCEPT. Selects more locatives than ONLY-PLACE-LOC-V
### Adverb sets
### Adjective sets
### NP sets defined according to their morphosyntactic features
### The PRE-NP-HEAD family of sets
These sets model noun phrases (NPs). The idea is to first define whatever can
occur in front of the head of the NP, and thereafter negate that with the
expression **WORD - premodifiers**.
The set **NOT-NPMOD** is used to find barriers between NPs.
Typical usage: ... (*1 N BARRIER NPT-NPMOD) ...
meaning: Scan to the first noun, ignoring anything that can be
part of the noun phrase of that noun (i.e., "scan to the next NP head")
### Other negatively defined morphosyntactic noun sets
### Noun sets
Nominal sets defined according to their morphophonological properties
Sets for lexeme homonymy (most of them are moved to where the actual rules are.)
The words in the set **N-PO** can be both N and Po, the set takes that into account.
### The LAHKA set family
### Nominal sets defined according to their semantical properties
* Spatial noun sets. These nouns behave like postpositions
* Time sets
* Amount sets
* Sets for nouns with morpho-syntactic preferences
* Number-related sets
* Sets for case, possessive, etc.
* Sets for nouns as pred
* Sets for animals
* Sets for things
* Sets for qualities
* Sets for things, not necessarily tools
* Sets for things such that people can be inside them:
* Sets for things such that people cannot be inside them:
* Part-whole sets for human
* Sets for places
* Sets that can both be buildings/places and represent humans
* Sets denoting relations
### Miscellaneous sets
### Border sets and their complements
### Multilingual sets
Conjunction sets
### Syntactic sets
ALLSYNTAG
NON-APP
*These were the set types.*
* * *
This (part of) documentation was generated from [src/cg3/introdisambiguation.cg3](https://github.com/giellalt/lang-smn/blob/main/src/cg3/introdisambiguation.cg3)