Token
A token (word, separator, etc.).
Fields
Property | Required | Type | Description |
---|---|---|---|
id | True | string | ID of the token, used to refer to it from other objects |
off | True | integer | The token's offset in its parent paragraph |
text | True | string | The token's text |
origOff | False | integer | The token's offset in the original paragraph (omitted when identical to "off") |
origText | False | string | The token's text in the original paragraph (omitted when identical to "text") |
lemma | False | string | Lemma, the canonical or dictionary form of the word |
pos | False | string | Universal part-of-speech (POS) tag |
parId | False | string | ID of the syntactic parent token; missing for the root token |
fnc | False | string | Grammatical function of this token according to Universal Dependencies (UD), e.g. nsubj, obj |
feats | False | Map[string, string] | Universal and custom features |
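
The defaulting rules in the table above (optional `origOff` and `origText` fall back to `off` and `text`, and a missing `parId` marks the root) can be sketched in Python as follows. This is a minimal illustration, not a client provided by the API: the `Token` dataclass, the `token_from_dict` helper, the `is_root` property, and the sample payload are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class Token:
    """Hypothetical in-memory mirror of the Token fields listed above."""
    id: str
    off: int
    text: str
    orig_off: int
    orig_text: str
    lemma: Optional[str] = None
    pos: Optional[str] = None
    par_id: Optional[str] = None
    fnc: Optional[str] = None
    feats: Dict[str, str] = field(default_factory=dict)

    @property
    def is_root(self) -> bool:
        # Assumption: a token without "parId" is treated as the root,
        # following the field description ("missing for the root token").
        return self.par_id is None


def token_from_dict(data: Dict[str, Any]) -> Token:
    """Build a Token from a decoded JSON object, applying the documented defaults."""
    return Token(
        id=data["id"],
        off=data["off"],
        text=data["text"],
        # "origOff" / "origText" are omitted when identical to "off" / "text".
        orig_off=data.get("origOff", data["off"]),
        orig_text=data.get("origText", data["text"]),
        lemma=data.get("lemma"),
        pos=data.get("pos"),
        par_id=data.get("parId"),
        fnc=data.get("fnc"),
        feats=dict(data.get("feats", {})),
    )


# Made-up sample payload, for illustration only.
raw_tokens = [
    {"id": "t1", "off": 0, "text": "Dogs", "origText": "dogs", "lemma": "dog",
     "pos": "NOUN", "parId": "t2", "fnc": "nsubj", "feats": {"Number": "Plur"}},
    {"id": "t2", "off": 5, "text": "bark", "lemma": "bark", "pos": "VERB",
     "fnc": "root"},
]

tokens = [token_from_dict(t) for t in raw_tokens]
root = next(t for t in tokens if t.is_root)
```

Resolving the optional fields once at load time keeps later code free of repeated "is this key present?" checks; callers that need to know whether a token was altered can compare `orig_text` with `text`.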