Token
A token represents a unit of text, which could be a word, separator, or othere meaningul component.
Fields
| Property | Required | Type | Description |
|---|---|---|---|
| id | True | string | ID of the token used to refer to it from other objects |
| off | True | integer | Offset in the token's parent paragraph |
| text | True | string | Token's text |
| origOff | False | integer | Token's offset in the original paragraph (omitted when identical to "off") |
| origText | False | string | Token's text in the original paragraph (omitted when identical to "text") |
| lemma | False | string | Lemma, the canonical or dictionary form of the word |
| pos | False | string | Universal POS |
| parId | False | string | ID of the syntactic parent token; missing for the root token |
| fnc | False | string | Grammatical function of this token according to Universal Dependencies (e.g., nsubj, obj) |
| feats | False | Map[string, string] | Universal and custom features |