Token¶
Token (word, separator etc.)
Fields¶
Property |
Required |
Type |
Description |
---|---|---|---|
id |
True |
string |
ID of the token used to refer to it from other objects |
off |
True |
integer |
The offset in token’s parent paragraph |
text |
True |
string |
The token’s text |
origOff |
False |
integer |
The token’s offset in the original paragraph (omitted when identical to “off”) |
origText |
False |
string |
The token’s text in the original paragraph (omitted when identical to “text”) |
lemma |
False |
string |
Lemma, the canonical or dictionary form of the word |
pos |
False |
string |
Universal POS |
parId |
False |
string |
ID of the syntactic parent token; missing for the root token |
fnc |
False |
string |
Grammatical function of this token according to UD (nsubj, obj, …) |
feats |
False |
Map[string, string] |
Universal and custom features |