Now the entity occurrences are represented as ent
nodes and these nodes have the features eid
and kind
for entity ID and entity kind. There are also entity
nodes that collect entity occurrences with the same eid
and kind
.
The edge feature eoccs
links entity
nodes to their occurrences, the ent
nodes.
So, multiword entity occurrences now corresponds to a single ent
node, linked to the words the entity occupies.
The ent
and entity
nodes are added to the original dataset. The version of the dataset is still 1.0e.
Note that most tutorials work with version 1.0, but not version 1.0e.
If you need to work with earlier versions of the missieven, specify the version in the use command, like so:
A = use("CLARIAH/wp6-missieven", version="1.0")
This works best if you have installed Text-Fabric as
pip install --upgrade 'text-fabric[all]'
because then TF can use the GitHub API to fetch the data.
If you only work with the latest version (1.0e
) this is not needed.