LindgeW/Ontonotes5.0-Chinese-NER


Ontonotes5.0 Chinese NER dataset

Notes:

Statistics

#doc   #sent   #word
1911   48K     988K

Genre   #train   #dev   #test
BC      7862     2239   885
BN      8149     949    985
MZ      3988     362    451
NW      3569     425    516
TC      7510     1129   643
WB      6479     1113   813
#sum    37557    6217   4293
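
As a quick sanity check on the table above, the per-genre counts can be summed per split and compared against the #sum row. This is only a sketch: the `COUNTS` dictionary and the `split_totals` helper are names invented here, and the counts are assumed to be sentence counts because their grand total (48,067) matches the 48K figure in the #sent column.

```python
# Per-genre counts copied from the table above, as (train, dev, test).
# Assumption: these are sentence counts, since they add up to about 48K.
COUNTS = {
    "BC": (7862, 2239, 885),
    "BN": (8149, 949, 985),
    "MZ": (3988, 362, 451),
    "NW": (3569, 425, 516),
    "TC": (7510, 1129, 643),
    "WB": (6479, 1113, 813),
}


def split_totals(counts):
    """Sum each split column across all genres."""
    train, dev, test = (sum(column) for column in zip(*counts.values()))
    return {"train": train, "dev": dev, "test": test}


totals = split_totals(COUNTS)
grand_total = sum(totals.values())

# Print each split's size and its share of the whole dataset.
for split, n in totals.items():
    print(f"{split}: {n} ({n / grand_total:.1%})")
```

The computed totals agree with the #sum row, so the table is internally consistent.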

Description

GENRE = {bc bn mz nw tc wb} (broadcast conversation, broadcast news, magazine, newswire, telephone conversation, web data)
SPLIT = {train dev test}

{GENRE}.{SPLIT}.id: document IDs
{GENRE}.{SPLIT}.char: character-level annotated data
{GENRE}.{SPLIT}.txt: word-level annotated data
{GENRE}.{SPLIT}.raw.txt: raw sentence-level data
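
The naming scheme above can be expanded into a concrete list of file names, which is handy for checking that a local copy is complete. The following is a minimal sketch: `expected_files` and `missing_files` are hypothetical helpers, not part of this repository, and the sketch assumes all files sit directly in one directory (`root`), which the description does not state.

```python
from itertools import product
from pathlib import Path

GENRES = ("bc", "bn", "mz", "nw", "tc", "wb")
SPLITS = ("train", "dev", "test")
# File suffixes taken from the description above.
SUFFIXES = ("id", "char", "txt", "raw.txt")


def expected_files(root="."):
    """Return every {GENRE}.{SPLIT}.{suffix} path under root (assumed flat layout)."""
    root = Path(root)
    return [
        root / f"{genre}.{split}.{suffix}"
        for genre, split, suffix in product(GENRES, SPLITS, SUFFIXES)
    ]


def missing_files(root="."):
    """Return the expected paths that do not exist on disk."""
    return [path for path in expected_files(root) if not path.is_file()]


if __name__ == "__main__":
    for path in missing_files("."):
        print(f"missing: {path}")
```

With 6 genres, 3 splits, and 4 file types, the scheme yields 72 file names in total.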
