-
Notifications
You must be signed in to change notification settings - Fork 7
/
ner_eksperimenti
141 lines (127 loc) · 5.48 KB
/
ner_eksperimenti
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
Oktobris:
Entity P R F1 TP FP FN
location 0.7333 0.7857 0.7586 11 4 3
organization 0.6383 0.7317 0.6818 30 17 11
person 0.9722 0.8974 0.9333 35 1 4
product 1.0000 0.0769 0.1429 1 0 12
profession 0.6522 0.3571 0.4615 15 8 27
sum 0.9167 0.9167 0.9167 22 2 2
time 1.0000 0.5313 0.6939 17 0 15
Totals 0.7988 0.6268 0.7024 131 33 78
Entity P R F1 TP FP FN
location 0.7857 0.7857 0.7857 11 3 3
organization 0.6596 0.7561 0.7045 31 16 10
person 0.9722 0.8974 0.9333 35 1 4
product 1.0000 0.0769 0.1429 1 0 12
profession 0.6818 0.3571 0.4688 15 7 27
sum 0.9167 0.9167 0.9167 22 2 2
time 1.0000 0.5625 0.7200 18 0 14
Totals 0.8160 0.6364 0.7151 133 30 76
25.11.2013
Pielaboti treniņdati
10x_00
-----------
Totals 0.8137 0.7112 0.7590 2869 657 1165
event 0.3333 0.0417 0.0741 1 2 23
location 0.8056 0.8582 0.8311 829 200 137
media 0.9556 0.7167 0.8190 43 2 17
organization 0.7311 0.6197 0.6708 647 238 397
person 0.8563 0.7944 0.8242 429 72 111
product 0.4783 0.1000 0.1654 11 12 99
profession 0.6969 0.5174 0.5939 223 97 208
sum 0.9728 0.9158 0.9434 250 7 23
time 0.9417 0.8305 0.8826 436 27 89
5x_00
---------
Totals 0.7997 0.7045 0.7491 2842 712 1192
event 0.0000 0.0000 0.0000 0 1 13
location 0.7759 0.8602 0.8159 831 240 135
media 0.9500 0.6333 0.7600 38 2 22
organization 0.7448 0.6456 0.6916 674 231 370
person 0.8193 0.7389 0.7770 399 88 141
product 0.6667 0.0901 0.1587 10 5 101
profession 0.6605 0.4965 0.5669 214 110 217
sum 0.9764 0.9084 0.9412 248 6 25
time 0.9365 0.8152 0.8717 428 29 97
5x_01 + DB gazetieri
------
Totals 0.8033 0.7107 0.7542 2867 702 1167
event 0.0000 0.0000 0.0000 0 1 13
location 0.7826 0.8644 0.8214 835 232 131
media 0.9500 0.6333 0.7600 38 2 22
organization 0.7555 0.6571 0.7029 686 222 358
person 0.8269 0.7519 0.7876 406 85 134
product 0.5714 0.0930 0.1600 12 9 117
profession 0.6485 0.4965 0.5624 214 116 217
sum 0.9724 0.9048 0.9374 247 7 26
time 0.9387 0.8171 0.8737 429 28 96
5x_02 + Vārdšķira + Skaitlis + Locījums + LETA_lemma
------
Totals 0.8110 0.7172 0.7612 2893 674 1141
event 0.3333 0.0357 0.0645 1 2 27
location 0.7921 0.8716 0.8300 842 221 124
media 0.9512 0.6500 0.7723 39 2 21
organization 0.7582 0.6456 0.6974 674 215 370
person 0.8384 0.7593 0.7969 410 79 130
product 0.5789 0.0991 0.1692 11 8 100
profession 0.6817 0.5267 0.5942 227 106 204
sum 0.9728 0.9158 0.9434 250 7 23
time 0.9281 0.8362 0.8798 439 34 86
5x_03 + pārvaldes struktūras_GAZ + surnames atsevišķa kategorija (bet taču katram failam taisa citu)
------
Totals 0.8091 0.7164 0.7599 2890 682 1144
event 0.3333 0.0357 0.0645 1 2 27
location 0.7917 0.8696 0.8288 840 221 126
media 0.9512 0.6500 0.7723 39 2 21
organization 0.7514 0.6456 0.6945 674 223 370
person 0.8367 0.7593 0.7961 410 80 130
product 0.5500 0.0853 0.1477 11 9 118
profession 0.6848 0.5244 0.5940 226 104 205
sum 0.9728 0.9158 0.9434 250 7 23
time 0.9281 0.8362 0.8798 439 34 86
5x_04 - locījums, -dzimte
------
Totals 0.8065 0.7087 0.7545 2859 686 1175
event 0.5000 0.0357 0.0667 1 1 27
location 0.7807 0.8696 0.8227 840 236 126
media 0.9500 0.6333 0.7600 38 2 22
organization 0.7615 0.6360 0.6931 664 208 380
person 0.8214 0.7407 0.7790 400 87 140
product 0.6250 0.0775 0.1379 10 6 119
profession 0.6677 0.5035 0.5741 217 108 214
sum 0.9728 0.9158 0.9434 250 7 23
time 0.9340 0.8362 0.8824 439 31 86
5x_05 + gazetieru sakārtošana, lemmatizēšana (org_el, ..)
------
Totals 0.8067 0.7097 0.7551 2863 686 1171
event 0.5000 0.0435 0.0800 2 2 44
location 0.7932 0.8654 0.8277 836 218 130
media 0.9512 0.6500 0.7723 39 2 21
organization 0.7466 0.6264 0.6813 654 222 390
person 0.8433 0.7574 0.7980 409 76 131
product 0.4000 0.1081 0.1702 12 18 99
profession 0.6758 0.5128 0.5831 221 106 210
sum 0.9728 0.9158 0.9434 250 7 23
time 0.9263 0.8381 0.8800 440 35 85
5x2_06 - clean lemma capitalization constraint <- neko nedot
------
Totals 0.7986 0.6610 0.7233 468 118 240
location 0.7914 0.8462 0.8178 110 29 20
media 1.0000 1.0000 1.0000 1 0 0
organization 0.7617 0.5365 0.6296 147 46 127
person 0.8182 0.8182 0.8182 27 6 6
product 0.2353 0.1053 0.1455 4 13 34
profession 0.7111 0.6667 0.6882 32 13 16
sum 0.9792 0.9216 0.9495 47 1 4
time 0.9091 0.8130 0.8584 100 10 23
+occurancePattern, sentece begin <-neko nedot
------
Totals 0.7986 0.6610 0.7233 468 118 240
location 0.7914 0.8462 0.8178 110 29 20
media 1.0000 1.0000 1.0000 1 0 0
organization 0.7617 0.5365 0.6296 147 46 127
person 0.8182 0.8182 0.8182 27 6 6
product 0.2353 0.1053 0.1455 4 13 34
profession 0.7111 0.6667 0.6882 32 13 16
sum 0.9792 0.9216 0.9495 47 1 4
time 0.9091 0.8130 0.8584 100 10 23