Oracle entity in Table 2 VS. Oracle keywords in Table 7 #10

lifelongeek · 2021-06-21T02:19:09Z

I am trying to reproduce the ROUGE scores on CNNDM with the "oracle keywords" setting in Table 7. The "oracle entity" setting in Table 2 sounds similar to "oracle keywords" in Table 7, but the ROUGE scores are very different. Could you explain how these settings differ?

jxhe · 2021-06-21T03:18:17Z

Hi,

"Oracle entity" in Table 2 uses only the entity words in the ground-truth target, while "oracle keywords" contains non-entity words as well, as described in the paper.

lifelongeek · 2021-06-21T13:21:53Z

Thanks for the clarification.
I have some follow-up questions.

Does example_dataset/test.oraclewordns correspond to "oracle keywords"?
Do the "longest sub-sequences" used for training the automatic keyword extractor correspond to "oracle keywords"?

jxhe · 2021-07-28T08:22:24Z

Yes, example_dataset/test.oraclewordns contains the "oracle keywords".
The keywords used for training the automatic keyword extractor are "oracle keywords". Strictly speaking, though, "oracle keywords" are not exactly the "longest sub-sequences": as described in your screenshot, "we remove duplicate words and stop words and keep the remaining tokens as keywords".
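
For readers who want to see the filtering step concretely, here is a minimal Python sketch. It is not the repository's implementation. It assumes a single longest common subsequence between source tokens and summary tokens as the matching step, which simplifies the "longest sub-sequences" mentioned above. The function names and the tiny stop-word list are hypothetical and chosen only for illustration.

```python
# Illustrative only: names and the stop-word list are assumptions,
# not taken from ctrl-sum.
STOP_WORDS = frozenset({"the", "a", "an", "of", "in", "on", "to", "and", "is", "was"})


def lcs_tokens(a, b):
    """Return one longest common subsequence of two token lists."""
    n, m = len(a), len(b)
    # dp[i][j] = length of the LCS of a[i:] and b[j:]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    # Walk the table forward to recover the matched tokens in order.
    out = []
    i = j = 0
    while i < n and j < m:
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out


def oracle_keywords(source_tokens, summary_tokens, stop_words=STOP_WORDS):
    """Keep matched tokens, dropping stop words and duplicates (order preserved)."""
    seen = set()
    keywords = []
    for tok in lcs_tokens(source_tokens, summary_tokens):
        key = tok.lower()
        if key in stop_words or key in seen:
            continue
        seen.add(key)
        keywords.append(tok)
    return keywords
```

With the source "the cat sat on the mat while the dog slept" and the summary "the cat and the dog slept", the matched subsequence is "the cat the dog slept", and the remaining keywords are "cat", "dog", "slept". An entity-only variant would keep only the named entities among the summary tokens, which is why the two settings can give different ROUGE scores.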

Wendy-Xiao · 2022-07-04T23:36:34Z

Hi,

I have a quick follow-up question on this point. For "oracle entities", which NER tool did you use for extracting oracle entities from the reference summary?

Thanks a lot!!

jxhe · 2022-07-05T13:28:09Z

Hi, we use stanza for NER. You may refer to some examples in ctrl-sum/scripts/preprocess.py (line 890 at commit 6468bea):

def entity_random(split, src, datadir, nsample=100, human_study=False):
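
The body of entity_random is not shown in this thread, so the following is only a rough sketch of how entities might be pulled from a reference summary with stanza, not the repository's code. The helper names are hypothetical, and dropping repeated entity strings is an assumption rather than something confirmed above. The stanza calls used (stanza.download, stanza.Pipeline with the tokenize and ner processors, and doc.ents) are part of stanza's public API.

```python
def build_ner_pipeline(lang="en"):
    """Create a stanza pipeline with tokenization and NER (hypothetical helper)."""
    import stanza  # imported lazily so extract_entities can be used without stanza installed

    stanza.download(lang)  # fetches the models on first use
    return stanza.Pipeline(lang=lang, processors="tokenize,ner")


def extract_entities(nlp, text):
    """Return entity strings found in `text`, in order of first appearance.

    `nlp` is any callable returning an object with an `ents` attribute whose
    items expose `.text`, as a stanza Pipeline does.
    Deduplication is an assumption made for this sketch.
    """
    doc = nlp(text)
    seen = set()
    entities = []
    for ent in doc.ents:
        if ent.text not in seen:
            seen.add(ent.text)
            entities.append(ent.text)
    return entities


# Example usage (requires stanza and a model download):
#   nlp = build_ner_pipeline("en")
#   extract_entities(nlp, "Barack Obama visited Paris in 2015.")
```

Passing the pipeline in as an argument keeps model loading separate from extraction, so the same pipeline can be reused across all reference summaries.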

lifelongeek added the question Further information is requested label Jun 21, 2021
