Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"date of birth of Barack Obama" #64

Closed
Tpt opened this issue Dec 11, 2014 · 3 comments
Closed

"date of birth of Barack Obama" #64

Tpt opened this issue Dec 11, 2014 · 3 comments
Labels

Comments

@Tpt
Copy link
Member

Tpt commented Dec 11, 2014

queries like: http://ppp.pony.ovh/?lang=en&q=date+of+birth+of+Barack+Obama

@yhamoudi yhamoudi changed the title Queries like "date of birth of Barack Obama" do not work "date of birth of Barack Obama" Jan 19, 2015
@yhamoudi
Copy link
Member

This kind of queries is really difficult because we have to perform multiwords expressions (MWE) recognition if we want to parse it correctly. (from the grammatical point of view, barack obama and birth have exactly the same role).

We didn't find efficient + easy to use tools that perform MWE recognition. However, it seems now that conceptnet could be used to perform this task. Indeed, if a MWE exists then it is a concept and it can be in conceptnet (first president is not in conceptnet, but prime minister or date of birth are \o/). I think we could investigate this a little more. If conceptnet is powerful enough, here is what we can do to deal with MWE:

  • use conceptnet to find all the MWE in the input sentence
  • concatenate each MWE into a single word (same thing than with quotations). Be sure that it has no effect on the stanford parsing
  • parse with the stanford parser, run the usual algorithm

@yhamoudi
Copy link
Member

yhamoudi commented Feb 7, 2015

The parsing of date of birth of Barack Obama is not good (same problem of "distributivity" than the example here: #90).

However, let's assume the parsing was good, we will be able to produce ((Barack Obama, birth,?),date,?). Then we could use the same collapsing trick than here: #85

(this solution is less powerful than detecting the multiwords expressions and merging them before using the stanford parser, but it is more likely to succeed)

@yhamoudi
Copy link
Member

See #123

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants