Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle "strange" expressions (802.11, P=NP,...) #118

Open
yhamoudi opened this issue Feb 15, 2015 · 1 comment
Open

Handle "strange" expressions (802.11, P=NP,...) #118

yhamoudi opened this issue Feb 15, 2015 · 1 comment

Comments

@yhamoudi
Copy link
Member

Questions like What is 802.11 or What is P=NP? are poorly parsed because they contain expressions like 802.11 or P=NP.

Possible solution for such cases: apply the algorithm used with quotations:

  • identify "strange" expressions (an expression = a sequence of letters without space, "strange" = contain a strange symbol : ., =, ...). Ex: 802.11, P=NP,... (be careful: do not take P=NP?)
  • replace them by a random string
  • parse with the stanford parser
  • replace the random string by the initial word

If someone wants to implement this, please do it in branch reverse_predicates in file preprocessingMerge.py.

@yhamoudi yhamoudi changed the title Handle "strange" expressions Handle "strange" expressions (802.11, P=NP,...) Feb 16, 2015
@Ezibenroc
Copy link
Member

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants