-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Additional Entity Types & Models #7
Comments
The following suggestions come courtesy of Pete Smith:
|
Bit of advice: It seems very unlikely to me that a model you train will be able to tell the difference between Consider that the model has literally zero knowledge about the world and is essentially operating on features extracted from the text only. As an example, IBA is a Similar comments apply to the difference between In comparison, Government department also seems like a reasonable NER label, I think. |
At some point the model is able to memorize things, and even if it has zero world knowledge, seeing enough data points is often good enough. For instance, it can remember that the word Treaty is a text, and the word organization is an organization, then learn how to use both of them. |
I would be pleased to volunteer |
The prototype Blackstone model,
en_blackstone_proto
, was trained to detect six entity types that apply generally across legal texts (in the sense that they're not specific to any legal sub-discipline, such as criminal law, company law etc).If you have any ideas for additional entity types that we should consider adding to future models, this is the place to add them.
Preferred method for setting out your ideas
For the sake of consistency, please add comments to this issue in the following format:
For example, if you're submitting an idea for a new entity type that you think would apply generally across legal text (i.e. something that is not specific to any sub-discipline of law) you're comment should look like this:
If, on the other hand, you're submitting an idea for a new entity type that you think applies to a particular sub-discipline, you're comment should look like this:
The text was updated successfully, but these errors were encountered: