
Very high false coding rate #52

Open
ahalterman opened this issue Oct 29, 2018 · 7 comments
Labels
critical must address before program functions

Comments

@ahalterman
Member

(Just making an issue for what @philip-schrodt has reported elsewhere so we can consolidate discussion):

UniversalPetrarch is enormously overproducing events. Not sure what the source of the false positives is. E.g.:

Records evaluated:    1018
Correct events:        565    44.21%
Uncoded events:        713    55.79%
Extra events:         3212   315.52%
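For reference, the percentages above appear to use two different denominators: the correct and uncoded rates are relative to the 1,278 annotated GSR events (565 + 713), while the extra-event rate is relative to the 1,018 records evaluated. A minimal sketch reproducing the arithmetic (variable names are illustrative, not taken from the validation script):

```python
records = 1018    # records evaluated
correct = 565     # coded events that match a gold-standard (GSR) event
uncoded = 713     # GSR events the coder missed
extra   = 3212    # coded events with no GSR counterpart

gsr_total   = correct + uncoded              # 1278 annotated events
correct_pct = 100 * correct / gsr_total      # -> 44.21
uncoded_pct = 100 * uncoded / gsr_total      # -> 55.79
extra_pct   = 100 * extra / records          # -> 315.52

print(f"Correct events: {correct:6d}  {correct_pct:6.2f}%")
print(f"Uncoded events: {uncoded:6d}  {uncoded_pct:6.2f}%")
print(f"Extra events:   {extra:6d}  {extra_pct:6.2f}%")
```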
@PTB-OEDA
Member

PTB-OEDA commented Oct 30, 2018 via email

@ahalterman
Member Author

Any update on this? This is one of the biggest obstacles to using UniversalPetrarch in production.

@PTB-OEDA
Member

PTB-OEDA commented Nov 5, 2018 via email

@ahalterman
Member Author

From talking with @JingL1014, it sounds like she has a couple of theories about what could be causing the high false positive rate:

  1. other true events in the sentences (we only annotated protest and assault events, so events beyond those will be seen as "extra events")
  2. the actor spans could be too large, leading to too many actors being coded, and therefore too many events.

There's also the possibility of some larger, unknown problem in how it handles the coding.

I think a few tests could help figure out which of these is going on:

  • what's the overproduction of protest or assault events only? (eliminate cases from theory 1)
  • how many of the extra events share an event code and just have different actors? (That would test theory 2)
  • look at the spans UP is returning (see Use GSR phrase info to improve UP #44) to make sure nothing wild is happening in the phrase extraction part.
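The first two diagnostics above could be sketched roughly as below. The event tuples and the CAMEO code prefixes ("14" for protest, "18" for assault) are assumptions about the annotated event classes, and the toy data is hypothetical, not actual UniversalPetrarch output:

```python
from collections import defaultdict

# Hypothetical "extra" events as (source actor, target actor, CAMEO code)
extra_events = [
    ("GOV", "OPP", "141"),   # protest
    ("GOV", "REB", "141"),   # same event code, different target actor
    ("USA", "IGO", "042"),   # consultation -- outside the annotated classes
]

# Theory 1: how much overproduction remains if we keep only the event
# classes that were actually annotated (protest and assault)?
annotated_only = [e for e in extra_events if e[2].startswith(("14", "18"))]

# Theory 2: how many extra events share an event code and differ only
# in their actors?
by_code = defaultdict(list)
for src, tgt, code in extra_events:
    by_code[code].append((src, tgt))
actor_only_dupes = {c: pairs for c, pairs in by_code.items() if len(pairs) > 1}

print(len(annotated_only))     # events within the annotated classes
print(actor_only_dupes)        # codes producing multiple actor pairs
```

If most extra events fall outside the protest/assault prefixes, theory 1 dominates; if many share a code and differ only in actors, that points at theory 2's oversized actor spans.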

@ahalterman
Member Author

Is there an update on this? Have you run those tests? From @philip-schrodt's reports, it sounds like some sentences are still producing many events, so something is still going wrong.

@PTB-OEDA
Member

PTB-OEDA commented Dec 7, 2018 via email

@philip-schrodt
Contributor

I added this in validation.py:

return_dict = petrarch_ud.do_coding(dict)
PETRwriter.write_events(return_dict, "evts.validation.txt")

where write_events() is the routine that petrarch_ud uses to write the final output file. For AFP_SPA_19940921.0205_7.0_0 it still produces 51 events in the regular output. I really need to be able to look at the code that Arizona is running, which is producing fewer events, since all I'm doing is making calls to the petrarch_ud code I've downloaded from the master branch; I downloaded a new copy this morning and confirmed it is still generating this behavior.

@ahalterman ahalterman added the critical must address before program functions label Jan 25, 2019