v0.13.0
Changelog
Added
- data.set_processing(...) now exposes an autocast parameter to disable or tweak the automatic casting of the tensor during the processing. Autocasting should result in a slight speedup, but may lead to numerical instability.
- Use torch.inference_mode to disable view tracking and version counter bumps during inference.
- Added a new NER pipeline for suicide attempt detection
- Added date cues (regular expression matches that contributed to a date being detected) under the extension ent._.date_cues
- Added tables processing in eds.measurement
- Added 'all' as possible input in eds.measurement measurements config
- Added new units in eds.measurement
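To illustrate what "date cues" means in the entry above, here is a minimal sketch using only Python's standard `re` module. It is not EDS-NLP's implementation: the pattern, the `find_dates_with_cues` helper and the dictionary layout are all hypothetical, and the actual structure stored under ent._.date_cues may differ.

```python
import re

# Hypothetical pattern: each named group is one "cue" that can
# contribute to a date being detected. Not the library's real patterns.
DATE_PATTERN = re.compile(
    r"(?P<day>\d{1,2})[/-](?P<month>\d{1,2})[/-](?P<year>\d{4})"
)


def find_dates_with_cues(text):
    """Return (matched_text, cues) pairs for every date-like match.

    `cues` maps each named group that took part in the match
    to the substring it captured.
    """
    results = []
    for match in DATE_PATTERN.finditer(text):
        cues = {
            name: value
            for name, value in match.groupdict().items()
            if value is not None
        }
        results.append((match.group(0), cues))
    return results


dates = find_dates_with_cues("Admitted on 12/03/2021, discharged 15/03/2021.")
for matched_text, cues in dates:
    print(matched_text, cues)
```

Keeping the cues next to the match makes it possible to see afterwards which parts of the pattern fired, which is the kind of information the new extension is described as exposing.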
Changed
- Default to mixed precision inference
Fixed
- edsnlp.load("your/huggingface-model", install_dependencies=True) now correctly resolves the Python pip (especially on Colab) to auto-install the model dependencies
- We now better handle empty documents in the eds.transformer, eds.text_cnn and eds.ner_crf components
- Support mixed precision in eds.text_cnn and eds.ner_crf components
components - Support pre-quantization (<4.30) transformers versions
- Verify that all batches are non-empty
- Fix span_context_getter for context_words = 0, context_sents > 2 and support asymmetric contexts
- Don't split sentences on rare unicode symbols
- Better detect abbreviations, like E.coli, now split as [E., coli] and not [E, ., coli]
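The abbreviation fix in the last entry can be illustrated with a small standalone sketch. This is not the EDS-NLP tokenizer: the regular expression and the `tokenize` helper are hypothetical, and the rule used here (a single capital letter followed by a period and then a letter) is only an assumption chosen to reproduce the E.coli example.

```python
import re

# Hypothetical rule, not the library's: a single capital letter followed by
# a period and immediately by another letter is treated as an abbreviation,
# so the period stays attached to the letter.
TOKEN_PATTERN = re.compile(
    r"\b[A-Z]\.(?=[A-Za-z])"  # abbreviation such as "E." in "E.coli"
    r"|\w+"                   # ordinary word
    r"|[^\w\s]"               # any other single punctuation mark
)


def tokenize(text):
    """Split text into tokens, keeping abbreviation periods attached."""
    return TOKEN_PATTERN.findall(text)


tokens = tokenize("E.coli was found.")
print(tokens)  # ['E.', 'coli', 'was', 'found', '.']
```

The lookahead is what separates the two cases: the period in "E.coli" is followed by a letter and stays with "E", while the sentence-final period is not and remains a token of its own.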
What's Changed
- Various ml fixes by @percevalw in #303
- TS by @aricohen93 in #269
- date cues by @cvinot in #265
- Fix fast inference by @percevalw in #305
- Fix typo in diabetes patterns by @isabelbt in #306
- Fix span context getter by @aricohen93 in #307
- Fix sentences by @percevalw in #310
- chore: bump version to 0.13.0 by @percevalw in #312
New Contributors
Full Changelog: v0.12.3...v0.13.0