What is the configuration option {"harvested_portal": "abcde"} good for?
Is there any benefit to have this?
Why is it required? Unfortunately it is not documented, and the source code didn't give me any deeper insight.
Is it possible to make it optional?
Or could it be defined in its own backend field instead of in the configuration field?
One has to define it manually, which increases harvester definition complexity, especially when creating harvesters via API or CLI.
Currently the harvester adds a field metadata_harvested_portal to all harvested datasets containing the value of harvested_portal. This is used to identify datasets that are no longer provided to the harvester and should be deleted after a new harvesting run. This could be implemented differently to make the config optional, but it would require a few changes in the code.
However, it currently needs a unique string value, and usually the name of the harvested portal is used here.
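To illustrate the cleanup described above, here is a minimal sketch in plain Python. It is not the harvester's actual code: the function name find_stale_dataset_ids and the dictionary layout are hypothetical, and only the field name metadata_harvested_portal is taken from this thread.

```python
def find_stale_dataset_ids(local_datasets, remote_ids, portal):
    """Return IDs of local datasets that belong to `portal` but were
    not delivered in the latest harvesting run.

    Hypothetical helper: `local_datasets` is assumed to be a list of
    dicts carrying an "id" and the "metadata_harvested_portal" field.
    """
    remote = set(remote_ids)
    return [
        ds["id"]
        for ds in local_datasets
        # Only datasets tagged with this portal are deletion candidates.
        if ds.get("metadata_harvested_portal") == portal
        and ds["id"] not in remote
    ]


local = [
    {"id": "a", "metadata_harvested_portal": "abcde"},
    {"id": "b", "metadata_harvested_portal": "abcde"},
    {"id": "c", "metadata_harvested_portal": "other"},
]
# The source now provides only dataset "a".
stale = find_stale_dataset_ids(local, ["a"], "abcde")
```

In this sketch, dataset "c" is never a candidate because it carries a different portal value, which is why the value has to be unique per harvested source.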
When reading the settings from the configuration, why not use the harvester UUID as the relation identifier by default?
Instead, I propose defining metadata_harvested_portal only when you actually need to identify the records in special cases, e.g. when collecting data from a single source via multiple endpoints (e.g. using multiple harvesters in parallel).
This will keep the same behaviour, but without the need to specify this setting manually in "normal" cases.
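The proposed default could look roughly like the following sketch. It assumes the harvester configuration arrives as a JSON string and that the harvester's UUID is available as a plain string; resolve_portal_identifier is a hypothetical name, not an existing function of the extension.

```python
import json


def resolve_portal_identifier(config_json, harvester_id):
    """Return the explicit harvested_portal value if one is configured,
    otherwise fall back to the harvester's own UUID.

    Hypothetical helper illustrating the proposal in this thread.
    """
    config = json.loads(config_json) if config_json else {}
    # An explicit setting wins, so several harvesters can still share
    # one identifier when they collect from the same source.
    return config.get("harvested_portal") or harvester_id


explicit = resolve_portal_identifier('{"harvested_portal": "abcde"}', "1234-uuid")
fallback = resolve_portal_identifier("{}", "1234-uuid")
```

With this fallback, existing harvesters that already set harvested_portal keep their value, while new harvesters created via API or CLI would not need the setting at all.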