Catalogs management

Emanuele Tajariol edited this page Nov 17, 2017 · 24 revisions

Testing existing PA catalogs during ongoing development

Below are some examples of relevant catalogs to test during ongoing development:

1. CKAN catalogs not compliant with the DCAT-AP_IT profile:

  • Bari http://opendata.comune.bari.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"default_extras":{
          "source_catalog_title":"Comune di Bari",
          "source_catalog_description":"Catalogo OpenData del Comune di Bari",  
          "source_catalog_homepage":"http://opendata.comune.bari.it/", 
          "source_catalog_language":"ITA", 
          "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Basilicata http://dati.regione.basilicata.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"default_extras":{
          "source_catalog_title":"Regione Basilicata",
          "source_catalog_description":"Catalogo OpenData della Regione Basilicata",  
          "source_catalog_homepage":"http://dati.regione.basilicata.it/", 
          "source_catalog_language":"ITA", 
          "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Umbria http://dati.umbria.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"clean_tags":true,
       "default_extras":{
           "source_catalog_title":"Regione Umbria",
           "source_catalog_description":"Catalogo OpenData della Regione Umbria",  
           "source_catalog_homepage":"http://dati.umbria.it/", 
           "source_catalog_language":"ITA", 
           "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Marche http://goodpa.regione.marche.it/

    • This catalog does not organize its datasets into groups.
  • Veneto http://dati.veneto.it/

    • Harvester: "CKAN harvester for DCATAPIT"

      • Groups are mapped to themes
    • Configuration:

      {"clean_tags":true,
       "default_extras":{
           "source_catalog_title":"Regione Veneto",
           "source_catalog_description":"Catalogo OpenData della Regione Veneto",
           "source_catalog_homepage":"http://dati.veneto.it/", 
           "source_catalog_language":"ITA", 
           "source_catalog_modified":"2017-01-01T00:00:00"}}
      

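The configurations above all share the same shape, so they can be generated rather than typed by hand. The following is a minimal sketch, not part of the harvester itself: `build_harvest_config` is a hypothetical helper name, and its default values for language and modification date are simply the ones used in the examples above.

```python
import json


def build_harvest_config(title, description, homepage,
                         language="ITA",
                         modified="2017-01-01T00:00:00",
                         clean_tags=False):
    """Return a harvest source configuration as a JSON string.

    Hypothetical helper: it only mirrors the keys used in the
    configurations listed on this page.
    """
    config = {}
    # "clean_tags" is only emitted when requested, as in the Umbria
    # and Veneto examples.
    if clean_tags:
        config["clean_tags"] = True
    config["default_extras"] = {
        "source_catalog_title": title,
        "source_catalog_description": description,
        "source_catalog_homepage": homepage,
        "source_catalog_language": language,
        "source_catalog_modified": modified,
    }
    return json.dumps(config, indent=2, ensure_ascii=False)


veneto_config = build_harvest_config(
    "Regione Veneto",
    "Catalogo OpenData della Regione Veneto",
    "http://dati.veneto.it/",
    clean_tags=True,
)
print(veneto_config)
```

The printed string can be pasted into the Configuration field of the harvest source form.
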
2. CKAN catalogs not compliant with the DCAT-AP_IT profile that expose a JSON file (data.json, harvested with the DCAT JSON harvester):

  • Genova http://dati.comune.genova.it/data.json

  • Lombardia https://www.dati.lombardia.it/data.json (there are blockers during harvesting, and the datasets are still not compliant with the DCAT-AP_IT profile; work in progress)
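
Before configuring a harvest source for one of these files, it can help to look at what the data.json actually contains. The sketch below is an assumption-laden illustration: it assumes the file follows the usual data.json layout (either a top-level object with a `dataset` list, or a bare list of datasets, each with `title` and `keyword` fields). The function names and the sample record are made up for this example.

```python
import json
import urllib.request


def load_catalog(url, timeout=30):
    """Download and parse a data.json document (not called in this sketch)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


def iter_datasets(catalog):
    """Yield dataset dicts from a parsed data.json document.

    Accepts both the object form ({"dataset": [...]}) and the bare
    list form; which one a given catalog uses is an assumption to check.
    """
    if isinstance(catalog, dict):
        datasets = catalog.get("dataset", [])
    else:
        datasets = catalog
    for dataset in datasets:
        if isinstance(dataset, dict):
            yield dataset


def summarize(catalog):
    """Return title and keywords for every dataset in the catalog."""
    return [
        {
            "title": dataset.get("title", ""),
            "keywords": list(dataset.get("keyword") or []),
        }
        for dataset in iter_datasets(catalog)
    ]


# Made-up sample data, only to show the expected shape.
SAMPLE = {
    "dataset": [
        {"title": "Example dataset A", "keyword": ["ambiente", "acque"]},
        {"title": "Example dataset B"},
    ]
}

for entry in summarize(SAMPLE):
    print(entry["title"], entry["keywords"])
```

To inspect a real catalog, pass the result of `load_catalog("http://dati.comune.genova.it/data.json")` to `summarize` instead of `SAMPLE`.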

3. CKAN catalogs with several problems on Tags:

  • Regione Sardegna http://opendata.regione.sardegna.it/ (several tag errors; of 801 datasets, only about half can currently be harvested because of the existing tag validation issues)

    • URL: http://dati.regione.sardegna.it/

    • Harvester: "CKAN harvester for DCATAPIT"

    • Configuration:

      {"clean_tags":true,
       "default_extras":{
           "source_catalog_title":"Regione Sardegna",
           "source_catalog_description":"Catalogo OpenData della Regione Sardegna", 
           "source_catalog_homepage":"http://opendata.regione.sardegna.it/", 
           "source_catalog_language":"ITA", 
           "source_catalog_modified":"2017-01-01T00:00:00"}}
      
  • Lombardia (https://www.dati.lombardia.it/data.json)

The affected tags are mainly on geospatial datasets (the same problem occurs for Sardegna, Bolzano and Trentino).
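
To find out which tags are likely to fail validation before running a harvest, a small pre-check can be run on the tag list. This is a rough sketch and not the validation code used by CKAN or by the `clean_tags` option: the length limits (2 to 100 characters) and the allowed character set (alphanumerics, spaces, `-`, `_`, `.`) are assumptions that should be checked against the target CKAN instance, and both function names are hypothetical.

```python
import re

# Assumed limits and character set; verify against the target CKAN instance.
MIN_TAG_LENGTH = 2
MAX_TAG_LENGTH = 100
ALLOWED_TAG = re.compile(r"[\w \-.]*")
DISALLOWED_CHARS = re.compile(r"[^\w \-.]")


def tag_problems(tag):
    """Return a list of reasons why a tag would likely be rejected."""
    problems = []
    if len(tag) < MIN_TAG_LENGTH:
        problems.append("too short")
    if len(tag) > MAX_TAG_LENGTH:
        problems.append("too long")
    if not ALLOWED_TAG.fullmatch(tag):
        problems.append("invalid characters")
    return problems


def clean_tag(tag):
    """Drop disallowed characters and truncate the tag.

    Only an approximation of what a tag cleanup step might do.
    """
    cleaned = DISALLOWED_CHARS.sub("", tag.strip())
    return cleaned[:MAX_TAG_LENGTH]


# Made-up tags in the style often found on geospatial datasets.
for tag in ["idrografia", "acque/fiumi", "uso del suolo (2008)"]:
    print(repr(tag), tag_problems(tag), repr(clean_tag(tag)))
```

Running the check over the tags of a partially harvested catalog gives a quick estimate of how many datasets are blocked by tag validation alone.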

4. Good RDF catalogs:

Complete list of Harvest Sources

HERE is a complete list of harvest sources useful for testing.