Installation:
pip install ckanext-harvest-basket
Optional CSW support:
pip install ckanext-harvest-basket[csw]
Development installation:
git clone https://github.com/DataShades/ckanext-harvest-basket.git
cd ckanext-harvest-basket
pip install -e ‘.[dev]’
Add to ckan.plugins (main plugin + desired harvesters):
ckan.plugins = … harvest_basket arcgis_harvester socrata_harvester dkan_harvester junar_harvester ods_harvester custom_ckan_harvester basket_dcat_json_harvester basket_csw_harvester basket_csiro_harvester …
Available harvesters:
- dkan_harvester: DKAN platform harvester
- junar_harvester: Junar platform harvester
- socrata_harvester: Socrata platform harvester
- arcgis_harvester: ArcGIS platform harvester
- custom_ckan_harvester: Custom CKAN harvester
- ods_harvester: OpenDataSoft (ODS) harvester
- basket_dcat_json_harvester: DCAT JSON harvester
- basket_csw_harvester: CSW (Catalogue Service for the Web) harvester (requires OWSLib)
- basket_csiro_harvester: CSIRO platform harvester
Requirements:
- ckanext-transmute>=1.7.0,<2 (for dataset transformation)
- html2markdown>=0.1.7,<1
- geojson>=2.5.0,<4
- jsonschema>=3.0.0,<4
- OWSLib>=0.31.0,<1 (optional, for CSW harvester)
Features:
- Integration with ckanext-transmute for dataset transformation during harvest
- Source checkup preview: tests connectivity and harvests sample dataset
- Anonymous user restrictions for harvester pages
- Supports harvest source configuration for transformations
Documentation: https://datashades.github.io/ckanext-harvest-basket/
Testing: pytest –ckan-ini=test.ini