Extension Data Validation


Extension Basics

Title
Data Validation
Name
ckanext-validation
Type
Public extension
Description
Advanced data validation framework with tabular data schema validation and quality reporting capabilities.
CKAN versions
Download-Url (zip)
Download-Url commit date
2024-06-18
Url to repo
Category
Data Management & Quality


Background Infos

Description (long)
Show details

The Data Validation extension provides a comprehensive framework for validating tabular data within CKAN, ensuring data quality and schema compliance through automated validation processes. This powerful extension leverages the Frictionless Data specifications and Goodtables validation library to perform detailed schema validation, data type checking, constraint verification, and quality assessment on CSV, Excel, and other tabular formats. The system provides real-time validation during data upload with detailed error reporting, validation summaries, and actionable feedback for data publishers. Advanced features include custom validation rule configuration, scheduled validation jobs for existing datasets, validation history tracking, and integration with data harvesting workflows. The extension supports schema definition through JSON Table Schema, enabling precise data type specifications, constraint definitions, and relationship validations. Quality reports include statistical summaries, data profiling information, and visual representations of validation results through charts and dashboards. Administrative features provide validation rule management, batch validation operations, and comprehensive audit trails. Essential for data portals requiring quality assurance, government agencies publishing statistical data, research institutions maintaining data repositories, and organizations with strict data governance requirements where data accuracy, consistency, and reliability are critical for decision-making and public trust.

Version
Latest
Version release date
2024-06-18
Contact name
Datopian Team
Contakt email
Contact Url
(not set)


Installation Guide

Configuration hints

Requires Goodtables library for tabular data validation

Plugins to configure (ckan.ini)
validation
CKAN Settings (ckan.ini)
# ckanext.validation.run_on_create = true # ckanext.validation.run_on_update = true # ckanext.validation.formats = csv xls xlsx # ckanext.validation.async_validation = true # ckanext.validation.default_validation_options = {} # ckanext.validation.store_results = true
DB migration to be executed
validation initdb
<< back to Extensions