Soda

Soda

✔ An open-source, CLI tool and Python library for data quality testing
✔ Compatible with the Soda Checks Language (SodaCL)
✔ Enables data quality testing both in and out of your data pipelines and development workflows
✔ Integrated to allow a Soda scan in a data pipeline, or programmatic scans on a time-based schedule

Soda Core is a free, open-source, command-line tool and Python library that enables you to use the Soda Checks Language to turn user-defined input into aggregated SQL queries.

When it runs a scan on a dataset, Soda Core executes the checks to find invalid, missing, or unexpected data. When your Soda Checks fail, they surface the data that you defined as bad-quality.

Last Releases

  • v3.5.5
    What’s Changed Update README.md with launch banner by @santiviquez in #2292 Fix authentication inside Fabric Notebooks by @sdebruyn in #2299 Add dotenv to deps, fixes #2285 by @m1n0 in #2312… Read more: v3.5.5
  • v4.0.0b1
    v4.0.0b1   Source: https://github.com/sodadata/soda-core/releases/tag/v4.0.0b1
  • v3.5.4
    v3.5.4   Source: https://github.com/sodadata/soda-core/releases/tag/v3.5.4
  • SODA: A Data Quality Tool for Modern Pipelines

    Introduction SODA is an open-source data quality and observability platform designed for data engineers and analysts who need to ensure reliable, high-quality data in ETL workflows. It helps detect anomalies, enforce data quality rules, and provide insights into potential data issues before they impact downstream processes. By integrating with modern data warehouses, lakes, and pipelines,…

Recent Comments

No comments to show.