Skip to content

[FEATURE]: Add summary statistics as an additional output of quality checking #376

@mwojtyczka

Description

@mwojtyczka

Is there an existing issue for this?

  • I have searched the existing issues

Problem statement

Currently DQX outputs detailed logs in reporting columns for each row that has issues.
Users are requesting also to provide some summary (e.g. count of checks on error, warning, etc).

The output should be centralized in one table for alerting.

Proposed Solution

Add a method to retrieve quality checking summary stats on demand. They cannot be stored in the resulting DataFrame, it can be a separate object returned from quality checking.

Additional Context

No response

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions