Site icon Roel Peters

Data Testing

The truth is that you rarely completely control how or what data is collected. That’s why you should evaluate your data for its quality. There are many dimensions to data quality. The list will be longer or shorter, depending on who you ask.

To mitigate problems with data not behaving as expected, data engineers implement data tests throughout an organization’s data pipelines. Data tests encode your knowledge about assumptions that need to hold for data to be processed as planned. When a test detects issues with the data, a specific action needs to be taken. The data could be marked, processed differently, stored for later processing, or trigger a notification asking for manual intervention.

Different types of data testing

There are multiple types of tests, and they can be implemented and executed in multiple phases of the DataOps development process.

Exit mobile version