Data Quality Configuration
Note
To understand the concept of Data Quality, you can read a full description in the Concept Overview page here.
To enable Data Quality metrics on any of your Asset / Data Streams, you will need to configure them in the Data Quality Configuration.
This will start the Data Quality metrics which can then be seen in the Dashboard as well as be used in any custom Applications.
There are five built-in metrics for Asset / Data Stream pairs can be enabled;
Note
Developers can also create their own Data Metric Applications.
- Data Availability
- Outlier Detection
- Range Validation
- Duplication Detection
- Timestamp Anomalies.
You can also hover your mouse over the metric names in the column headers to get more details on it.
Overview
To go to the configuration overview screen, click on the Data Quality menu and then the Configuration tab.
Here you will see the current configurations per Asset that are enabled.
| Column | Description |
|---|---|
| Asset & Data Stream | The Asset / Data Stream pair |
| Active | Whether metric algorithms are done on this Asset / Data Stream pair |
| Other Values | The values available and their settings will depend on the Data Quality metric algorithm |
Basic Features
On the overview page you have a number of common features that help you quickly navigate, find and filter the assets of interest.
Once you have configured and filtered your list, the list as you see it will be utilized as the reference list on the Asset Details page which we discuss in detail in the View Asset Details page, allowing for easier navigation and scrolling.
There are some quick filters that can help filter the table you want to display.
Note
The table will only show Asset / Data Streams that match ALL the filter options only.
| Filter | Description |
|---|---|
| Assets | Select the Assets that you want to see. You can select one or more Assets. |
| Data. Streams | Select the Data Streams that you want to see. You can select one or more Data Streams. |
You can sort the table rows simply by clicking on the header name. Clicking it will cancel the other sorts in other headers and each click will rotate between ascending, descending and no sorting.
When dealing with large quantities of Assets, you can search for the Assets by name.
This is dynamic and will update the list as you type.
The search feature is not case-sensitive and will search for any matches anywhere in the Asset name, i.e. it does not have to be at the beginning of the name.
You can choose the height of the row to fit as many rows as possible on the screen, or space them out for easier readability.
Table Configurations
Asset-centric tables bring many more advanced features to the lists to give you greater control in setting up the right overview experience.
When you click on the table configuration button ;
You will then see the Configuration Table options bar open up on the right hand side.
Options
There are a number of options available to customize the Table.
When you are inside the column settings with a large number of columns, this feature is useful to find columns and to hide the hidden columns from the list to make it easier to see what is relevant.
When you are inside the column settings you can show or hide available columns to ensure you only see what you need to see.
When you are inside the column settings you can see an overview of the columns available and showing.
| Item | Description |
|---|---|
| Total | Total number of columns available (both active and hidden) |
| Frozen | Total number of columns that are in the left or right freeze positions out of a total number of columns that can be moved to the frozen sections |
| Locked | Total number of columns that can not be moved or edited |
| Active | Total number of columns that will be shown on the table |
| Hidden | Total number of columns that will not be shown on the table |
Configure Data Quality
You can insert new Asset / Data Stream pair metrics into the Data Quality table.
Note
When inserting, you do not need to enable all the in-built metric algorithms.
You will have the option to select the metrics required and set their parameters.
To start, click on the Configure Data Quality button.
Select Assets
A popup will appear showing you the different ways you can select a list of Assets to insert Data Quality metric algorithms.
By Type Asset
This option allows you to choose one Asset Type.
The Assets drop down list will then be populated with Assets associated with the Asset Type.
Note
All the Assets will all be automatically selected, though you have the option to only select some of the Assets.
When ready click the Next button.
You will then see the Configure Data Quality screen where you can edit the metric settings for each Asset / Data Stream pair.
Note
The settings will be applied to all Assets in the connection automatically.
You can not individually edit which Assets to insert Data Quality metrics.
Note
Turn on Live Metrics from the switch in the top right corner to see all cells which need to be filled in.
Turn on Filter Errors to only see the rows that have metric errors.
From a Connection
This option allows you to choose one Connection.
All the Asset / Data Stream pairs in this Connection will be automatically selected.
Note
There is no option to select individual Assets.
When ready click the Next button.
You will then see the Configure Data Quality screen where you can edit the metric settings for each Data Stream.
Note
Turn on Live Metrics from the switch in the top right corner to see all cells which need to be filled in.
Turn on Filter Errors to only see the rows that have metric errors.
From a CSV file
This option allows you to create a full list as a CSV file using Google Sheets or Microsoft Excel and upload it to the Kelvin Platform.
Note
This is the optimal solution when you are inserting a large number of Asset / Data Stream pair metrics.
When you click next, you will then be asked to upload the CSV file.
Note
You will also have the option to download a sample template.
After uploading the file will be validated.
When ready click the Next button.
You will then see the Configure Data Quality screen where you can edit the metric settings for each Asset / Data Stream pair.
Note
Turn on Live Metrics from the switch in the top right corner to see all cells which need to be filled in.
Turn on Filter Errors to only see the rows that have metric errors.
Start from scratch
This option allows you to start with a blank page and you can add the Asset / Data Stream pairs manually.
Note
Turn on Live Metrics from the switch in the top right corner to see all cells which need to be filled in.
Turn on Filter Errors to only see the rows that have metric errors.
When ready click the Next button.
You will then see the Configure Data Quality screen where you can add Asset / Data Stream pairs and edit the metric settings.
Group By
If you have either multiple Assets or multiple Data Streams that are the same, then a dropdown option will appear on the top.
And select whether to group by Assets or Data Streams.
Success
This speeds up the process by only needing to select the wanted items on multiple rows in a group.
Configure Data Quality
This screen may look slightly different depending on how you selected your Assets.
Note
Details of how the Configure Data Quality screen will look is explained above in the Select Assets chapter.
Here we will show you the common features.
Metrics
For each row you can setup the settings for each Data Metric algorithm.
| Header | Sub Header | Description |
|---|---|---|
| Edge Data Availability | Active | Enable or disable this Data Quality metric algorithm. |
| Expected # of Messages | Number of data messages expected from the Asset. | |
| Interval | Time window used to count the messages. | |
| Range Validation | Active | Enable or disable this Data Quality metric algorithm. |
| Min Threshold | Minimum allowed value for incoming data. | |
| Max Threshold | Maximum allowed value for incoming data. | |
| Outlier Detection | Active | Enable or disable this Data Quality metric algorithm. |
| Model | The outlier detection model used. Currently supports only the Z-Score model. | |
| Window Size | Number of samples used to calculate the Z-Score. | |
| Threshold | Deviation limit used to flag outliers. | |
| Duplication Detection | Active | Enable or disable this Data Quality metric algorithm. |
| Window Size | Number of recent messages checked for duplicates. | |
| Timestamp Anomaly Detection | Active | Enable or disable this Data Quality metric algorithm. |
| Allowed Deviation | Maximum time deviation allowed between consecutive messages. |
Search
CSV Files
You can download the current table as a file in a CSV format.
You can then edit all the Asset / Data Stream pairs and the associated Data Quality metric settings in your favorite spreadsheet editor such as Google Sheets or Microsoft Excel.
When finished you can upload it back into the table. While uploading, Kelvin will validate the file for errors in the CSV file format before processing.
Warning
The current values in the table will be overwritten.
Preview Charts
When Preview Charts toggle is on then when you click on a setting the preview will appear below the relevant Asset / Data Stream row.
Submit
When ready, click on the Submit button.
Data Quality will immediately start monitoring the new Asset / Data stream pairs and they will be added to the Data Quality Dashboard.
Edit Configurations
You can edit individual metric settings for Asset / Data Stream Pairs.
To start either click on the edit icon .
Or if you want to edit multiple rows together select a number of Asset / Data Stream rows and click the Edit button.
You will then see the configuration page where you can update the metric settings.
When ready, click the Submit button to save your changes.
Delete Configurations
You can delete individual Data Quality checks for Asset / Data Stream Pairs.
Note
You can not delete any Data Quality monitors if they are being used in a Kelvin SmartApp™.
The delete icon will be gray and unselectable.
To start either click on the delete icon .
Or if you want to delete multiple rows together select a number of Asset / Data Stream rows and click the Delete button.
You will then see the delete confirmation popup. Type the word PERMANENTLY DELETE to confirm and then click the Delete button.


































