
Paper Quality Optimization for Developers

This demonstration video was done in v5.5.

There may be slight differences with the latest version.

Check the latest documentation for the specific tasks should any feature not quite work as expected.

The Paper Quality Optimization is also known as Multi-Objective Optimization with Machine Learning.

This is the Developer edition of a 4-part tutorial on turning an in-house machine learning program developed in a Jupyter Notebook into a scalable solution for deployment to the edge in a Kelvin Instance;

  • Overview - Gives an overview of the workflow across all three detailed tutorials. Especially useful for non-technical readers who just want to understand how to integrate and scale an in-house machine learning program using Kelvin.
  • Developer - Detailed walk-through of taking a Jupyter Notebook containing an in-house machine learning model program and integrating it as a Kelvin SmartApp™ ready for upload and deployment to the edge.
  • Platform Administrator - Detailed walk-through of managing all administrative matters for the tutorial example: the Units, Asset Types, Assets, Data Streams, Connections and upload to the App Registry.
  • Process Engineer - Detailed walk-through of adding Assets to Kelvin SmartApps™, monitoring the performance of the Assets and taking action on new Recommendations.

Video Tutorial

This tutorial is also in a video format, so you can choose your preferred medium to understand more about Kelvin.

Chapters In Video

  1. Introduction
  2. Overview of the Developer's journey
  3. The Jupyter Notebook
  4. Create the Kelvin SmartApp™
  5. From Jupyter Notebook to Python files
  6. Add custom code to main.py
  7. Add libraries to requirements.txt
  8. Edit app.yaml file
  9. Test the program locally
  10. Preview Platform Administrator tutorial

Requirements

For this tutorial you will need to have the following tasks completed first;

  • Install the kelvin-sdk onto your computer.
  • Have access to a Kelvin Instance and log into it in the Terminal.
  • Clone the Kelvin example from GitHub. This is a fully functional App ready for local testing.

For this tutorial we will only need three files from the GitHub example; the multi_objective_optimization.py and rolling_window.py programs and the csv/data.csv test data.

These files are assumed to be your machine learning program you want to use in a Kelvin Instance. The test data is assumed to be your data for testing the App locally before you upload to the Cloud.

Introduction

Briefly, the example monitors key set point inputs and paper quality output data on a paper mill press line, covering the steam boiler, dryer, refiner, forming line, press and paper quality checks.

A machine learning model fits the input and output data to four random forest regression models, one for each output. The four resulting objective functions are then put through a genetic algorithm (NSGA-II) to find the optimized input set points.
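As a rough sketch of that fitting step (illustrative only: the data, shapes and hyperparameters here are invented, and the real code lives in multi_objective_optimization.py in the cloned repository):

```python
# Illustrative sketch only -- random stand-in data, not the paper mill data.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.random((60, 3))   # stand-in for the set point inputs
Y = rng.random((60, 4))   # stand-in for the four paper quality outputs

# Fit one random forest regression model per output column
models = [
    RandomForestRegressor(n_estimators=10, random_state=0).fit(X, Y[:, i])
    for i in range(Y.shape[1])
]

# Each fitted model then acts as one objective function for the NSGA-II search
predictions = np.column_stack([m.predict(X) for m in models])
print(predictions.shape)  # (60, 4)
```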

The aim of the Multi-Objective Optimization with Machine Learning example is to demonstrate how easy it is to embed your machine learning Python program into an App, connect it to live data and send Recommendations to the Operations Engineers.

In this tutorial you will learn how to;

  • Create a new App
  • Import your machine learning Python code
  • Write the Kelvin SmartApp™
    • Set up the logic to connect your ML to the data stream
    • Create Control Changes and Recommendations from your ML output
    • Update requirements.txt file
    • Update app.yaml file
  • Test your App locally

With that, let's get started!

Creating an App

First, create a new blank folder, then create a new App and type in the name of the App when asked.

Kelvin SDK Command Line Example
$ kelvin app create
Please provide a name for the application: multi-objective-optimization-ml
[kelvin.sdk][2024-01-26 20:52:46][I] Creating new application "multi-objective-optimization-ml"
[kelvin.sdk][2024-01-26 20:52:46][I] Retrieving the included schema version
[kelvin.sdk][2024-01-26 20:52:52][I] Retrieving the included schema version
[kelvin.sdk][2024-01-26 20:52:52][R] Successfully created new application: "multi-objective-optimization-ml".
[kelvin.sdk][2024-01-26 20:52:52][I] Kelvin code samples are available at: https://github.com/kelvininc/app-samples

This will create a new folder and fill it with a blank App structure.

Import your Machine Learning Code

Machine Learning Code

In the video tutorial we have shown you the original Jupyter Notebook used by the Data Scientist to create and test a new machine learning model.

You can find the Jupyter Notebook in Kelvin's GitHub example repository that you have cloned.

In this step we will assume that you are familiar with the process of copying all the code from a Jupyter Notebook into a Python file, including all the machine learning code contained within functions and declarations.
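If you prefer not to copy the code by hand, Jupyter's nbconvert tool can export a notebook to a plain Python script (the notebook file name below is an assumption; check the cloned repository for the actual name):

```shell
# Convert the notebook's code cells into a .py file alongside the notebook
jupyter nbconvert --to script multi_objective_optimization.ipynb
```

You will usually still want to tidy the generated file afterwards, for example moving loose cell code into functions.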

You will find the Python file we have already created from the Jupyter Notebook, called multi_objective_optimization.py, in the cloned repository. Copy this file into the new folder you created.

Rolling Window Code

In the Jupyter Notebook we assumed the data fed to the machine learning model is a full snapshot of data to process, contained in the csv file.

In real life conditions, the data is coming into our program as a stream of live data.

So we will need to implement a rolling window program to process the incoming data and create a timeseries snapshot of data that will be passed to the machine learning model functions for processing.
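The idea can be sketched as follows. This is a minimal illustration only, not the repository's rolling_window.py (whose class name, method signatures and timestamp handling differ): keep the most recent N values per asset and data stream, and expose them as a pandas DataFrame snapshot.

```python
# Minimal sketch of the rolling-window idea, NOT the repository's code.
from collections import defaultdict, deque

import pandas as pd


class SimpleRollingWindow:
    def __init__(self, max_data_points: int = 500) -> None:
        self.max_data_points = max_data_points
        # {asset: {data_stream: deque of (timestamp, value)}}
        self._data: dict = defaultdict(lambda: defaultdict(deque))

    def add_value(self, asset: str, data_stream: str, timestamp, value) -> None:
        window = self._data[asset][data_stream]
        window.append((timestamp, value))
        if len(window) > self.max_data_points:
            window.popleft()  # drop the oldest point

    def get_asset_dataframe(self, asset: str) -> pd.DataFrame:
        # One column per data stream, indexed by timestamp
        columns = {
            stream: pd.Series(dict(points))
            for stream, points in self._data[asset].items()
        }
        return pd.DataFrame(columns)


rw = SimpleRollingWindow(max_data_points=3)
for t in range(5):
    rw.add_value("press-line-1", "paper_machine_speed_set_point", t, 460.0 + t)
df = rw.get_asset_dataframe("press-line-1")
print(len(df))  # 3 -- only the newest points are kept
```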

You can find a tutorial and the code on Rolling Window programming in Kelvin's GitHub example repository.

You will find the Python file we have already created, called rolling_window.py, in the cloned repository. Copy this file into the new folder you created.

Write the Kelvin SmartApp™

Now we need to write the main Python program to utilize the machine learning and rolling window program.

A blank main.py file will look like this;

Kelvin SmartApp™ Default Code Example
import asyncio

from kelvin.application import KelvinApp


async def main() -> None:
    # Creating instance of Kelvin SmartApp™ Client
    app = KelvinApp()

    # Connect the App Client
    await app.connect()

    while True:
        # Custom Loop
        await asyncio.sleep(1)


if __name__ == "__main__":
    asyncio.run(main())

As you can see, the Python program is already set up to instantiate and connect to the Kelvin Cloud and to run an infinite loop.

We will use this template and do the following;

  • Import our machine learning code
  • Connect our program to receive live data and process it into windows of data
  • Run our machine learning model
  • Create Control Changes and Recommendations if the output of the machine learning model recommends new settings

Import Machine Learning Files

We start by importing the two machine learning files we created (well, actually copied from the cloned repository) earlier.

Machine Learning Libraries
from multi_objective_optimization import run_model
from rolling_window import RollingWindow

Connect to the Live Data

The input and output data needs to be declared as Asset / Data Stream pairs to be able to connect the program to the live data in the Cloud.

The Data Stream names are declared in the App's app.yaml file and the Asset names are declared at runtime when an Asset is added to Kelvin SmartApps™.

We will set up the app.yaml file later in the tutorial.

So here we only need to subscribe to the Asset / Data Stream pairs.

Subscribe Asset / Data Streams Pair
from kelvin.application import filters
from kelvin.message import Number

# Subscribe to the asset data streams
msg_queue: asyncio.Queue[Number] = app.filter(filters.is_asset_data_message)

We will also instantiate a RollingWindow object and assign it to the variable rolling_window.

Rolling Window
from datetime import timedelta

# Create a rolling window
rolling_window = RollingWindow(
    max_data_points=500, timestamp_rounding_interval=timedelta(seconds=1)
)

Within the infinite loop part of the program we will first wait for new data to arrive for our Asset / Data Stream pair;

Await New Messages From Queue
# Await a new message from the queue
message = await msg_queue.get()

Then we add this new data to the rolling window, find out which asset we are working on, and retrieve a snapshot of data as a DataFrame from the RollingWindow class for that asset.

Add Data to Rolling Window
# Add the message to the rolling window
rolling_window.add_message(message)

# Get asset
asset = message.resource.asset

print("asset:", asset)

# Retrieve dataframe from the rolling window for the specified asset
df = rolling_window.get_asset_dataframe(asset)

Up to this stage your main.py code should look like this;

Partial main.py Code
import asyncio
from datetime import timedelta

from kelvin.application import KelvinApp, filters
from kelvin.message import Number

from multi_objective_optimization import run_model
from rolling_window import RollingWindow


async def main() -> None:
    # Creating instance of Kelvin SmartApp™ Client
    app = KelvinApp()

    # Connect the App Client
    await app.connect()

    # Subscribe to the asset data streams
    msg_queue: asyncio.Queue[Number] = app.filter(filters.is_asset_data_message)

    # Create a rolling window
    rolling_window = RollingWindow(
        max_data_points=500, timestamp_rounding_interval=timedelta(seconds=1)
    )

    while True:
        # Await a new message from the queue
        message = await msg_queue.get()

        # Add the message to the rolling window
        rolling_window.add_message(message)

        # Get asset
        asset = message.resource.asset

        # Retrieve dataframe from the rolling window for the specified asset
        df = rolling_window.get_asset_dataframe(asset)


if __name__ == "__main__":
    asyncio.run(main())

Machine Learning Model

Now that the main declarations, data retrieval and cleaning have been done, let's move on to the machine learning model.

To keep the code clean, and to wrap our machine learning model in an error handling mechanism, we will put the model and output processing code in a separate function and call it from our infinite loop;

Create Model on Data Function
import pandas as pd
import logging

# Configure logging
logging.basicConfig(level=logging.INFO)

async def process_data(app: KelvinApp, asset: str, df: pd.DataFrame) -> None:
    try:
        # Run model
        recommended_setpoints = run_model(df)

        if recommended_setpoints:
            pass

    except Exception as e:
        logging.error(f"Error processing data for asset {asset}: {e}")

And in the infinite loop after retrieving the DataFrame with rolling data for an asset, we will call the function and wait for it to finish before moving on.

Run the Machine Learning Model
# Process the data
await process_data(app, asset, df)

Control Changes and Recommendations

Finally we will process the output of the machine learning program.

If there are new settings to send as Recommendations to the Kelvin UI, then we will create a Control Change instruction for each set point that needs to be updated and then wrap all the Control Changes in a Recommendation.

If the Recommendation is accepted in the Kelvin UI, then Kelvin will automatically process all the Control Changes we have created and send the new data values to the associated Asset / Data Stream pairs.

The acceptance process in the Kelvin UI is demonstrated in detail in the Multi-Objective Optimization with Machine Learning for Operations.

You can also see a brief overview of the process in the Multi-Objective Optimization with Machine Learning tutorial.

First, let's process the model's output to create Control Changes. For each Control Change we need to assign three variables;

  • The Asset / Data Stream pair to update
  • The value to update to
  • The expiration date after which the write will be declared a failure

As this is a multi-objective optimization there could be a whole array of set points that need updating, so we will loop through the output and create one Control Change per set point value and save them to an array called control_changes.

Create All Control Changes
from kelvin.message import ControlChange
from kelvin.krn import KRNAsset, KRNAssetDataStream

control_changes = []
for setpoint_name, value in recommended_setpoints.items():
    control_change = ControlChange(
        resource=KRNAssetDataStream(asset=asset, data_stream=setpoint_name),
        payload=value,
        expiration_date=timedelta(hours=1),
    )
    control_changes.append(control_change)

All the Control Changes are now wrapped into one Recommendation. When the Recommendation is created it will automatically appear in the Kelvin UI.

Create Recommendation with Control
from kelvin.message import Recommendation

# Create and Publish a Recommendation with all control changes
await app.publish(
    Recommendation(
        resource=KRNAsset(asset=asset),
        type="multi_objective_optimization",
        description="Multi Objective Optimization",
        control_changes=control_changes,
    )
)

Complete Code

The main.py is now complete. If you have followed the steps successfully, your full code should look like this;

Full Code on main.py
import asyncio
import logging
from datetime import timedelta

import pandas as pd
from kelvin.application import KelvinApp, filters
from kelvin.message import ControlChange, Number, Recommendation
from kelvin.krn import KRNAsset, KRNAssetDataStream
from multi_objective_optimization import run_model
from rolling_window import RollingWindow

# Configure logging
logging.basicConfig(level=logging.INFO)


async def process_data(app: KelvinApp, asset: str, df: pd.DataFrame) -> None:
    try:
        # Run model
        recommended_setpoints = run_model(df)

        if recommended_setpoints:
            # Create a control change for each recommended set point
            control_changes = []
            for setpoint_name, value in recommended_setpoints.items():
                control_change = ControlChange(
                    resource=KRNAssetDataStream(asset=asset, data_stream=setpoint_name),
                    payload=value,
                    expiration_date=timedelta(hours=1),
                )
                control_changes.append(control_change)

            # Create and Publish a Recommendation with all control changes
            await app.publish(
                Recommendation(
                    resource=KRNAsset(asset=asset),
                    type="multi_objective_optimization",
                    description="Multi Objective Optimization",
                    control_changes=control_changes,
                )
            )
    except Exception as e:
        logging.error(f"Error processing data for asset {asset}: {e}")


async def main() -> None:
    # Creating instance of Kelvin SmartApp™ Client
    app = KelvinApp()

    # Connect the App Client
    await app.connect()

    # Subscribe to the asset data streams
    msg_queue: asyncio.Queue[Number] = app.filter(filters.is_asset_data_message)

    # Create a rolling window
    rolling_window = RollingWindow(
        max_data_points=500, timestamp_rounding_interval=timedelta(seconds=1)
    )

    while True:
        # Await a new message from the queue
        message = await msg_queue.get()

        print("message:", message)

        # Add the message to the rolling window
        rolling_window.add_message(message)

        # Get asset
        asset = message.resource.asset

        print("asset:", asset)

        # Retrieve dataframe from the rolling window for the specified asset
        df = rolling_window.get_asset_dataframe(asset)

        print("df:", df)

        # Process the data
        await process_data(app, asset, df)


if __name__ == "__main__":
    asyncio.run(main())

Update requirements.txt File

Next we will update the requirements.txt file to ensure all required libraries are installed in our App.

Looking at all the imports across the three program files we have created, we should end up with the file contents looking like this;

If you want, you can pin the version numbers of the libraries to install.

requirements.txt
kelvin-python-sdk
pandas
scikit-learn

Update app.yaml File

In the app.yaml file we need to add all the Data Streams that we want to read/write data.

In the Kelvin UI these will be linked to an Asset to create the Asset / Data Stream pairs that we monitor. We only need to declare the Data Stream part here.

app.yaml Example
inputs:
  - name: wire_part_vacuum_foil_level_set_point
    data_type: number
  - name: exhaust_fan_3_burner_temperature_set_point
    data_type: number
  - name: paper_machine_speed_set_point
    data_type: number
  - name: primary_screen_reject_flow_rate_set_point
    data_type: number
  - name: turbo_3_vacuum_control_output_set_point
    data_type: number
  - name: shoe_press_hydration_tank_level
    data_type: number
  - name: low_pressure_steam_flow_rate_set_point
    data_type: number
  - name: air_dryer_temperature_set_point
    data_type: number
  - name: jw_ratio_volume_flow
    data_type: number
  - name: 3p_load_top_side_set_point
    data_type: number
  - name: mix_pipe_flow_set_point
    data_type: number
  - name: top_dryers_steam_pressure_set_point
    data_type: number
  - name: spray_starch_standby_pump_rate_set_point
    data_type: number
  - name: paper_substance_weight
    data_type: number
  - name: paper_brightness_top_side
    data_type: number
  - name: luminance_value_top_side
    data_type: number
  - name: luminance_value_bottom_side
    data_type: number

We will also update the Description of this App so that it has a friendly name in the Kelvin SmartApps™ section of the Kelvin UI. The friendly name can have special characters and spaces.

In full the app.yaml file should now look like this.

app.yaml Example
app:
  kelvin:
    configuration: {}
    inputs:
      - name: wire_part_vacuum_foil_level_set_point
        data_type: number
      - name: exhaust_fan_3_burner_temperature_set_point
        data_type: number
      - name: paper_machine_speed_set_point
        data_type: number
      - name: primary_screen_reject_flow_rate_set_point
        data_type: number
      - name: turbo_3_vacuum_control_output_set_point
        data_type: number
      - name: shoe_press_hydration_tank_level
        data_type: number
      - name: low_pressure_steam_flow_rate_set_point
        data_type: number
      - name: air_dryer_temperature_set_point
        data_type: number
      - name: jw_ratio_volume_flow
        data_type: number
      - name: 3p_load_top_side_set_point
        data_type: number
      - name: mix_pipe_flow_set_point
        data_type: number
      - name: top_dryers_steam_pressure_set_point
        data_type: number
      - name: spray_starch_standby_pump_rate_set_point
        data_type: number
      - name: paper_substance_weight
        data_type: number
      - name: paper_brightness_top_side
        data_type: number
      - name: luminance_value_top_side
        data_type: number
      - name: luminance_value_bottom_side
        data_type: number
    language:
      python:
        entry_point: kelvin_python_sdk
      type: python
    outputs: []
    parameters: []
  type: kelvin
info:
  description: multi-objective-optimization-ml
  name: multi-objective-optimization-ml
  title: Multi-Objective Optimization ML
  version: 1.0.0
spec_version: 4.11.0
system:
  environment_vars:
    - name: KELVIN_GW_MODE
      value: SOCKETS

Test App Locally

Now that the coding is complete, before we upload the App to the App Registry in the Cloud, we can test it locally with simulation data to ensure it is working.

For this you will need to have two terminals open, both pointing to the folder that has our App.

  • Terminal 1 - This will run a simulation Cloud server which can serve data to your App and also receive and display on screen all requests sent from your program. You can use the printouts to analyze the requests and make sure your code is working properly.
  • Terminal 2 - This will run your App just like any normal Python program.

Terminal 1

Before we start, we need to have a csv file with all the simulation data we want to send to the App.

We will use the same data the Data Scientist used in the Jupyter Notebook.

So, from the cloned repository copy the csv folder which contains the data.csv to our current folder.

The header of the csv file is used by the simulation Cloud server as the Data Stream names.

The data.csv file looks something like this;

Simulation Data data.csv
timestamp,wire_part_vacuum_foil_level_set_point,exhaust_fan_3_burner_temperature_set_point,paper_machine_speed_set_point,primary_screen_reject_flow_rate_set_point,turbo_3_vacuum_control_output_set_point,shoe_press_hydration_tank_level,low_pressure_steam_flow_rate_set_point,air_dryer_temperature_set_point,jw_ratio_volume_flow,3p_load_top_side_set_point,mix_pipe_flow_set_point,top_dryers_steam_pressure_set_point,spray_starch_standby_pump_rate_set_point,paper_substance_weight,paper_brightness_top_side,luminance_value_top_side,luminance_value_bottom_side
0,-3.6299422468457903,35.20203389714766,464.8130798339844,1489.0996733165923,85.30304605432599,90.40479569208054,37.394937242780415,41.94282386416481,102.26864224388486,15.0,2426.5804036458335,0.6634207367897034,84.99734674781598,271.8,91.19,92.21,90.87
1,-3.6913933640434626,35.91813412113294,464.23390706380206,1498.7530808221727,85.32227330205046,90.35345495314826,36.22669347127279,42.12481126331148,102.09702373686291,15.0,2437.4226422991073,0.49544096276873634,35.945196543188764,271.6,91.14,92.21,90.91

...
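As a quick sanity check, you can confirm the CSV header matches the Data Stream names declared in app.yaml. The sketch below reads an inline two-column sample for illustration; in practice you would point read_csv at csv/data.csv:

```python
# The CSV header row becomes the Data Stream names used by the simulator,
# so it should match the names declared in app.yaml.
import io

import pandas as pd

# Small inline sample standing in for csv/data.csv
sample = io.StringIO(
    "timestamp,paper_machine_speed_set_point,paper_brightness_top_side\n"
    "0,464.8,91.19\n"
)

# nrows=0 reads only the header row
columns = pd.read_csv(sample, nrows=0).columns.tolist()

# Every column except `timestamp` should be a declared Data Stream
data_streams = [c for c in columns if c != "timestamp"]
print(data_streams)
```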

With the data.csv file available we can now start the simulation Cloud server;

Kelvin SDK Command Line Example
$ kelvin app test csv --csv csv/data.csv --asset-count 1 --publish-rate 0 --offset-timestamps

The response will look like this;

Kelvin SDK Command Line Output
Publisher started.


Terminal 2

We are now ready to test our program. We run it like any normal Python program.

Run Python Example
1
python main.py

Testing

With the testing started, you can see here how each terminal will respond.

Conclusion

In this part of the Multi-Objective Optimization with Machine Learning, we have shown you how to start from a Jupyter Notebook and end up with a fully tested Kelvin SmartApp™ ready for upload and deployment.

You can also check out the other tutorials related to this one;

  • Multi-Objective Optimization with Machine Learning Overview : A fast run-through from creating the Kelvin SmartApp™, uploading to the App Registry and deploying to the edge, to managing Recommendations.
  • Multi-Objective Optimization with Machine Learning for Developers : A detailed step-by-step process to go from a Jupyter Notebook concept machine learning model to a fully tested Kelvin SmartApp™ ready for upload to the Cloud.
  • Multi-Objective Optimization with Machine Learning for Platform Administrators : A detailed step-by-step process to set up the Instance with all required Assets and Data Streams, add a Connection to read/write data to Assets, and upload the Kelvin SmartApp™ to the App Registry.
  • Multi-Objective Optimization with Machine Learning for Operations : A detailed step-by-step process to add Assets to Kelvin SmartApps™, monitor Kelvin SmartApps™ and Asset performance, and respond to Recommendations from Kelvin SmartApps™.