---
title: Deploy machine learning models to managed online endpoint using Python SDK v2 (preview)
titleSuffix: Azure Machine Learning
description: Learn to deploy your machine learning model to Azure using Python SDK v2 (preview).
services: machine-learning
ms.service: machine-learning
ms.subservice: mlops
ms.author: ssambare
ms.reviewer: larryfr
author: shivanissambare
ms.date: 05/25/2022
ms.topic: how-to
ms.custom: how-to, devplatv2, sdkv2, deployment
---

# Deploy and score a machine learning model with managed online endpoint using Python SDK v2 (preview)

[!INCLUDE sdk v2]

> [!IMPORTANT]
> SDK v2 is currently in public preview. The preview version is provided without a service level agreement, and it's not recommended for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see [Supplemental Terms of Use for Microsoft Azure Previews](https://azure.microsoft.com/support/legal/preview-supplemental-terms/).

In this article, you learn how to deploy your machine learning model to a managed online endpoint and get predictions. You'll begin by deploying the model on your local machine to debug any errors, and then you'll deploy and test it in Azure.

## Prerequisites

- If you don't have an Azure subscription, create a free account before you begin. Try the free or paid version of Azure Machine Learning today.
- The Azure Machine Learning SDK v2 for Python (see the install sketch after this list).
- An Azure resource group, in which you (or the service principal you use) have Contributor access.
- An Azure Machine Learning workspace.
- To deploy locally, Docker Engine installed and running on your local computer. We highly recommend this option because it makes debugging issues easier.
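
If you don't have the SDK yet, the following is one way to install it. This is a sketch, assuming `pip` and the `azure-ai-ml` package name used by the v2 preview; check the SDK v2 installation instructions for the currently recommended command.

```bash
# Install the Azure ML Python SDK v2 (preview) and the identity package
# used for DefaultAzureCredential (package names assumed; verify against the docs)
pip install azure-ai-ml azure-identity
```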

## Clone examples repository

To run the training examples, first clone the examples repository and change into the sdk directory:

```bash
git clone --depth 1 https://github.com/Azure/azureml-examples
cd azureml-examples/sdk
```

> [!TIP]
> Use `--depth 1` to clone only the latest commit to the repository, which reduces time to complete the operation.

## Connect to Azure Machine Learning workspace

The workspace is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. In this section, we'll connect to the workspace in which you'll perform deployment tasks.

1. Import the required libraries:

    ```python
    # import required libraries
    from azure.ai.ml import MLClient
    from azure.ai.ml.entities import (
        ManagedOnlineEndpoint,
        ManagedOnlineDeployment,
        Model,
        Environment,
        CodeConfiguration,
    )
    from azure.identity import DefaultAzureCredential
    ```
2. Configure workspace details and get a handle to the workspace:

    To connect to a workspace, we need identifier parameters: a subscription ID, resource group, and workspace name. We'll use these details in the `MLClient` from `azure.ai.ml` to get a handle to the required Azure Machine Learning workspace. This example uses the default Azure authentication.

    ```python
    # enter details of your AML workspace
    subscription_id = "<SUBSCRIPTION_ID>"
    resource_group = "<RESOURCE_GROUP>"
    workspace = "<AML_WORKSPACE_NAME>"

    # get a handle to the workspace
    ml_client = MLClient(
        DefaultAzureCredential(), subscription_id, resource_group, workspace
    )
    ```
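
    As an optional sanity check that the handle works (a minimal sketch; it assumes your identity has at least read access to the workspace):

    ```python
    # Fetch the workspace to confirm the client is configured correctly
    ws = ml_client.workspaces.get(name=workspace)
    print(ws.name, ws.location)
    ```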

## Create local endpoint and deployment

> [!NOTE]
> To deploy locally, Docker Engine must be installed and running. Docker Engine typically starts when the computer starts; if it doesn't, you can troubleshoot Docker Engine.

1. Create local endpoint:

    The goal of a local endpoint deployment is to validate and debug your code and configuration before you deploy to Azure. Local deployment has the following limitations:

    - Local endpoints don't support traffic rules, authentication, or probe settings.
    - Local endpoints support only one deployment per endpoint.

    ```python
    # Creating a local endpoint
    import datetime

    local_endpoint_name = "local-" + datetime.datetime.now().strftime("%m%d%H%M%f")

    # create an online endpoint
    endpoint = ManagedOnlineEndpoint(
        name=local_endpoint_name, description="this is a sample local endpoint"
    )
    ml_client.online_endpoints.begin_create_or_update(endpoint, local=True)
    ```
2. Create local deployment:

    The example contains all the files needed to deploy a model on an online endpoint. To deploy a model, you must have:

    - Model files (or the name and version of a model that's already registered in your workspace). In the example, we have a scikit-learn model that does regression.
    - The code that's required to score the model. In this case, we have a *score.py* file (a sketch of the scoring-script contract appears after this step).
    - An environment in which your model runs. As you'll see, the environment might be a Docker image with Conda dependencies, or it might be a Dockerfile.
    - Settings to specify the instance type and scaling capacity.

    Key aspects of deployment:

    - `name` - Name of the deployment.
    - `endpoint_name` - Name of the endpoint to create the deployment under.
    - `model` - The model to use for the deployment. This value can be either a reference to an existing versioned model in the workspace or an inline model specification.
    - `environment` - The environment to use for the deployment. This value can be either a reference to an existing versioned environment in the workspace or an inline environment specification.
    - `code_configuration` - The configuration for the source code and scoring script.
      - `path` - Path to the source code directory for scoring the model.
      - `scoring_script` - Relative path to the scoring file in the source code directory.
    - `instance_type` - The VM size to use for the deployment. For the list of supported sizes, see Managed online endpoints SKU list.
    - `instance_count` - The number of instances to use for the deployment.

    ```python
    model = Model(path="../model-1/model/sklearn_regression_model.pkl")
    env = Environment(
        conda_file="../model-1/environment/conda.yml",
        image="mcr.microsoft.com/azureml/openmpi3.1.2-ubuntu18.04:20210727.v1",
    )

    blue_deployment = ManagedOnlineDeployment(
        name="blue",
        endpoint_name=local_endpoint_name,
        model=model,
        environment=env,
        code_configuration=CodeConfiguration(
            code="../model-1/onlinescoring", scoring_script="score.py"
        ),
        instance_type="Standard_F2s_v2",
        instance_count=1,
    )
    ml_client.online_deployments.begin_create_or_update(
        deployment=blue_deployment, local=True
    )
    ```
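
The scoring script referenced by `code_configuration` must define an `init()` function, which runs when the container starts, and a `run()` function, which runs for every scoring request. The sketch below illustrates that contract; it's a hedged approximation, not the actual *score.py* from the examples repo, and the model file name and request shape are assumptions.

```python
# score.py - a minimal sketch of the scoring-script contract (illustrative only)
import os
import json
import joblib

model = None


def init():
    # AZUREML_MODEL_DIR points to the directory where the model was mounted.
    # The file name below is an assumption based on the registered model path.
    global model
    model_path = os.path.join(
        os.environ["AZUREML_MODEL_DIR"], "sklearn_regression_model.pkl"
    )
    model = joblib.load(model_path)


def run(raw_data):
    # raw_data is the JSON body of the request, passed in as a string
    data = json.loads(raw_data)["data"]
    return model.predict(data).tolist()
```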

## Verify the local deployment succeeded

1. Check the status to see whether the model was deployed without error:

    ```python
    ml_client.online_endpoints.get(name=local_endpoint_name, local=True)
    ```

2. Get logs:

    ```python
    ml_client.online_deployments.get_logs(
        name="blue", endpoint_name=local_endpoint_name, local=True, lines=50
    )
    ```

## Invoke the local endpoint

Invoke the endpoint to score the model by using the convenience command `invoke` and passing query parameters that are stored in a JSON file:

```python
ml_client.online_endpoints.invoke(
    endpoint_name=local_endpoint_name,
    request_file="../model-1/sample-request.json",
    local=True,
)
```
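
The request file is a small JSON document. Its exact contents aren't reproduced here; for this scikit-learn regression model it's shaped roughly like the `{"data": [...]}` payload below, where the feature values are assumptions for illustration only:

```python
# Illustrative only: write a request file with the assumed {"data": [...]} shape
import json

sample_request = {"data": [[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]]}
with open("sample-request.json", "w") as f:
    json.dump(sample_request, f)
```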

## Deploy your online endpoint to Azure

Next, deploy your online endpoint to Azure.

1. Configure online endpoint:

    > [!TIP]
    > - `endpoint_name`: The name of the endpoint. It must be unique in the Azure region. For more information on the naming rules, see managed online endpoint limits.
    > - `auth_mode`: Use `key` for key-based authentication. Use `aml_token` for Azure Machine Learning token-based authentication. A `key` doesn't expire, but an `aml_token` does expire. For more information on authenticating, see Authenticate to an online endpoint.
    > - Optionally, you can add a description and tags to your endpoint.

    ```python
    # Creating a unique endpoint name with current datetime to avoid conflicts
    import datetime

    online_endpoint_name = "endpoint-" + datetime.datetime.now().strftime("%m%d%H%M%f")

    # create an online endpoint
    endpoint = ManagedOnlineEndpoint(
        name=online_endpoint_name,
        description="this is a sample online endpoint",
        auth_mode="key",
        tags={"foo": "bar"},
    )
    ```
2. Create the endpoint:

    Using the `MLClient` created earlier, we'll now create the endpoint in the workspace. This command starts the endpoint creation and returns a confirmation response while the endpoint creation continues.

    ```python
    ml_client.begin_create_or_update(endpoint)
    ```
3. Configure online deployment:

    A deployment is a set of resources required for hosting the model that does the actual inferencing. We'll create a deployment for our endpoint using the `ManagedOnlineDeployment` class.

    ```python
    model = Model(path="../model-1/model/sklearn_regression_model.pkl")
    env = Environment(
        conda_file="../model-1/environment/conda.yml",
        image="mcr.microsoft.com/azureml/openmpi3.1.2-ubuntu18.04:20210727.v1",
    )

    blue_deployment = ManagedOnlineDeployment(
        name="blue",
        endpoint_name=online_endpoint_name,
        model=model,
        environment=env,
        code_configuration=CodeConfiguration(
            code="../model-1/onlinescoring", scoring_script="score.py"
        ),
        instance_type="Standard_F2s_v2",
        instance_count=1,
    )
    ```
4. Create the deployment:

    Using the `MLClient` created earlier, we'll now create the deployment in the workspace. This command starts the deployment creation and returns a confirmation response while the deployment creation continues.

    ```python
    ml_client.begin_create_or_update(blue_deployment)

    # blue deployment takes 100% of the traffic
    endpoint.traffic = {"blue": 100}
    ml_client.begin_create_or_update(endpoint)
    ```

## Test the endpoint with sample data

Using the `MLClient` created earlier, we'll get a handle to the endpoint. The endpoint can be invoked using the `invoke` command with the following parameters:

- `endpoint_name` - Name of the endpoint.
- `request_file` - File with request data.
- `deployment_name` - Name of the specific deployment to test in an endpoint.

We'll send a sample request using a JSON file.

```python
# test the blue deployment with some sample data
ml_client.online_endpoints.invoke(
    endpoint_name=online_endpoint_name,
    deployment_name="blue",
    request_file="../model-1/sample-request.json",
)
```
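
If you'd rather call the endpoint over plain HTTPS (for example, from an application), you can retrieve the scoring URI and an auth key and POST the same payload. The following is a minimal sketch, assuming key-based auth, the `requests` package, and the `{"data": [...]}` request shape described earlier:

```python
# Sketch: invoke the endpoint over REST instead of via the SDK
import requests

endpoint = ml_client.online_endpoints.get(name=online_endpoint_name)
keys = ml_client.online_endpoints.get_keys(name=online_endpoint_name)

response = requests.post(
    endpoint.scoring_uri,
    headers={
        # Managed online endpoints pass the key as a bearer token
        "Authorization": f"Bearer {keys.primary_key}",
        "Content-Type": "application/json",
    },
    json={"data": [[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]]},  # assumed payload shape
)
print(response.status_code, response.json())
```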

## Managing endpoints and deployments

1. Get details of the endpoint:

    ```python
    # Get the details for online endpoint
    endpoint = ml_client.online_endpoints.get(name=online_endpoint_name)

    # existing traffic details
    print(endpoint.traffic)

    # Get the scoring URI
    print(endpoint.scoring_uri)
    ```

2. Get the logs for the new deployment:

    Get the logs for the blue deployment and verify as needed.

    ```python
    ml_client.online_deployments.get_logs(
        name="blue", endpoint_name=online_endpoint_name, lines=50
    )
    ```

## Delete the endpoint

```python
ml_client.online_endpoints.begin_delete(name=online_endpoint_name)
```
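
If you also created the local endpoint earlier, you can remove it the same way (a sketch; it assumes `local_endpoint_name` is still defined in your session):

```python
# Delete the local Docker-based endpoint as well
ml_client.online_endpoints.begin_delete(name=local_endpoint_name, local=True)
```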

## Next steps

Try these next steps to learn how to use the Azure Machine Learning SDK (v2) for Python: