title | titleSuffix | description | services | ms.service | ms.subservice | ms.custom | ms.topic | ms.author | author | ms.date |
---|---|---|---|---|---|---|---|---|---|---|
What is a workspace? |
Azure Machine Learning |
The workspace is the top-level resource for Azure Machine Learning. It keeps a history of all training runs, with logs, metrics, output, and a snapshot of your scripts. |
machine-learning |
machine-learning |
core |
event-tier1-build-2022 |
conceptual |
sgilley |
sdgilley |
01/04/2022 |
The workspace is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. The workspace keeps a history of all training runs, including logs, metrics, output, and a snapshot of your scripts. You use this information to determine which training run produces the best model.
Once you have a model you like, you register it with the workspace. You then use the registered model and scoring scripts to deploy to Azure Container Instances, Azure Kubernetes Service, or to a field-programmable gate array (FPGA) as a REST-based HTTP endpoint.
A taxonomy of the workspace is illustrated in the following diagram:
The diagram shows the following components of a workspace:
-
A workspace can contain Azure Machine Learning compute instances, cloud resources configured with the Python environment necessary to run Azure Machine Learning.
-
User roles enable you to share your workspace with other users, teams, or projects.
-
Compute targets are used to run your experiments.
-
When you create the workspace, associated resources are also created for you.
-
Experiments are training runs you use to build your models.
-
Pipelines are reusable workflows for training and retraining your model.
-
Datasets aid in management of the data you use for model training and pipeline creation.
-
Once you have a model you want to deploy, you create a registered model.
-
Use the registered model and a scoring script to create a deployment endpoint.
You can interact with your workspace in the following ways:
Important
Tools marked (preview) below are currently in public preview. The preview version is provided without a service level agreement, and it's not recommended for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.
- On the web:
- In any Python environment with the Azure Machine Learning SDK for Python.
- On the command line using the Azure Machine Learning CLI extension
- Azure Machine Learning VS Code Extension
Machine learning tasks read and/or write artifacts to your workspace.
- Run an experiment to train a model - writes experiment run results to the workspace.
- Use automated ML to train a model - writes training results to the workspace.
- Register a model in the workspace.
- Deploy a model - uses the registered model to create a deployment.
- Create and run reusable workflows.
- View machine learning artifacts such as experiments, pipelines, models, deployments.
- Track and monitor models.
You can also perform the following workspace management tasks:
Workspace management task | Portal | Studio | Python SDK | Azure CLI | VS Code |
---|---|---|---|---|---|
Create a workspace | ✓ | ✓ | ✓ | ✓ | |
Manage workspace access | ✓ | ✓ | |||
Create and manage compute resources | ✓ | ✓ | ✓ | ✓ | |
Create a Notebook VM | ✓ |
Warning
Moving your Azure Machine Learning workspace to a different subscription, or moving the owning subscription to a new tenant, is not supported. Doing so may cause errors.
There are multiple ways to create a workspace:
- Use the Azure portal for a point-and-click interface to walk you through each step.
- Use the Azure Machine Learning SDK for Python to create a workspace on the fly from Python scripts or Jupyter notebooks
- Use an Azure Resource Manager template or the Azure Machine Learning CLI when you need to automate or customize the creation with corporate security standards.
- If you work in Visual Studio Code, use the VS Code extension.
Note
The workspace name is case-insensitive.
These sub resources are the main resources that are made in the AML workspace.
- VMs: provide computing power for your AML workspace and are an integral part in deploying and training models.
- Load Balancer: a network load balancer is created for each compute instance and compute cluster to manage traffic even while the compute instance/cluster is stopped.
- Virtual Network: these help Azure resources communicate with one another, the internet, and other on-premises networks.
- Bandwidth: encapsulates all outbound data transfers across regions.
When you create a new workspace, it automatically creates several Azure resources that are used by the workspace:
-
Azure Storage account: Is used as the default datastore for the workspace. Jupyter notebooks that are used with your Azure Machine Learning compute instances are stored here as well.
[!IMPORTANT] By default, the storage account is a general-purpose v1 account. You can upgrade this to general-purpose v2 after the workspace has been created. Do not enable hierarchical namespace on the storage account after upgrading to general-purpose v2.
To use an existing Azure Storage account, it cannot be of type BlobStorage or a premium account (Premium_LRS and Premium_GRS). It also cannot have a hierarchical namespace (used with Azure Data Lake Storage Gen2). Neither premium storage nor hierarchical namespaces are supported with the default storage account of the workspace. You can use premium storage or hierarchical namespace with non-default storage accounts.
-
Azure Container Registry: Registers docker containers that are used for the following components:
- Azure Machine Learning environments when training and deploying models
- AutoML when deploying
- Data profiling
To minimize costs, ACR is lazy-loaded until images are needed.
[!NOTE] If your subscription setting requires adding tags to resources under it, Azure Container Registry (ACR) created by Azure Machine Learning will fail, since we cannot set tags to ACR.
-
Azure Application Insights: Stores monitoring and diagnostics information. For more information, see Monitor and collect data from Machine Learning web service endpoints.
[!NOTE] You can delete the Application Insights instance after cluster creation if you want. Deleting it limits the information gathered from the workspace, and may make it more difficult to troubleshoot problems. If you delete the Application Insights instance created by the workspace, you cannot re-create it without deleting and recreating the workspace.
-
Azure Key Vault: Stores secrets that are used by compute targets and other sensitive information that's needed by the workspace.
Note
You can instead use existing Azure resource instances when you create the workspace with the Python SDK or the Azure Machine Learning CLI using an ARM template.
As of September 2020, all capabilities that were available in Enterprise edition workspaces are now also available in Basic edition workspaces.
New Enterprise workspaces can no longer be created. Any SDK, CLI, or Azure Resource Manager calls that use the sku
parameter will continue to work but a Basic workspace will be provisioned.
Beginning December 21st, all Enterprise Edition workspaces will be automatically set to Basic Edition, which has the same capabilities. No downtime will occur during this process. On January 1, 2021, Enterprise Edition will be formally retired.
In either editions, customers are responsible for the costs of Azure resources consumed and will not need to pay any additional charges for Azure Machine Learning. Please refer to the Azure Machine Learning pricing page for more details.
To learn more about planning a workspace for your organization's requirements, see Organize and set up Azure Machine Learning.
To get started with Azure Machine Learning, see: