title: Previous monthly updates in Azure Synapse Analytics
description: Archive of the new features and documentation improvements for Azure Synapse Analytics
author: ryanmajidi
ms.author: rymajidi
ms.service: synapse-analytics
ms.subservice: overview
ms.topic: conceptual
ms.date: 05/20/2022

Previous monthly updates in Azure Synapse Analytics

This article describes previous monthly updates to Azure Synapse Analytics. For the most current month's release, see Azure Synapse Analytics latest updates. Each update links to the Azure Synapse Analytics blog and to an article that provides more information.

Mar 2022 update

The following updates are new to Azure Synapse Analytics this month.

Developer Experience

  • Code cells in Synapse notebooks that result in an exception now show standard output along with the exception message. This feature is supported for Python and Scala. To learn more, see the example output when a code statement fails.

  • Synapse notebooks now support partial output when running code cells. To learn more, see the examples in this blog post.

  • You can now dynamically control Spark session configuration for the notebook activity with pipeline parameters. To learn more, see the variable explorer feature of Synapse notebooks.

  • You can now reuse and manage notebook sessions without having to start a new one. You can easily connect a selected notebook to an active session in the list started from another notebook. You can detach a session from a notebook, stop the session, and monitor it. To learn more, see how to manage your active notebook sessions.

  • Synapse notebooks now capture anything written through the Python logging module, in addition to the driver logs. To learn more, see support for Python logging.
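
As a minimal sketch of the last item, the cell below writes messages through Python's standard `logging` module. The logger name `synapse_notebook_demo`, the messages, and the in-memory buffer are illustrative assumptions, not taken from the Synapse documentation. The buffer is attached only so the example is self-contained. Where Synapse surfaces the records is described in the linked article.

```python
import io
import logging

# Hypothetical logger name, chosen for illustration only.
logger = logging.getLogger("synapse_notebook_demo")
logger.setLevel(logging.INFO)

# Attach a handler that writes to an in-memory buffer so the
# example can show what was logged without any external service.
buffer = io.StringIO()
handler = logging.StreamHandler(buffer)
handler.setFormatter(logging.Formatter("%(levelname)s %(name)s: %(message)s"))
logger.addHandler(handler)

logger.info("Loaded %d rows", 1200)
logger.warning("Column %s contains nulls", "price")
logger.debug("Dropped: DEBUG is below the logger's INFO level")

captured = buffer.getvalue()
print(captured)
```

Only the INFO and WARNING records reach the handler, because the logger's level filters out DEBUG messages.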

SQL

  • Column Level Encryption (CLE) for Azure Synapse dedicated SQL pools is now Generally Available. With column level encryption, you can use a different protection key for each column, with each key having its own access permissions. The data in CLE-enforced columns is encrypted on disk and remains encrypted in memory until the DECRYPTBYKEY function is used to decrypt it. To learn more, see how to encrypt a data column.

  • Serverless SQL pools now deliver better performance for CETAS (Create External Table as Select) and subsequent SELECT queries. The improvements include a parallel execution plan, which results in faster CETAS execution and output to multiple files. To learn more, see the CETAS with Synapse SQL article and the blog post.

Apache Spark for Synapse

  • Synapse Spark Common Data Model (CDM) Connector is now Generally Available. The CDM format reader/writer enables a Spark program to read and write CDM entities in a CDM folder via Spark dataframes. To learn more, see how the CDM connector supports reading, writing data, examples, & known issues.

  • Synapse Spark Dedicated SQL Pool (DW) Connector now offers improved performance. The new architecture eliminates redundant data movement and uses COPY-INTO instead of PolyBase. You can authenticate through SQL basic authentication or opt into the Azure Active Directory (Azure AD) based authentication method. Performance is now about 5x better than in the previous version. To learn more, see Azure Synapse Dedicated SQL Pool Connector for Apache Spark.

  • Synapse Spark Dedicated SQL Pool (DW) Connector now supports all Spark DataFrame SaveMode choices: Append, Overwrite, ErrorIfExists, and Ignore. The Append and Overwrite modes are critical for managing data ingestion at scale. To learn more, see DataFrame write SaveMode support.

  • Accelerate Spark execution speed using the new Intelligent Cache feature. This feature is currently in public preview. Intelligent Cache automatically stores each read within the allocated cache storage space, detecting underlying file changes and refreshing the files to provide the most recent data. To learn more, see how to Enable/Disable the cache for your Apache Spark pool or see the blog post.
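
The four SaveMode choices listed above differ only in what happens when the target table already exists. The following is a minimal sketch of those semantics in plain Python, assuming an in-memory dictionary as a stand-in for the target. The `write_table` helper and the `sales` table are hypothetical and are not part of the connector's API. In PySpark itself, the mode is normally selected with `DataFrameWriter.mode(...)`.

```python
from enum import Enum


class SaveMode(Enum):
    APPEND = "append"
    OVERWRITE = "overwrite"
    ERROR_IF_EXISTS = "errorifexists"
    IGNORE = "ignore"


def write_table(tables, name, rows, mode):
    """Apply `rows` to the in-memory `tables` dict following SaveMode semantics."""
    exists = name in tables
    if mode is SaveMode.APPEND:
        # Add to existing rows, creating the table if needed.
        tables.setdefault(name, []).extend(rows)
    elif mode is SaveMode.OVERWRITE:
        # Replace whatever was there.
        tables[name] = list(rows)
    elif mode is SaveMode.ERROR_IF_EXISTS:
        # Refuse to touch an existing table.
        if exists:
            raise ValueError(f"table {name!r} already exists")
        tables[name] = list(rows)
    elif mode is SaveMode.IGNORE:
        # Silently skip the write if the table exists.
        if not exists:
            tables[name] = list(rows)
    return tables


tables = {}
write_table(tables, "sales", [1, 2], SaveMode.ERROR_IF_EXISTS)  # creates the table
write_table(tables, "sales", [3], SaveMode.APPEND)              # rows are now [1, 2, 3]
write_table(tables, "sales", [9], SaveMode.IGNORE)              # no change, table exists
write_table(tables, "sales", [7, 8], SaveMode.OVERWRITE)        # rows are now [7, 8]
```

Append and Overwrite are the two modes that modify an existing table, which is why they matter most for repeated ingestion runs.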

Security

Data Integration

Feb 2022 update

The following updates are new to Azure Synapse Analytics this month.

SQL

Data integration

Jan 2022 update

The following updates are new to Azure Synapse Analytics this month.

Apache Spark for Synapse

You can now use four new database templates in Azure Synapse. Learn more about Automotive, Genomics, Manufacturing, and Pharmaceuticals templates from the blog post or the database templates article. These templates are currently in public preview and are available within the Synapse Studio gallery.

Machine Learning

Improvements to the Synapse Machine Learning library v0.9.5 (previously called MMLSpark). This release simplifies the creation of massively scalable machine learning pipelines with Apache Spark. To learn more, read the blog post about the new capabilities in this release or see the full release notes.

Security

  • The Azure Synapse Analytics security overview is a whitepaper that covers the five layers of security: authentication, access control, data protection, network security, and threat protection. Understand each security feature in detail to implement an industry-standard security baseline and protect your data in the cloud.

  • TLS 1.2 is now required for newly created Synapse Workspaces. To learn more, see how TLS 1.2 provides enhanced security using this article or the blog post. Login attempts to a newly created Synapse workspace from connections using TLS versions lower than 1.2 will fail.
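
For client code that you control, one way to stay within the TLS requirement is to pin the minimum protocol version on the connection's TLS settings. The following is a minimal sketch using Python's standard `ssl` module. How the resulting context is passed to a particular database driver or HTTP client depends on that library and is not shown here.

```python
import ssl

# Start from Python's default client-side settings
# (certificate verification and hostname checking enabled).
context = ssl.create_default_context()

# Refuse to negotiate any protocol version older than TLS 1.2.
context.minimum_version = ssl.TLSVersion.TLSv1_2

print(context.minimum_version.name)
```

Setting a floor rather than a fixed version still lets the client negotiate newer TLS versions when the server offers them.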

Data Integration

Synapse SQL

December 2021 update

The following updates are new to Azure Synapse Analytics this month.

Apache Spark for Synapse

  • Accelerate Spark workloads with NVIDIA GPU acceleration blog article
  • Mount remote storage to a Synapse Spark pool blog article
  • Natively read & write data in ADLS with Pandas blog article
  • Dynamic allocation of executors for Spark blog article

Machine Learning

  • The Synapse Machine Learning library blog article
  • Getting started with state-of-the-art pre-built intelligent models blog article
  • Building responsible AI systems with the Synapse ML library blog article
  • PREDICT is now GA for Synapse Dedicated SQL pools blog article
  • Simple & scalable scoring with PREDICT and MLFlow for Apache Spark for Synapse blog article
  • Retail AI solutions blog article

Security

  • User-Assigned managed identities now supported in Synapse Pipelines in preview blog article
  • Browse ADLS Gen2 folders in an Azure Synapse Analytics workspace in preview blog article

Data Integration

  • Pipeline Fail activity blog article
  • Mapping Data Flow gets new native connectors blog article
  • More notebook export formats: HTML, Python, and LaTeX blog
  • Three new chart types in notebook view: box plot, histogram, and pivot table blog
  • Reconnect to lost notebook session blog

Integrate

November 2021 update

The following updates are new to Azure Synapse Analytics this month.

Synapse Data Explorer

  • Synapse Data Explorer now available in preview blog article

Working with Databases and Data Lakes

  • Introducing Lake databases (formerly known as Spark databases) blog article
  • Lake database designer now available in preview blog article
  • Database Templates and Database Designer blog article

SQL

  • Delta Lake support for serverless SQL is generally available blog article
  • Query multiple file paths using OPENROWSET in serverless SQL blog article
  • Serverless SQL queries can now return up to 200 GB of results blog article
  • Handling invalid rows with OPENROWSET in serverless SQL blog article

Apache Spark for Synapse

  • Accelerate Spark workloads with NVIDIA GPU acceleration blog article
  • Mount remote storage to a Synapse Spark pool blog article
  • Natively read & write data in ADLS with Pandas blog article
  • Dynamic allocation of executors for Spark blog article

Machine Learning

  • The Synapse Machine Learning library blog article
  • Getting started with state-of-the-art pre-built intelligent models blog article
  • Building responsible AI systems with the Synapse ML library blog article
  • PREDICT is now GA for Synapse Dedicated SQL pools blog article
  • Simple & scalable scoring with PREDICT and MLFlow for Apache Spark for Synapse blog article
  • Retail AI solutions blog article

Security

  • User-Assigned managed identities now supported in Synapse Pipelines in preview blog article
  • Browse ADLS Gen2 folders in an Azure Synapse Analytics workspace in preview blog article

Data Integration

Synapse Link

  • Synapse Link for Dataverse blog article
  • Custom partitions for Synapse Link for Azure Cosmos DB in preview blog article

October 2021 update

The following updates are new to Azure Synapse Analytics this month.

General

  • Manage your cost with Azure Synapse pre-purchase plans blog article
  • Move your Azure Synapse workspace across Azure regions blog article

Apache Spark for Synapse

  • Spark performance optimizations blog

Security

  • All Synapse RBAC roles are now generally available for use in production blog article
  • Apply User-Assigned Managed Identities for Double Encryption blog article
  • Synapse Administrators now have elevated access to dedicated SQL pools blog article

Governance

  • Synapse workspaces can now automatically push lineage data to Microsoft Purview blog article

Integrate

  • Use Stringify in data flows to easily transform complex data types to strings blog article
  • Control Spark session time-to-live (TTL) in data flows blog article

CI/CD & Git

  • Deploy Synapse workspaces using GitHub Actions blog article
  • More control creating Git branches in Synapse Studio blog article

Developer Experience

  • Enhanced Markdown editing in Synapse notebooks preview blog article
  • Pandas dataframes automatically render as nicely formatted HTML tables blog article
  • Use IPython widgets in Synapse Notebooks blog article
  • Mssparkutils runtime context now available for Python and Scala blog article

Next steps

Get started with Azure Synapse Analytics