Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Spark] Support in-place migration from unpartitioned table to clustered table #3174

Merged
merged 5 commits into from
May 31, 2024

Conversation

zedtang
Copy link
Collaborator

@zedtang zedtang commented May 29, 2024

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

Support in-place migration from unpartitioned table to clustered table. If the table is an unpartitioned table and users run ALTER TABLE CLUSTER BY on it, it will now create a clustered table with ClusteringMetadataDomain.

Resolves #2460

How was this patch tested?

New UTs.

Does this PR introduce any user-facing changes?

No

@zedtang zedtang changed the title Support in-place migration from unpartitioned table to clustered table. [Spark] Support in-place migration from unpartitioned table to clustered table May 29, 2024
@zedtang zedtang self-assigned this May 29, 2024
Copy link
Contributor

@dabao521 dabao521 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the work! Left a few comments, and let's add add more validations in the test.

@zedtang zedtang requested a review from dabao521 May 30, 2024 19:31
Copy link
Contributor

@dabao521 dabao521 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@vkorukanti vkorukanti merged commit 56ce212 into delta-io:master May 31, 2024
10 checks passed
@zedtang zedtang deleted the in-place-migration branch May 31, 2024 15:33
richardc-db pushed a commit to richardc-db/delta that referenced this pull request Jun 5, 2024
…red table (delta-io#3174)

## Description
Support in-place migration from unpartitioned table to clustered table.
If the table is an unpartitioned table and users run `ALTER TABLE
CLUSTER BY` on it, it will now create a clustered table with
ClusteringMetadataDomain.

Resolves delta-io#2460 

## How was this patch tested?
New UTs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature Request] In-place migration from unpartitioned table to clustered table
3 participants