[Merged by Bors] - Prune finalized execution payloads #3565

michaelsproul · 2022-09-12T01:47:34Z

Issue Addressed

Closes #3556

Proposed Changes

Delete finalized execution payloads from the database in two places:

1. When running the finalization migration in `migrate_database`. We delete the finalized payloads between the last split point and the new updated split point. If payloads are already pruned prior to this, then this is sufficient to prune all payloads, because non-canonical payloads are already deleted by the head pruner and all canonical payloads prior to the previous split will already have been pruned.
2. To address the fact that users will update to this code after the merge on mainnet (and testnets), we need a one-off scan to delete the finalized payloads from the canonical chain. This is implemented in `try_prune_execution_payloads`, which runs on startup and scans the chain back to the Bellatrix fork or the anchor slot (if checkpoint synced after Bellatrix). In the case where payloads are already pruned, this check only imposes a single state load for the split state, which shouldn't be too slow. Even so, a flag `--prune-payloads-on-startup=false` is provided to turn this off after it has run the first time, which provides faster start-up times.

There is also a new `lighthouse db prune_payloads` subcommand for users who prefer to run the pruning manually.
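
The two pruning paths described above can be illustrated with a toy in-memory model. This is a minimal sketch, not Lighthouse's implementation: `ToyStore`, its fields, and both method names are hypothetical, block roots are plain integers, the half-open range `[old_split, new_split)` is an assumption, and the early exit in the startup scan is only one plausible reading of "already pruned".

```rust
use std::collections::{BTreeMap, HashMap};

type Slot = u64;
type Root = u64; // hypothetical stand-in for a 32-byte block root

/// Toy model of the database (all names here are illustrative assumptions).
struct ToyStore {
    /// Canonical chain: slot -> block root (skipped slots are simply absent).
    canonical: BTreeMap<Slot, Root>,
    /// Execution payloads still on disk, keyed by block root.
    payloads: HashMap<Root, Vec<u8>>,
}

impl ToyStore {
    fn new() -> Self {
        ToyStore { canonical: BTreeMap::new(), payloads: HashMap::new() }
    }

    /// Store a canonical block together with a dummy payload.
    fn insert_block(&mut self, slot: Slot, root: Root) {
        self.canonical.insert(slot, root);
        self.payloads.insert(root, vec![0u8]);
    }

    /// Incremental path: on finalization, delete canonical payloads with
    /// `old_split <= slot < new_split`. Returns the number deleted.
    fn prune_between_splits(&mut self, old_split: Slot, new_split: Slot) -> usize {
        if old_split >= new_split {
            return 0;
        }
        let roots: Vec<Root> = self
            .canonical
            .range(old_split..new_split)
            .map(|(_, root)| *root)
            .collect();
        roots
            .iter()
            .filter(|root| self.payloads.remove(*root).is_some())
            .count()
    }

    /// One-off path: walk backwards from the split slot down to `lower_bound`
    /// (the Bellatrix fork slot or the anchor slot). Unless `force` is set,
    /// stop immediately if the newest finalized payload is already absent,
    /// on the assumption that an earlier run has pruned everything below it.
    fn prune_on_startup(&mut self, split: Slot, lower_bound: Slot, force: bool) -> usize {
        if lower_bound >= split {
            return 0;
        }
        let roots: Vec<Root> = self
            .canonical
            .range(lower_bound..split)
            .rev()
            .map(|(_, root)| *root)
            .collect();
        let mut deleted = 0;
        for (i, root) in roots.iter().enumerate() {
            let removed = self.payloads.remove(root).is_some();
            if i == 0 && !removed && !force {
                return 0; // already pruned: nothing more to scan
            }
            if removed {
                deleted += 1;
            }
        }
        deleted
    }
}
```

In this model, once the startup scan has run a single time, every later finalization only needs the incremental path, which is why the description argues that the two mechanisms together prune all finalized payloads.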

Additional Info

The tests have been updated to not rely on finalized payloads in the database, instead using the `MockExecutionLayer` to reconstruct them. Additionally, a check was added to `check_chain_dump` which asserts that each payload is present on disk or absent from it, depending on its slot.

beacon_node/store/src/hot_cold_store.rs

paulhauner

Great work, couldn't fault it (without making an error whilst trying to fault it 😅)

I was initially concerned that we might highlight bugs where we were wrongly assuming that finalized payloads are present (because we forgot to delete them). However I see that risk is greatly reduced since:

- We store pre-anchor blocks as blinded.
- In `HotColdDB::try_get_full_block` we don't even try to get the payload for blocks before the split slot.
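
The second point can be pictured with a small lookup function. This is a sketch under stated assumptions, not the code in `hot_cold_store.rs`: `ToyDb`, `ToyBlock`, `LookupError`, and `get_block` are hypothetical names, and roots and payloads are simplified to integers and byte vectors.

```rust
use std::collections::HashMap;

type Slot = u64;
type Root = u64; // hypothetical stand-in for a block root

/// What a lookup hands back: a block with its payload, or a blinded block.
#[derive(Debug, PartialEq)]
enum ToyBlock {
    Full { slot: Slot, payload: Vec<u8> },
    Blinded { slot: Slot },
}

#[derive(Debug, PartialEq)]
enum LookupError {
    /// A payload that should be on disk (slot >= split) was not found.
    MissingPayload(Root),
}

/// Toy database: blocks are stored blinded, payloads are stored separately.
struct ToyDb {
    split_slot: Slot,
    blinded_blocks: HashMap<Root, Slot>,
    payloads: HashMap<Root, Vec<u8>>,
}

impl ToyDb {
    /// Return the block for `root`. Blocks before the split slot are returned
    /// blinded without touching the payload table, so a pruned payload can
    /// never surface as an error on this path.
    fn get_block(&self, root: Root) -> Result<Option<ToyBlock>, LookupError> {
        let slot = match self.blinded_blocks.get(&root) {
            Some(slot) => *slot,
            None => return Ok(None), // unknown block
        };
        if slot < self.split_slot {
            return Ok(Some(ToyBlock::Blinded { slot }));
        }
        match self.payloads.get(&root) {
            Some(payload) => Ok(Some(ToyBlock::Full { slot, payload: payload.clone() })),
            None => Err(LookupError::MissingPayload(root)),
        }
    }
}
```

Because the finalized branch never consults the payload table, deleting finalized payloads cannot expose a hidden dependency on them in this lookup, which matches the reasoning in the review.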

Happy to merge!

paulhauner · 2022-09-16T03:41:45Z

bors r+

## Issue Addressed

Closes #3556

## Proposed Changes

Delete finalized execution payloads from the database in two places:

1. When running the finalization migration in `migrate_database`. We delete the finalized payloads between the last split point and the new updated split point. _If_ payloads are already pruned prior to this then this is sufficient to prune _all_ payloads as non-canonical payloads are already deleted by the head pruner, and all canonical payloads prior to the previous split will already have been pruned.
2. To address the fact that users will update to this code _after_ the merge on mainnet (and testnets), we need a one-off scan to delete the finalized payloads from the canonical chain. This is implemented in `try_prune_execution_payloads` which runs on startup and scans the chain back to the Bellatrix fork or the anchor slot (if checkpoint synced after Bellatrix). In the case where payloads are already pruned this check only imposes a single state load for the split state, which shouldn't be _too slow_. Even so, a flag `--prune-payloads-on-startup=false` is provided to turn this off after it has run the first time, which provides faster start-up times.

There is also a new `lighthouse db prune_payloads` subcommand for users who prefer to run the pruning manually.

## Additional Info

The tests have been updated to not rely on finalized payloads in the database, instead using the `MockExecutionLayer` to reconstruct them. Additionally a check was added to `check_chain_dump` which asserts the non-existence or existence of payloads on disk depending on their slot.

bors · 2022-09-16T03:53:16Z

Build failed (retrying...):

clippy

paulhauner · 2022-09-16T04:13:42Z

bors r-

bors · 2022-09-16T04:13:43Z

Canceled.

paulhauner · 2022-09-16T04:14:10Z

bors r+

bors · 2022-09-16T04:15:15Z

This PR was included in a batch that was canceled, it will be automatically retried

paulhauner · 2022-09-16T04:15:24Z

bors r-

bors · 2022-09-16T04:15:26Z

Canceled.

paulhauner · 2022-09-16T04:15:46Z

bors r+

bors · 2022-09-16T06:56:54Z

Build failed (retrying...):

release-tests-windows

bors · 2022-09-16T08:53:55Z

Build failed (retrying...):

release-tests-windows

bors · 2022-09-16T14:55:00Z

Build failed:

merge-transition-ubuntu

paulhauner · 2022-09-17T02:26:44Z

bors r+

bors · 2022-09-17T04:44:39Z

Pull request successfully merged into unstable.

Build succeeded:

## Proposed Changes

Improve the payload pruning feature in several ways:

- Payload pruning is now entirely optional. It is enabled by default but can be disabled with `--prune-payloads false`. The previous `--prune-payloads-on-startup` flag from #3565 is removed.
- Initial payload pruning on startup now runs in a background thread. This thread will always load the split state, which is a small fraction of its total work (up to ~300ms), and then backtrack from that state. This pruning process ran in 2m5s on one Prater node with good I/O and 16m on a node with slower I/O.
- To work with the optional payload pruning, the database function `try_get_full_block` will now attempt to load execution payloads for finalized slots _if_ pruning is currently disabled. This gives users an opt-out for the extensive traffic between the CL and EL for reconstructing payloads.

## Additional Info

If the `prune-payloads` flag is toggled on and off then the on-startup check may not see any payloads to delete and fail to clean them up. In this case the `lighthouse db prune_payloads` command should be used to force a manual sweep of the database.
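
As a rough illustration of running the initial prune off the startup path, the sketch below spawns a standard-library thread only when pruning is enabled. It is an assumption-laden toy: `Config`, `Payloads`, and `spawn_initial_pruning` are hypothetical names, payloads are keyed by slot rather than block root, and the real node's task management is not modelled.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};
use std::thread::{self, JoinHandle};

/// Hypothetical config mirroring the `--prune-payloads` flag.
struct Config {
    prune_payloads: bool,
}

/// Toy payload table shared between threads, keyed by slot for simplicity.
type Payloads = Arc<Mutex<HashMap<u64, Vec<u8>>>>;

/// Start the initial prune in the background so startup is not blocked.
/// Returns `None` when pruning is disabled, leaving finalized payloads on disk.
fn spawn_initial_pruning(
    config: &Config,
    payloads: Payloads,
    split_slot: u64,
) -> Option<JoinHandle<usize>> {
    if !config.prune_payloads {
        return None;
    }
    Some(thread::spawn(move || {
        let mut table = payloads.lock().expect("payload table lock poisoned");
        let before = table.len();
        // Keep only payloads at or after the split slot.
        table.retain(|slot, _| *slot >= split_slot);
        before - table.len() // number of payloads deleted
    }))
}
```

The caller can carry on with startup and join the handle later, or not at all. With pruning disabled nothing is spawned, so payloads for finalized slots remain available to be loaded from disk.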

michaelsproul added 5 commits September 8, 2022 16:25

Delete finalized exec payloads while running

69d5474

Implement on-demand pruning operation

d5adc2e

Implement DB manager command

2289b20

Fix and update beacon chain tests

de775d6

Add flag to disable prune on startup

b28e8d0

michaelsproul added ready-for-review The code is ready for review v3.1.2 Release after v3.1.0 (formerly v3.1.1) labels Sep 12, 2022

michaelsproul commented Sep 12, 2022

beacon_node/store/src/hot_cold_store.rs (resolved)

Clippy

a4960eb

paulhauner self-requested a review September 13, 2022 05:41

paulhauner approved these changes Sep 14, 2022

paulhauner added ready-for-merge This PR is ready to merge. and removed ready-for-review The code is ready for review labels Sep 15, 2022

bors bot changed the title ~~Prune finalized execution payloads~~ [Merged by Bors] - Prune finalized execution payloads Sep 17, 2022

bors bot closed this Sep 17, 2022

michaelsproul deleted the delete-exec-payloads branch September 17, 2022 04:48

michaelsproul mentioned this pull request Sep 19, 2022

[Merged by Bors] - Refined payload pruning #3587

Closed
