blockdb: bound max deleted blocks per blockdb sync #5910

algorandskiy · 2024-01-19T20:59:19Z

Summary

If a node switches from Archival to non-archival then the blockdb cleanup can take a long time blocking the blockdb in a write tx. This PR addresses this by scoping max blocks deleted per sync op.

Earliest                       MinToSave         Latest

   |                               |                 |
   +-------------------------------+-----------------+
   ^            ^                  ^                 ^
   |            |                  +-----------------+
   +------------+
   | maxDeletionBatchSize          ^     MaxBlockHistoryLookback
   +-------------------------------+

              Deletion Range (Old)

Test Plan

Added a unit test.
Additionally fixed a data race seen in TestVotersReloadFromDiskAfterOneStateProofCommitted.

ledger/blockqueue.go

codecov · 2024-01-19T23:21:04Z

Codecov Report

Attention: 2 lines in your changes are missing coverage. Please review.

Comparison is base (877090b) 55.14% compared to head (0cb72c6) 55.97%.

Files	Patch %	Lines
ledger/blockqueue.go	80.00%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5910      +/-   ##
==========================================
+ Coverage   55.14%   55.97%   +0.83%     
==========================================
  Files         478      478              
  Lines       67602    67612      +10     
==========================================
+ Hits        37280    37848     +568     
+ Misses      27757    27201     -556     
+ Partials     2565     2563       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

onetechnical · 2024-01-22T21:37:23Z

From my testing, this looks good. algod remains available, and it sheds 10,000 rounds at a time, though it takes about 26 seconds per batch. This means that for going archival -> non-archival it would take about a day to completely cut through 35 million rounds. I think this is acceptable, since it remains responsive in the mean time.

jannotti · 2024-01-22T21:45:41Z

it takes about 26 seconds per batch. This means that for going archival -> non-archival it would take about a day to completely cut through 35 million rounds. I think this is acceptable, since it remains responsive in the mean time.

I think this means that since a sync takes 26 seconds or so, we will only be calling sync every 8 rounds or so. If that's the case, I don't see any point in even going as high as 10,000. If we do 1,000 I guess we might be syncing fast enough to sync almost every round. And 8 rounds or so of 1,000 deletions is nearly as good as doing 10,000 rounds every 26 seconds. It'll still clean up in about a day.

Keep in mind, I may be misunderstanding how sync works. I had thought we wrote a block every round.

algorandskiy · 2024-01-23T20:05:28Z

My intuition is bigger batches should take a bit less overall time than smaller batches because of the index update. Not sure tho if there any optimizations in sqlite for sequential removal from indices.

algorandskiy added the Enhancement label Jan 19, 2024

algorandskiy requested review from jannotti, onetechnical and gmalouf January 19, 2024 20:59

algorandskiy self-assigned this Jan 19, 2024

jannotti reviewed Jan 19, 2024

View reviewed changes

ledger/blockqueue.go Outdated Show resolved Hide resolved

gmalouf reviewed Jan 19, 2024

View reviewed changes

ledger/blockqueue.go Outdated Show resolved Hide resolved

WIP: blockdb: bound max deleted blocks per blockdb sync

11db893

algorandskiy force-pushed the pavel/batch-block-deletion branch from 19082af to 11db893 Compare January 19, 2024 21:37

algorandskiy added 2 commits January 19, 2024 18:01

add blockq syncer test

153ecee

fix TestVotersReloadFromDiskAfterOneStateProofCommitted race cond

0cb72c6

algorandskiy marked this pull request as ready for review January 19, 2024 23:03

algorandskiy changed the title ~~WIP: blockdb: bound max deleted blocks per blockdb sync~~ blockdb: bound max deleted blocks per blockdb sync Jan 22, 2024

algorandskiy requested review from jannotti and gmalouf January 22, 2024 15:38

gmalouf approved these changes Jan 23, 2024

View reviewed changes

jannotti approved these changes Jan 23, 2024

View reviewed changes

gmalouf merged commit 3490731 into algorand:master Jan 23, 2024
12 checks passed

Algo-devops-service mentioned this pull request Jan 23, 2024

go-algorand 3.22.0-beta Release PR #5919

Merged

Algo-devops-service mentioned this pull request Jan 31, 2024

go-algorand 3.22.0-stable Release PR #5925

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

blockdb: bound max deleted blocks per blockdb sync #5910

blockdb: bound max deleted blocks per blockdb sync #5910

algorandskiy commented Jan 19, 2024 •

edited

Loading

codecov bot commented Jan 19, 2024 •

edited

Loading

onetechnical commented Jan 22, 2024

jannotti commented Jan 22, 2024

algorandskiy commented Jan 23, 2024

blockdb: bound max deleted blocks per blockdb sync #5910

blockdb: bound max deleted blocks per blockdb sync #5910

Conversation

algorandskiy commented Jan 19, 2024 • edited Loading

Summary

Test Plan

codecov bot commented Jan 19, 2024 • edited Loading

Codecov Report

onetechnical commented Jan 22, 2024

jannotti commented Jan 22, 2024

algorandskiy commented Jan 23, 2024

algorandskiy commented Jan 19, 2024 •

edited

Loading

codecov bot commented Jan 19, 2024 •

edited

Loading