feat(pending): pending block sealed instead of re-added to mempool #468

Trantorian1 · 2025-01-16T16:59:22Z

Important

Most of the changes in this pr are just me adding some tests. Actual logic changes are pretty small.

Pull Request type

Feature

What is the current behavior?

Pending block transactions are re-added back to the mempool and re-executed upon startup. This happens in case of graceful or ungraceful shutdown in the middle of block production: the pending block will be saved to db to avoid loss of state.

This leads to several issues:

Transactions are re-executed, and might provided different outputs or hashes.
Transactions from the pending block are treated like any other and could be dropped by the mempool under certain edge cases.

Resolves: #458

What is the new behavior?

Pending block, alongside the pending state diff, visited segments and declared classes are retrieved from db and closed into a new finalized block.

This means that if a sequencer node is shutdown mid-production, the state contained in the pending block will automatically be persisted upon startup as it own finalized block without re-executing transactions. This also guarantees that any changes to the config will not be applied to transactions executed under different parameters (these will only affect the next block).

Does this introduce a breaking change?

No.

Mohiiit

Great work! The test cases are really clean and well-structured. I have a couple of thoughts:

Should we consider adding E2E tests for this? For example, we could simulate spawning Madara, shutting it down mid-process, and then restarting it to verify that everything works as expected.
I found the stages a bit confusing. In most test cases, when we store the pending block in stage 2, it’s mentioned that this simulates stopping and restarting the node. However, isn’t that more related to the block production task? Maybe I’m misunderstanding, but this part could use some clarification.

crates/madara/client/block_production/src/lib.rs

Trantorian1 · 2025-01-21T08:10:05Z

I found the stages a bit confusing. In most test cases, when we store the pending block in stage 2, it’s mentioned that this simulates stopping and restarting the node. However, isn’t that more related to the block production task? Maybe I’m misunderstanding, but this part could use some clarification.

So, the block production functions by storing un-finalized blocks as pending. This is the only form of data we can recover without re-execution as everything else is stored in RAM (mempool transactions which have not yet been polled yet are also stored in db for retrieval, but these haven't been executed anyways). This means that if ever the node crashes, we will only be able to retrieve whatever data was stored in the pending block. This is done atomically so we never commit partial data to the database and only a full pending block can ever be stored.

We are therefore "simulate[ing] stopping and restarting the node", since:

This is the only pending data that can persist a node restart, and it cannot be partially valid (we still test failing cases though).
Upon restart, this is what the block production would be looking to seal.

I will be adding some if this in comments to the tests.

Trantorian1 added feature Request for new feature or enhancement sequencer Related to the sequencing logic and implementation labels Jan 16, 2025

Trantorian1 requested review from notlesh, cchudant, Mohiiit and jbcaron January 16, 2025 16:59

Trantorian1 self-assigned this Jan 16, 2025

Trantorian1 added 2 commits January 17, 2025 15:52

feat(pending): pending block sealed instead of re-added to mempool

be71866

test(block_prod): added tests and fixed some edge cases

74f48fc

Trantorian1 force-pushed the feat/avoid_pending_re_exec branch from 05fd24e to 74f48fc Compare January 17, 2025 14:54

refactor(mempool): removed methods for re-adding pending

0154d02

Trantorian1 marked this pull request as ready for review January 17, 2025 15:07

Trantorian1 added 2 commits January 17, 2025 16:37

test(devnet): removed duplicate test

8304623

fix(clippy)

9bd7cd5

Mohiiit reviewed Jan 20, 2025

View reviewed changes

fix(comments)

5c8e0c2

Mohiiit approved these changes Jan 21, 2025

View reviewed changes

jbcaron approved these changes Jan 21, 2025

View reviewed changes

Trantorian1 added 2 commits January 21, 2025 18:01

fix(ci): unblocking ci

d814015

Merge branch 'main' into feat/avoid_pending_re_exec

95c0e29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(pending): pending block sealed instead of re-added to mempool #468

feat(pending): pending block sealed instead of re-added to mempool #468

Trantorian1 commented Jan 16, 2025 •

edited

Loading

Mohiiit left a comment

Trantorian1 commented Jan 21, 2025 •

edited

Loading

feat(pending): pending block sealed instead of re-added to mempool #468

Are you sure you want to change the base?

feat(pending): pending block sealed instead of re-added to mempool #468

Conversation

Trantorian1 commented Jan 16, 2025 • edited Loading

Pull Request type

What is the current behavior?

What is the new behavior?

Does this introduce a breaking change?

Mohiiit left a comment

Choose a reason for hiding this comment

Trantorian1 commented Jan 21, 2025 • edited Loading

Trantorian1 commented Jan 16, 2025 •

edited

Loading

Trantorian1 commented Jan 21, 2025 •

edited

Loading