Original source: topic-wal-commit-semantics

Topic: The replay method ignores COMMIT markers — investigate whether this is intentional simplification or a bug, and what the implications are for batch atomicity guarantees

Date: 2026-05-28

Time: 18:17

Working from the observations provided: the replay method starts at line 212, and the key comments are visible in the grep results. The analysis follows.

Topics to Explore

[function] write-ahead-log/wal.py:iterate — The raw record iterator that *does* preserve COMMIT markers; understanding the contrast with replay() clarifies the design intent
[function] write-ahead-log/wal.py:append_batch — How the single-write atomicity trick works and where it breaks down for large batches
[function] write-ahead-log/wal.py:truncate — Whether truncation respects batch boundaries or can split a committed batch
[file] log-structured-merge-tree/lsm.py — Contains the other replay method (line 28); worth checking if the LSM tree consumes WAL replay and whether it would be affected by partial batches
[general] wal-crash-recovery-correctness — Building a correct replay that groups records by batch and checks for COMMIT before applying, using iterate() as the foundation
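
A minimal sketch of the commit-aware replay named in the last topic above. It buffers PUT/DELETE records and releases them only once a COMMIT arrives. The `Record` dataclass and the `replay_committed` name are hypothetical, and the sketch assumes that every write belongs to a batch closed by a COMMIT marker and that the raw iterator yields records in log order. The actual record layout in wal.py may differ.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Optional


@dataclass(frozen=True)
class Record:
    """Hypothetical record shape; the real wal.py layout may differ."""
    op: str                        # "PUT", "DELETE", "COMMIT" or "CHECKPOINT"
    key: Optional[bytes] = None
    value: Optional[bytes] = None


def replay_committed(records: Iterable[Record]) -> Iterator[Record]:
    """Yield PUT/DELETE records only from batches that end with a COMMIT."""
    pending: List[Record] = []
    for rec in records:
        if rec.op in ("PUT", "DELETE"):
            # Hold the operation until we know its batch was committed.
            pending.append(rec)
        elif rec.op == "COMMIT":
            # The batch is complete: release everything buffered so far.
            yield from pending
            pending.clear()
        # CHECKPOINT markers are skipped here (an assumption of this sketch).
    # Whatever is still pending is an uncommitted tail and is dropped.


# In real use the input would be the raw record stream from iterate().
log = [
    Record("PUT", b"a", b"1"),
    Record("PUT", b"b", b"2"),
    Record("COMMIT"),
    Record("PUT", b"c", b"3"),     # crash happened before this batch's COMMIT
]
print([r.key for r in replay_committed(log)])  # [b'a', b'b']
```

Buffering a whole batch in memory is the simplest choice; for very large batches a two-pass approach (find the last COMMIT offset first, then replay up to it) would avoid the buffer, at the cost of reading the log twice.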

Beliefs

wal-replay-filters-commit-checkpoint — replay() returns only PUT/DELETE records; COMMIT and CHECKPOINT markers are intentionally excluded and never reach the caller
wal-iterate-preserves-all-ops — iterate() yields every record including COMMIT and CHECKPOINT, providing the raw stream needed for commit-aware recovery logic
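wal-batch-single-write — append_batch() serializes all ops plus the COMMIT marker into one bytearray and issues a single fd.write(); this relies on the write landing on disk whole, which is plausible for small batches but is not guaranteed across a crash, so a torn write can leave the ops without their COMMIT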
wal-replay-no-atomicity-check — replay() does not verify that batch operations have a corresponding COMMIT record, so partial batches from a crash would be replayed as if committed
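
To make the interaction between wal-batch-single-write and wal-replay-no-atomicity-check concrete, here is a small sketch of the failure mode. The framing (1-byte opcode, two 4-byte lengths, then key and value) and the names `encode_batch` and `decode` are assumptions made for illustration, not the format used by wal.py. The batch is built in one buffer, as append_batch() is described as doing, and a crash is simulated by cutting off the last few bytes.

```python
import struct
from typing import List, Tuple

# Hypothetical on-disk framing, chosen only for this illustration.
OPCODES = {"PUT": 1, "DELETE": 2, "COMMIT": 3}
NAMES = {code: name for name, code in OPCODES.items()}
HEADER = struct.Struct("<BII")     # opcode, key length, value length

Op = Tuple[str, bytes, bytes]


def encode_batch(ops: List[Op]) -> bytes:
    """Serialize all ops plus a trailing COMMIT into one buffer."""
    buf = bytearray()
    for op, key, value in ops:
        buf += HEADER.pack(OPCODES[op], len(key), len(value)) + key + value
    buf += HEADER.pack(OPCODES["COMMIT"], 0, 0)
    return bytes(buf)              # a real WAL would pass this to one write call


def decode(data: bytes) -> List[Op]:
    """Decode every complete record; stop at the first torn one."""
    out: List[Op] = []
    pos = 0
    while pos + HEADER.size <= len(data):
        code, klen, vlen = HEADER.unpack_from(data, pos)
        start = pos + HEADER.size
        end = start + klen + vlen
        if end > len(data):
            break                  # payload was cut off mid-record
        out.append((NAMES[code], data[start:start + klen], data[start + klen:end]))
        pos = end
    return out


batch = encode_batch([("PUT", b"k1", b"v1"), ("PUT", b"k2", b"v2")])
torn = batch[:-4]                  # simulate a crash that tore the COMMIT record

print([op for op, _, _ in decode(batch)])  # ['PUT', 'PUT', 'COMMIT']
print([op for op, _, _ in decode(torn)])   # ['PUT', 'PUT']
```

A replay that only filters by op type would hand both PUTs from the torn log to the caller, which is exactly the partial-batch case the last belief describes.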