bitcoin-bitcoin-core

mirror of https://github.com/bitcoin/bitcoin.git synced 2025-02-25 12:51:55 -05:00

Author	SHA1	Message	Date
TheCharlatan	9be330b654	[refactor] Define MessageStartChars as std::array	2023-09-12 22:49:49 +02:00
fanquake	ecab855838	Merge bitcoin/bitcoin#28195 : blockstorage: Drop legacy -txindex check `fae405556d` scripted-diff: Rename CBlockTreeDB -> BlockTreeDB (MarcoFalke) `faf63039cc` Fixup style of moved code (MarcoFalke) `fa65111b99` move-only: Move CBlockTreeDB to node/blockstorage (MarcoFalke) `fa8685597e` index: Drop legacy -txindex check (MarcoFalke) `fa69148a0a` scripted-diff: Use blocks_path where possible (MarcoFalke) Pull request description: The only reason for the check was to print a warning about an increase in storage use. Now that 22.x is EOL and everyone should have migrated (or decided to not care about storage use), remove the check. Also, a move-only commit is included. (Rebased from https://github.com/bitcoin/bitcoin/pull/22242) ACKs for top commit: TheCharlatan: ACK `fae405556d`, though I lack historical context to really judge the second commit `fa8685597e`. stickies-v: ACK `fae405556d` Tree-SHA512: 9da8f48767ae52d8e8e21c09a40c949cc0838794f1856cc5f58a91acd3f00a3bca818c8082242b3fdc9ca5badb09059570bb3870850d3807b75a8e23b5222da1	2023-09-05 11:37:35 +01:00
fanquake	be44332803	Merge bitcoin/bitcoin#28191 : refactor: Remove unused MessageStartChars parameters from BlockManager methods `fa69e3a95c` Remove unused MessageStartChars parameters from BlockManager methods (MarcoFalke) Pull request description: Seems odd to expose these for mocking, when it is not needed. Fix this by removing the the unused parameters and use the already existing member field instead. ACKs for top commit: Empact: utACK `fa69e3a95c` dergoegge: utACK `fa69e3a95c` Tree-SHA512: 7814e9560abba8d9c0926bcffc70f92e502d22f543af43671248f6fcd1433f35238553c0f05123fde6d8e0f80261af0ab0500927548115153bd68d57fe2da746	2023-08-07 10:57:39 +02:00
MarcoFalke	fae405556d	scripted-diff: Rename CBlockTreeDB -> BlockTreeDB -BEGIN VERIFY SCRIPT- sed -i 's\|CBlockTreeDB\|BlockTreeDB\|g' $( git grep -l CBlockTreeDB ) -END VERIFY SCRIPT-	2023-08-02 07:49:32 +02:00
MarcoFalke	fa65111b99	move-only: Move CBlockTreeDB to node/blockstorage The block index (CBlockTreeDB) is required to write and read blocks, so move it to blockstorage. This allows to drop the txdb.h include from `node/blockstorage.h`. Can be reviewed with: --color-moved=dimmed-zebra --color-moved-ws=ignore-all-space	2023-08-01 15:27:33 +02:00
MarcoFalke	fa69e3a95c	Remove unused MessageStartChars parameters from BlockManager methods	2023-07-31 14:32:57 +02:00
Suhas Daftuar	1cfc887d00	Remove CChain dependency in node/blockstorage	2023-07-14 14:54:57 -04:00
Suhas Daftuar	fe86a7cd48	Explicitly track maximum block height stored in undo files When writing a new block to disk, if we have filled up the current block file, then we flush and truncate that block file (to free allocated but unused space) before advancing to the next one. When this happens, we have to determine whether to also flush and truncate the corresponding undo file. Undo data is only written when blocks are connected, not when blocks are received. Thus it's possible that the corresponding undo file already has all the data it will ever have, and we should flush/truncate it as we advance files; or it's possible that there is more data we expect to write, and should therefore defer flush/truncation until undo data is later written. Prior to this commit, we made the determination of whether the undo file was full of all requisite data by comparing against the chain tip. This patch replaces that dependence on validation data structures by instead just tracking the highest height of any block written in the undo file as we go.	2023-07-14 14:47:00 -04:00
TheCharlatan	462390c85f	refactor: Move stopafterblockimport handling out of blockstorage This has the benefit of moving the StartShutdown call out of the blockstorage file and thus out of the kernel's responsibility. The user can now decide if he wants to start shutdown / interrupt after a block import or not.	2023-07-11 12:00:57 +02:00
furszy	ca91c244ef	index: verify blocks data existence only once At present, during init, we traverse the chain (once per index) to confirm that all necessary blocks to sync each index up to the current tip are present. To make the process more efficient, we can fetch the oldest block from the indexers and perform the chain data existence check from that point only once. This also moves the pruning violation check to the end of the 'loadinit' thread, which is where the reindex, block loading and chain activation processes happen. Making the node's startup process faster, allowing us to remove the global g_indexes_ready_to_sync flag, and enabling the execution of the pruning violation verification even when the reindex or reindex-chainstate flags are enabled (which has being skipped so far).	2023-07-10 10:50:50 -03:00
furszy	2ec89f1970	refactor: simplify pruning violation check By generalizing 'GetFirstStoredBlock' and implementing 'CheckBlockDataAvailability' we can dedup code and avoid repeating work when multiple indexes are enabled. E.g. get the oldest block across all indexes and perform the pruning violation check from that point up to the tip only once (this feature is being introduced in a follow-up commit). This commit shouldn't change behavior in any way. Co-authored-by: Ryan Ofsky <ryan@ofsky.org>	2023-07-10 10:50:50 -03:00
furszy	c82ef91eae	make GetFirstStoredBlock assert that 'start_block' always has data And transfer the responsibility of verifying whether 'start_block' has data or not to the caller. This is because the 'GetFirstStoredBlock' function responsibility is to return the first block containing data. And the current implementation can return 'start_block' when it has no data!. Which is misleading at least. Edge case behavior change: Previously, if the block tip lacked data but all preceding blocks contained data, there was no prune violation. And now, such scenario will result in a prune violation.	2023-07-10 10:47:17 -03:00
furszy	04575106b2	scripted-diff: rename 'loadblk' thread name to 'initload' The thread does not only load blocks, it loads the mempool and, in a future commit, will start the indexes as well. Also, renamed the 'ThreadImport' function to 'ImportBlocks' And the 'm_load_block' class member to 'm_thread_load'. -BEGIN VERIFY SCRIPT- sed -i "s/ThreadImport/ImportBlocks/g" $(git grep -l ThreadImport -- ':!/doc/') sed -i "s/loadblk/initload/g" $(git grep -l loadblk -- ':!/doc/release-notes/') sed -i "s/m_load_block/m_thread_load/g" $(git grep -l m_load_block) -END VERIFY SCRIPT-	2023-07-07 19:31:27 -03:00
furszy	ed4462cc78	init: start indexes sync earlier The mempool load can take a while, and it is not needed for the indexes' synchronization. Also, having the mempool load function call inside 'blockstorage.cpp' wasn't structurally correct.	2023-07-07 19:31:26 -03:00
TheCharlatan	edb55e2777	kernel: Pass interrupt reference to chainman This and the following commit seek to decouple the libbitcoinkernel library from the shutdown code. As a library, it should it should have its own flexible interrupt infrastructure without relying on node-wide globals. The commit takes the first step towards this goal by de-globalising `ShutdownRequested` calls in kernel code. Co-authored-by: Russell Yanofsky <russ@yanofsky.org> Co-authored-by: TheCharlatan <seb.kung@gmail.com>	2023-06-28 09:52:27 +02:00
Martin Zumsande	97844d9268	index: Enable reindex-chainstate with active indexes This is achieved by letting the index sync thread wait until reindex-chainstate is finished. This also disables the pruning check when reindexing the chainstate (which is incompatible with prune mode) because there would be no chain at this point in init.	2023-05-17 11:14:28 -04:00
TheCharlatan	5ff63a09a9	refactor, blockstorage: Replace stopafterblockimport arg Add a stop_after_block_import field to the BlockManager options. Use this field instead of the global gArgs. This should allow users of the BlockManager to not rely on the global Args.	2023-05-10 19:07:46 +02:00
TheCharlatan	f0bb1021f0	refactor: Move functions to BlockManager methods This is a commit in preparation for the next few commits. The functions are moved to methods to avoid their re-declaration for the purpose of passing in BlockManager options. The functions that were now moved into the BlockManager should no longer use the params as an argument, but instead use the member variable. In the moved ReadBlockFromDisk and UndoReadFromDisk, change the function signature to accept a reference to a CBlockIndex instead of a raw pointer. The pointer is expected to be non-null, so reflect that in the type. To allow for the move of functions to BlockManager methods all call sites require an instantiated BlockManager, or a callback to one.	2023-05-10 19:06:53 +02:00
MarcoFalke	fa5d7c39eb	Remove unused chainparams from BlockManager methods Also, replace pointer with reference while touching the signature.	2023-05-04 19:27:23 +02:00
MarcoFalke	facdb8b331	Add BlockManagerOpts::chainparams reference and use it in blockstorage.cpp	2023-05-04 19:26:43 +02:00
TheCharlatan	00e9b97f37	refactor: Move fs.* to util/fs.* The fs.* files are already part of the libbitcoin_util library. With the introduction of the fs_helpers.* it makes sense to move fs.* into the util/ directory as well.	2023-03-23 12:55:18 +01:00
MarcoFalke	fadf8b8182	refactor: Add and use PRUNE_TARGET_MANUAL constexpr	2023-03-15 16:02:47 +01:00
MarcoFalke	fa9bd7be47	Move ::fImporting to BlockManager	2023-03-15 15:48:44 +01:00
MarcoFalke	fa177d7b6b	Move ::fPruneMode into BlockManager	2023-03-15 15:47:42 +01:00
MarcoFalke	fa721f1cab	Move ::nPruneTarget into BlockManager	2023-03-15 15:33:12 +01:00
Andrew Chow	bb136aaf2c	Merge bitcoin/bitcoin#26533 : prune: scan and unlink already pruned block files on startup `3141eab9c6` test: add functional test for ScanAndUnlinkAlreadyPrunedFiles (Andrew Toth) `e252909e56` test: add unit test for ScanAndUnlinkAlreadyPrunedFiles (Andrew Toth) `77557dda4a` prune: scan and unlink already pruned block files on startup (Andrew Toth) Pull request description: There are a few cases where we can mark a block and undo file as pruned in our block index, but not actually remove the files from disk. 1. If we call `FindFilesToPrune` or `FindFilesToPruneManual` and crash before `UnlinkPrunedFiles`. 2. If on Windows there is an open file handle to the file somewhere else when calling `fs::remove` in `UnlinkPrunedFiles` (https://en.cppreference.com/w/cpp/filesystem/remove, https://learn.microsoft.com/en-us/windows/win32/api/fileapi/nf-fileapi-deletefilew#remarks). This could be from another process, or if we are calling `ReadBlockFromDisk`/`ReadRawBlockFromDisk` without having a lock on `cs_main` (which has been allowed since `ccd8ef65f9`). This PR mitigates this by scanning all pruned block files on startup after `LoadBlockIndexDB` and unlinking them again. ACKs for top commit: achow101: ACK `3141eab9c6` pablomartin4btc: re-ACK with added functional test `3141eab9c6`. furszy: Code review ACK `3141eab9` theStack: Code-review ACK `3141eab9c6` Tree-SHA512: 6c73bc57838ad1b7e5d441af3c4d6bf4c61c4382e2b86485e57fbb74a61240710c0ceeceb8b4834e610ecfa3175c6955c81ea4b2285fee11ca6383f472979d8d	2023-02-28 09:54:10 -05:00
MarcoFalke	faf7b4f1fc	Add BlockManager::IsPruneMode()	2023-01-16 17:31:32 +01:00
MarcoFalke	fae71fe27e	Add BlockManager::GetPruneTarget()	2023-01-16 17:16:30 +01:00
MarcoFalke	fa0f0436d8	Add BlockManager::LoadingBlocks()	2023-01-16 16:38:11 +01:00
fanquake	282019cd3d	refactor: add kernel/cs_main.* Co-authored-by: Anthony Towns <aj@erisian.com.au>	2023-01-05 09:05:14 +00:00
Hennadii Stepanov	306ccd4927	scripted-diff: Bump copyright headers -BEGIN VERIFY SCRIPT- ./contrib/devtools/copyright_header.py update ./ -END VERIFY SCRIPT- Commits of previous years: - 2021: `f47dda2c58` - 2020: `fa0074e2d8` - 2019: `aaaaad6ac9`	2022-12-24 23:49:50 +00:00
Andrew Toth	77557dda4a	prune: scan and unlink already pruned block files on startup	2022-12-20 12:25:36 -05:00
glozow	cc12b8947b	Merge bitcoin/bitcoin#24858 : incorrect blk file size calculation during reindex results in recoverable blk file corruption `bcb0cacac2` reindex, log, test: fixes #21379 (mruddy) Pull request description: Fixes #21379. The blocks/blk?????.dat files are mutated and become increasingly malformed, or corrupt, as a result of running the re-indexing process. The mutations occur after the re-indexing process has finished, as new blocks are appended, but are a result of a re-indexing process miscalculation that lingers in the block manager's `m_blockfile_info` `nSize` data until node restart. These additions to the blk files are non-fatal, but also not desirable. That is, this is a form of data corruption that the reading code is lenient enough to process (it skips the extra bytes), but it adds some scary looking log messages as it encounters them. The summary of the problem is that the re-index process double counts the size of the serialization header (magic message start bytes [4 bytes] + length [4 bytes] = 8 bytes) while calculating the blk data file size (both values already account for the serialization header's size, hence why it is over accounted). This bug manifests itself in a few different ways, after re-indexing, when a new block from a peer is processed: 1. If the new block will not fit into the last blk file processed while re-indexing, while remaining under the 128MiB limit, then the blk file is flushed to disk and truncated to a size that is 8 greater than it should be. The truncation adds zero bytes (see `FlatFileSeq::Flush` and `TruncateFile`). 1. If the last blk file processed while re-indexing has logical space for the new block under the 128 MiB limit: 1. If the blk file was not already large enough to hold the new block, then the zeros are, in effect, added by `fseek` when the file is opened for writing. Eight zero bytes are added to the end of the last blk file just before the new block is written. This happens because the write offset is 8 too great due to the miscalculation. The result is 8 zero bytes between the end of the last block and the beginning of the next block's magic + length + block. 1. If the blk file was already large enough to hold the new block, then the current existing file contents remain in the 8 byte gap between the end of the last block and the beginning of the next block's magic + length + block. Commonly, when this occcurs, it is due to the blk file containing blocks that are not connected to the block tree during reindex and are thus left behind by the reindex process and later overwritten when new blocks are added. The orphaned blocks can be valid blocks, but due to the nature of concurrent block download, the parent may not have been retrieved and written by the time the node was previously shutdown. ACKs for top commit: LarryRuane: tested code-review ACK `bcb0cacac2` ryanofsky: Code review ACK `bcb0cacac2`. This is a disturbing bug with an easy fix which seems well-worth merging. mzumsande: ACK `bcb0cacac2` (reviewed code and did some testing, I agree that it fixes the bug). w0xlt: tACK `bcb0cacac2` Tree-SHA512: acc97927ea712916506772550451136b0f1e5404e92df24cc05e405bb09eb6fe7c3011af3dd34a7723c3db17fda657ae85fa314387e43833791e9169c0febe51	2022-10-12 14:13:54 -04:00
James O'Beirne	00eeb31c76	scripted-diff: rename CChainState -> Chainstate -BEGIN VERIFY SCRIPT- sed -i 's/CChainState/Chainstate/g' $(git grep -l CChainState ':(exclude)doc/release-notes*') -END VERIFY SCRIPT- Co-authored-by: MacroFake <falke.marco@gmail.com>	2022-09-09 11:47:27 -04:00
Carl Dong	06b88ffb8a	LoadMempool: Pass in load_path, stop using gArgs Also: 1. Have CChainState::LoadMempool and ::ThreadImport take in paths and pass it through untouched to LoadMempool. 2. Make LoadMempool exit early if the load_path is empty. 3. Adjust the call to ::ThreadImport in ::AppInitMain to correctly pass in an empty path if mempool persistence is disabled.	2022-07-15 12:26:20 -04:00
darosior	64f81a38b9	doc: Correct nPruneTarget misidentifying units of variable	2022-06-07 15:30:16 -05:00
Cory Fields	a4741bd8d4	kernel: pass params to BlockManager rather than using a global	2022-06-02 15:18:09 +00:00
mruddy	bcb0cacac2	reindex, log, test: fixes #21379 This fixes a blk file size calculation made during reindex that results in increased blk file malformity. The fix is to avoid double counting the size of the serialization header during reindex. This adds a unit test to reproduce the bug before the fix and to ensure that it does not recur. These changes include a log message change also so as to not be as alarming. This is a common and recoverable data corruption. These messages can now be filtered by the debug log reindex category.	2022-05-07 07:11:29 -04:00
Jon Atack	4cb9d21434	blockstorage: add LIFETIMEBOUND to GetFirstStoredBlock()::start_time See PR 22278 for discussion. Co-authored-by: MarcoFalke <falke.marco@gmail.com>	2022-05-03 22:20:31 +02:00
Jon Atack	86ce844d3b	blockstorage, refactor: pass GetFirstStoredBlock() start_block by reference instead of by pointer, so as to not accept a nullptr.	2022-04-28 20:42:08 +02:00
Jon Atack	ed12c0a49d	blockstorage, refactor: make GetFirstStoredBlock() a member of BlockManager instead of a global	2022-04-28 20:42:08 +02:00
Carl Dong	7ab07e0332	validation: Prune UnloadBlockIndex and callees In previous commits in this patchset, we've made sure that every Unload/UnloadBlockIndex member function resets its own members, and does not reach out to globals. This means that their corresponding classes' default destructors can now replace them, and do an even more thorough job without the need to be updated for every new member variable. Therefore, we can remove them, and also remove UnloadBlockIndex since that's not used anymore. Unfortunately, chainstatemanager_loadblockindex relies on CChainState::UnloadBlockIndex, so that needs to stay for now.	2022-04-27 11:13:38 -04:00
fanquake	bd616bc16a	Merge bitcoin/bitcoin#24917 : Make BlockManager::LoadBlockIndex private `fa1970f075` Make BlockManager::LoadBlockIndex private (MarcoFalke) Pull request description: * After commit `fa27f03b49` `BlockManager::LoadBlockIndex` is only called by `BlockManager::LoadBlockIndexDB`. Thus, it can be made `private`. * After commit `c600ee3816` `m_best_invalid` is no longer accessed by `BlockManager::LoadBlockIndex`. Thus, the unused `friend` can be removed. ACKs for top commit: mruddy: ACK `fa1970f075` I verified by double checking references, then applying the patch, and running `make check`. LGTM. Tree-SHA512: 9b36b4c59bf7ad01171764ce61b1be9750fc92d105c4fe939b1a6a70027ab6300d5d2a2fc3e82f981e22c3987f2ca84e092d2e1f8463fa320af9f05048580c0a	2022-04-26 20:20:07 +01:00
Fabian Jahr	2561823531	blockstorage: Add prune locks to BlockManager This change also introduces an aditional buffer of 10 blocks (PRUNE_LOCK_BUFFER) that will not be pruned before the best block. Co-authored-by: Luke Dashjr <luke-jr+git@utopios.org>	2022-04-25 23:21:58 +02:00
Fabian Jahr	231fc7b035	refactor: Introduce GetFirstStoredBlock helper function	2022-04-25 23:18:01 +02:00
Carl Dong	f0a2fb3c5d	scripted-diff: Rename pindexBestHeader, fHavePruned ...to m_best_header and m_have_pruned -BEGIN VERIFY SCRIPT- find_regex="\bpindexBestHeader\b" \ && git grep -l -E "$find_regex" -- src \ \| xargs sed -i -E "s@$find_regex@m_best_header@g" find_regex="\bfHavePruned\b" \ && git grep -l -E "$find_regex" -- src \ \| xargs sed -i -E "s@$find_regex@m_have_pruned@g" -END VERIFY SCRIPT-	2022-04-19 14:36:18 -04:00
Carl Dong	3308ecd3fc	move-mostly: Make fHavePruned a BlockMan member [META] In the next commit, we move the clearing of fHavePruned to BlockManager::Unload()	2022-04-19 14:34:56 -04:00
Carl Dong	0d567daf23	move-mostly: Make pindexBestHeader a ChainMan member [META] In the next commit, we move the clearing of pindexBestHeader to ChainstateManager::Unload()	2022-04-19 14:34:55 -04:00
MarcoFalke	fa1970f075	Make BlockManager::LoadBlockIndex private	2022-04-19 11:32:49 +02:00
Carl Dong	f865cf8ded	Add and use BlockManager::GetAllBlockIndices	2022-03-15 19:42:43 -04:00

1 2

73 commits