bitcoin-bitcoin-core

mirror of https://github.com/bitcoin/bitcoin.git synced 2025-02-24 12:41:41 -05:00

Author	SHA1	Message	Date
Hodlinator	b74d8d58fa	refactor: Add consteval uint256(hex_str) Complements uint256::FromHex() nicely in that it naturally does all error checking at compile time and so doesn't need to return an std::optional. Will be used in the following 2 commits to replace many calls to uint256S(). uint256S() calls taking C-string literals are littered throughout the codebase and executed at runtime to perform parsing unless a given optimizer was surprisingly efficient. While this may not be a hot spot, it's better hygiene in C++20 to store the parsed data blob directly in the binary, without any parsing at runtime.	2024-08-05 14:45:18 +02:00
brunoerg	6714276d72	miniscript: Use `ToIntegral` instead of `ParseInt64`	2024-08-05 08:23:24 -03:00
glozow	1a19a4d960	Merge bitcoin/bitcoin#29656 : chainparams: Change nChainTx type to uint64_t `bf0efb4fc7` scripted-diff: Modernize naming of nChainTx and nTxCount (Fabian Jahr) `72e5d1be1f` test: Add basic check for nChainTx type (Fabian Jahr) `dc2938e979` chainparams: Change nChainTx to uint64_t (Fabian Jahr) Pull request description: This picks up the work from #29331 and closes #29258. This simply changes the type and addresses the comments from #29331 by changing the type in all relevant places and removing unnecessary casts. This also adds an extremely simple unit test. Additionally this modernizes the name of `nChainTx` which helps reviewers check all use of the symbol and can make silent merge conflicts. ACKs for top commit: maflcko: only rebase in scripted-diff, re-ACK `bf0efb4fc7` 🔈 glozow: reACK `bf0efb4fc7` via range-diff Tree-SHA512: ee4020926d0800236fe655d0c7b127215ab36b553b04d5f91494f4b7fac6e1cfe7ee298b07c0983db5a3f4786932acaa54f5fd2ccd45f2fcdcfa13427358dc3b	2024-08-05 10:00:25 +01:00
glozow	bba01ba18d	Merge bitcoin/bitcoin#30285 : cluster mempool: merging & postprocessing of linearizations `bbcee5a0d6` clusterlin: improve rechunking in LinearizationChunking (optimization) (Pieter Wuille) `04d7a04ea4` clusterlin: add MergeLinearizations function + fuzz test + benchmark (Pieter Wuille) `4f8958d756` clusterlin: add PostLinearize + benchmarks + fuzz tests (Pieter Wuille) `0e2812d293` clusterlin: add algorithms for connectedness/connected components (Pieter Wuille) `0e52728a2d` clusterlin: rename Intersect -> IntersectPrefixes (Pieter Wuille) Pull request description: Part of cluster mempool: #30289 Depends on #30126, and was split off from it. #28676 depends on this. This adds the algorithms for merging & postprocessing linearizations. The `PostLinearize(depgraph, linearization)` function performs an in-place improvement of `linearization`, using two iterations of the [Linearization post-processing](https://delvingbitcoin.org/t/linearization-post-processing-o-n-2-fancy-chunking/201/8) algorithm. The first running from back to front, the second from front to back. The `MergeLinearizations(depgraph, linearization1, linearization2)` function computes a new linearization for the provided cluster, given two existing linearizations for that cluster, which is at least as good as both inputs. The algorithm is described at a high level in [merging incomparable linearizations](https://delvingbitcoin.org/t/merging-incomparable-linearizations/209). For background and references, see [Introduction to cluster linearization](https://delvingbitcoin.org/t/introduction-to-cluster-linearization/1032). ACKs for top commit: sdaftuar: ACK `bbcee5a0d6` glozow: code review ACK `bbcee5a0d6` instagibbs: ACK `bbcee5a0d6` Tree-SHA512: d2b5a3f132d1ef22ddf9c56421ab8b397efe45b3c4c705548dda56f5b39fe4b8f57a0d2a4c65b338462d80bb5b9b84a9a39efa1b4f390420a8005ce31817774e	2024-08-05 09:42:22 +01:00
Ryan Ofsky	1a7d20509f	Merge bitcoin/bitcoin#30526 : doc: Correct uint256 hex string endianness `73e3fa10b4` doc + test: Correct uint256 hex string endianness (Hodlinator) Pull request description: This PR is a follow-up to #30436. Only changes test-code and modifies/adds comments. Byte order of hex string representation was wrongfully documented as little-endian, but are in fact closer to "big-endian" (endianness is a memory-order concept rather than a numeric concept). `[arith_]uint256` both store their data in arrays with little-endian byte order (`arith_uint256` has host byte order within each `uint32_t` element). uint256_tests.cpp - Avoid using variable from the left side of the condition in the right side. Credits to @maflcko: https://github.com/bitcoin/bitcoin/pull/30436#discussion_r1688273553 setup_common.cpp - Skip needless ArithToUint256-conversion. Credits to @stickies-v: https://github.com/bitcoin/bitcoin/pull/30436#discussion_r1688621638 --- <details> <summary> ## Logical reasoning for endianness </summary> 1. Comparing an `arith_uint256` (`base_uint<256>`) to a `uint64_t` compares the beginning of the array, and verifies the remaining elements are zero. ```C++ template <unsigned int BITS> bool base_uint<BITS>::EqualTo(uint64_t b) const { for (int i = WIDTH - 1; i >= 2; i--) { if (pn[i]) return false; } if (pn[1] != (b >> 32)) return false; if (pn[0] != (b & 0xfffffffful)) return false; return true; } ``` ...that is consistent with little endian ordering of the array. 2. They have the same endianness (but `arith_` has host-ordering of each `uint32_t` element): ```C++ arith_uint256 UintToArith256(const uint256 &a) { arith_uint256 b; for(int x=0; x<b.WIDTH; ++x) b.pn[x] = ReadLE32(a.begin() + x4); return b; } ``` ### String conversions The reversal of order which happens when converting hex-strings <=> uint256 means strings are actually closer to big-endian, see the end of `base_blob<BITS>::SetHexDeprecated`: ```C++ unsigned char* p1 = m_data.data(); unsigned char* pend = p1 + WIDTH; while (digits > 0 && p1 < pend) { p1 = ::HexDigit(trimmed[--digits]); if (digits > 0) { p1 \|= ((unsigned char)::HexDigit(trimmed[--digits]) << 4); p1++; } } ``` Same reversal here: ```C++ template <unsigned int BITS> std::string base_blob<BITS>::GetHex() const { uint8_t m_data_rev[WIDTH]; for (int i = 0; i < WIDTH; ++i) { m_data_rev[i] = m_data[WIDTH - 1 - i]; } return HexStr(m_data_rev); } ``` It now makes sense to me that `SetHexDeprecated`, upon receiving a shorter hex string that requires zero-padding, would pad as if the missing hex chars where towards the end of the little-endian byte array, as they are the most significant bytes. "Big-endian" string representation is also consistent with the case where `SetHexDeprecated` receives too many hex digits and discards the leftmost ones, as a form of integer narrowing takes place. ### How I got it wrong in #30436 Previously I used the less than (`<`) comparison to prove endianness, but for `uint256` it uses `memcmp` and thereby gives priority to the lower bytes at the beginning of the array. ```C++ constexpr int Compare(const base_blob& other) const { return std::memcmp(m_data.data(), other.m_data.data(), WIDTH); } ``` `arith_uint256` is different in that it begins by comparing the bytes from the end, as it is using little endian representation, where the bytes toward the end are more significant. ```C++ template <unsigned int BITS> int base_uint<BITS>::CompareTo(const base_uint<BITS>& b) const { for (int i = WIDTH - 1; i >= 0; i--) { if (pn[i] < b.pn[i]) return -1; if (pn[i] > b.pn[i]) return 1; } return 0; } ``` (The commit documents that `base_blob::Compare()` is doing lexicographic ordering unlike the `arith_*`-variant which is doing numeric ordering). </details> ACKs for top commit: paplorinc: ACK `73e3fa10b4` ryanofsky: Code review ACK `73e3fa10b4` Tree-SHA512: 121630c37ab01aa7f7097f10322ab37da3cbc0696a6bbdbf2bbd6db180dc5938c7ed91003aaa2df7cf4a4106f973f5118ba541b5e077cf3588aa641bbd528f4e	2024-08-04 22:27:10 -04:00
Fabian Jahr	bf0efb4fc7	scripted-diff: Modernize naming of nChainTx and nTxCount -BEGIN VERIFY SCRIPT- sed -i 's/nChainTx/m_chain_tx_count/g' $(git grep -l 'nChainTx' ./src) sed -i 's/nTxCount/tx_count/g' $(git grep -l 'nTxCount' ./src) -END VERIFY SCRIPT-	2024-08-04 14:24:43 +02:00
Fabian Jahr	72e5d1be1f	test: Add basic check for nChainTx type	2024-08-04 12:12:39 +02:00
josibake	72a5822d43	tests: add tests for KeyPair Reuse existing BIP340 tests, as there should be no behavior change between the two	2024-08-04 08:52:21 +02:00
Hodlinator	73e3fa10b4	doc + test: Correct uint256 hex string endianness Follow-up to #30436. uint256 string representation was wrongfully documented as little-endian due to them being reversed by GetHex() etc, and base_blob::Compare() giving most significance to the beginning of the internal array. They are closer to "big-endian", but this commit tries to be even more precise than that. uint256_tests.cpp - Avoid using variable from the left side of the condition in the right side. setup_common.cpp - Skip needless ArithToUint256-conversion.	2024-08-03 21:59:54 +02:00
josibake	5d507a0091	tests: add key tweak smoke test Sanity check that using CKey/CPubKey directly vs using secp256k1_keypair objects returns the same results for BIP341 key tweaking. Co-authored-by: l0rinc <pap.lorinc@gmail.com>	2024-08-03 15:16:03 +02:00
glozow	2aff9a36c3	Merge bitcoin/bitcoin#30352 : policy: Add PayToAnchor(P2A), `OP_1 <0x4e73>` as a standard output script for spending `75648cea5a` test: add P2A ProduceSignature coverage (Greg Sanders) `7998ce6b20` Add release note for P2A output feature (Greg Sanders) `71c9b02a04` test: add P2A coverage for decodescript (Greg Sanders) `1349e9ec15` test: Add anchor mempool acceptance test (Greg Sanders) `9d89209937` policy: stop 3rd party wtxid malleability of anchor spend (Greg Sanders) `b60aaf8b23` policy: make anchor spend standard (Greg Sanders) `455fca86cf` policy: Add OP_1 <0x4e73> as a standard output type (Greg Sanders) Pull request description: This is a sub-feature taken out of the original proposal for ephemeral anchors #30239 This PR makes spending of `OP_1 <0x4e73>` (i.e. `bc1pfeessrawgf`) standard. Creation of this output type is already standard. Any future witness output types are considered relay-standard to create, but not to spend. This preserves upgrade hooks, such as a completely new output type for a softfork such as BIP341. It also gives us a bit of room to use a new output type for policy uses. This particular sized witness program has no other known use-cases (https://bitcoin.stackexchange.com/a/110664/17078), s it affords insufficient cryptographic security for a secure commitment to data, such as a script or a public key. This makes this type of output "keyless", or unauthenticated. As a witness program, the `scriptSig` of the input MUST be blank, by BIP141. This helps ensure txid-stability of the spending transaction, which may be required for smart contracting wallets. If we do not use segwit, a miner can simply insert an `OP_NOP` in the `scriptSig` without effecting the result of program execution. An additional relay restriction is to disallow non-empty witness data, which an adversary may use to penalize the "honest" transactor when RBF'ing the transaction due to the incremental fee requirement of RBF rules. The intended use-case for this output type is to "anchor" the transaction with a spending child to bring exogenous CPFP fees into the transaction package, encouraging the inclusion of the package in a block. The minimal size of creation and spending of this output makes it an attractive contrast to outputs like `p2sh(OP_TRUE)` and `p2wsh(OP_TRUE)` which are significantly larger in vbyte terms. Combined with TRUC transactions which limits the size of child transactions significantly, this is an attractive option for presigned transactions that need to be fee-bumped after the fact. ACKs for top commit: sdaftuar: utACK `75648cea5a` theStack: re-ACK `75648cea5a` ismaelsadeeq: re-ACK `75648cea5a` via [diff](`e7ce6dc070..75648cea5a`) glozow: ACK `75648cea5a` tdb3: ACK `75648cea5a` Tree-SHA512: d529de23d20857e6cdb40fa611d0446b49989eaafed06c28280e8fd1897f1ed8d89a4eabbec1bbf8df3d319910066c3dbbba5a70a87ff0b2967d5205db32ad1e	2024-08-02 15:49:44 +01:00
Andrew Toth	8bd3959fea	refactor: require self and sentinel parameters for AddFlags No behavior change. Prepares for adding the CoinsCachePairs to a linked list when flagged.	2024-08-02 00:29:42 -04:00
Andrew Toth	f08faeade2	refactor: move flags to private uint8_t and rename to m_flags No behavior change. This prepares to add CCoinsCacheEntrys to a doubly linked list when a flag is added.	2024-08-01 23:36:00 -04:00
Andrew Toth	8737c0cefa	refactor: encapsulate flags setting with AddFlags and ClearFlags No behavior change. This prepares moving the cache entry flags field to private access.	2024-08-01 23:36:00 -04:00
Andrew Toth	9715d3bf1e	refactor: encapsulate flags get access for all other checks No behavior change. This prepares moving the cache entry flags field to private access.	2024-08-01 23:36:00 -04:00
Andrew Toth	df34a94e57	refactor: encapsulate flags access for dirty and fresh checks No behavior change. This prepares moving the cache entry flags field to private access. Co-Authored-By: l0rinc <pap.lorinc@gmail.com>	2024-08-01 23:36:00 -04:00
Pieter Wuille	04d7a04ea4	clusterlin: add MergeLinearizations function + fuzz test + benchmark	2024-08-01 16:03:34 -04:00
Pieter Wuille	4f8958d756	clusterlin: add PostLinearize + benchmarks + fuzz tests	2024-08-01 16:02:09 -04:00
Pieter Wuille	0e2812d293	clusterlin: add algorithms for connectedness/connected components Add utility functions to DepGraph for finding connected components.	2024-08-01 15:43:59 -04:00
Pieter Wuille	0e52728a2d	clusterlin: rename Intersect -> IntersectPrefixes This makes it clearer what the function does.	2024-08-01 14:07:54 -04:00
glozow	ebd82fa9fa	Merge bitcoin/bitcoin#30532 : refactor: remove deprecated TxidFromString() in favour of transaction_identifier::FromHex() `f553e6d86f` refactor: remove TxidFromString (stickies-v) `285ab50ace` test: replace WtxidFromString with Wtxid::FromHex (stickies-v) `9a0b2a69c4` fuzz: increase FromHex() coverage (stickies-v) `526a87ba6b` test: add uint256::FromHex unittest coverage (stickies-v) Pull request description: Since `fab6ddbee6`, `TxidFromString()` has been deprecated because it is less robust than the `transaction_identifier::FromHex()` introduced in [the same PR](https://github.com/bitcoin/bitcoin/pull/30482). Specifically, it tries to recover from length-mismatches, recover from untrimmed whitespace, 0x-prefix and garbage at the end, instead of simply requiring exactly 64 hex-only characters. In this PR, `TxidFromString` is removed completely to clean up the code and prevent further unsafe usage. Unit and fuzz test coverage on `uint256::FromHex()` and functions that wrap it is increased. Note: `TxidFromSring` allowed users to prefix strings with "0x", this is no longer allowed for `transaction_identifier::FromHex()`, so a helper function for input validation may prove helpful in the future _(this overlaps with the `uint256::uint256S()` vs `uint256::FromHex()` future cleanup)_. It is not relevant to this PR, though, besides the fact that this unused (except for in tests) functionality is removed. The only users of `TxidFromString` are: - `test`, where it is straightforward to drop in the new `FromHex()` methods without much further concern - `qt` coincontrol. There is no need for input validation here, but txids are not guaranteed to be 64 characters. This is already handled by the existing code, so again, using `FromHex()` here seems quite straightforward. Addresses @maflcko's suggestion: https://github.com/bitcoin/bitcoin/pull/30482#discussion_r1691826934 Also removes `WtxidFromString()`, which is a test-only helper function. ### Testing GUI changes To test the GUI coincontrol affected lines, `regtest` is probably the easiest way to quickly get some test coins, you can use e.g. ``` alias cli="./src/bitcoin-cli -regtest" cli createwallet "coincontrol" # generate 10 spendable outputs on 1 address cli generatetoaddress 10 $(cli -rpcwallet=coincontrol getnewaddress) # generate 10 spendable outputs on another address cli generatetoaddress 10 $(cli -rpcwallet=coincontrol getnewaddress) # make previous outputs spendable cli generatetoaddress 100 $(cli -rpcwallet=coincontrol getnewaddress) ``` ACKs for top commit: maflcko: re-ACK `f553e6d86f` 🔻 hodlinator: ACK `f553e6d86f` paplorinc: ACK `f553e6d86f` TheCharlatan: Nice, ACK `f553e6d86f` Tree-SHA512: c1c7e6ea4cbf05cf660ba178ffc4f35f0328f7aa6ad81872e2462fb91a6a22e4681ff64b3d0202a5a9abcb650c939561585cd309164a69ab6081c0765ee271ef	2024-08-01 12:02:52 +01:00
stickies-v	f553e6d86f	refactor: remove TxidFromString TxidFromString was deprecated due to missing 64-character length-check and hex-check, replace it with the more robust Txid::FromHex.	2024-07-31 16:47:39 +01:00
stickies-v	285ab50ace	test: replace WtxidFromString with Wtxid::FromHex The newly introduced Wtxid::FromHex is more robust and removes the need for a WtxidFromString helper function	2024-07-31 16:47:39 +01:00
stickies-v	9a0b2a69c4	fuzz: increase FromHex() coverage	2024-07-31 16:47:38 +01:00
stickies-v	526a87ba6b	test: add uint256::FromHex unittest coverage Simultaneously cover transaction_identifier::FromHex()	2024-07-31 16:47:37 +01:00
dergoegge	afd237bb5d	[fuzz] Harness for version handshake	2024-07-31 13:25:52 +01:00
Greg Sanders	75648cea5a	test: add P2A ProduceSignature coverage	2024-07-30 14:06:58 -04:00
Greg Sanders	455fca86cf	policy: Add OP_1 <0x4e73> as a standard output type These outputs are called anchors, and allow key-less anchor spends which are vsize-minimized versus keyed anchors which require larger outputs when creating and inputs when spending.	2024-07-30 14:06:58 -04:00
MarcoFalke	fa895c7283	mingw: Document mode wbx workaround	2024-07-26 17:31:15 +02:00
Ryan Ofsky	30cef53707	Merge bitcoin/bitcoin#30386 : Early logging improvements `b4dd7ab43e` logging: use std::string_view (Anthony Towns) `558df5c733` logging: Apply formatting to early log messages (Anthony Towns) `6cf9b34440` logging: Limit early logging buffer (Anthony Towns) `0b1960f1b2` logging: Add DisableLogging() (Anthony Towns) `6bbc2dd6c5` logging: Add thread safety annotations (Anthony Towns) Pull request description: In order to cope gracefully with `Log()` calls that are invoked prior to logging being fully configured (indicated by calling `StartLogging()` we buffer early log messages in `m_msgs_before_open`. This has a couple of minor issues: if there are many such log messages the buffer can become arbitrarily large; this can be a problem for users of libkernel that might not wish to worry about logging at all, and as a result never invoke `StartLogging()` * early log messages are formatted before the formatting options are configured, leading to inconsistent output Fix those issues by buffering the log info prior to formatting it, and setting a limit on the size of the buffer (dropping the oldest lines, and reporting the number of lines skipped). Also adds some thread safety annotations, and the ability to invoke `LogInstance().DisableLogging()` if you want to disable logging entirely, for a minor efficiency improvement. ACKs for top commit: maflcko: re-ACK `b4dd7ab43e` 🕴 ryanofsky: Code review ACK `b4dd7ab43e` TheCharlatan: Nice, ACK `b4dd7ab43e` Tree-SHA512: 966660181276939225a9f776de6ee0665e44577d2ee9cc76b06c8937297217482e6e426bdc5772d1ce533a0ba093a8556b6a50857d4c876ad8923e432a200440	2024-07-26 08:06:08 -04:00
Ryan Ofsky	123888dcb8	Merge bitcoin/bitcoin#30447 : fuzz: Deglobalize signature cache in sigcache test `fae0db0360` fuzz: Deglobalize signature cache in sigcache test (TheCharlatan) Pull request description: The body of the fuzz test should ideally be a pure function. If data is persisted in the cache over many iterations, and there is a crash, reproducing it from the input might be difficult. Solve this by getting rid of the global state. This is a follow-up from #30425. ACKs for top commit: dergoegge: utACK `fae0db0360` ryanofsky: Code review ACK `fae0db0360` Tree-SHA512: 93dcbb9f2497f13856970469042d6870f04de10fe206827a8db1aae7fc8f3ac7fd900bee7945b5fe4c9e33883268dabb15be7e7bc91cf353ffc0d118cd60e97d	2024-07-26 07:41:10 -04:00
Pieter Wuille	28549791b3	clusterlin: permit passing in existing linearization to Linearize This implements the LIMO algorithm for linearizing by improving an existing linearization. See https://delvingbitcoin.org/t/limo-combining-the-best-parts-of-linearization-search-and-merging for details.	2024-07-25 10:16:40 -04:00
Pieter Wuille	97d98718b0	clusterlin: add LinearizationChunking class It encapsulates a given linearization in chunked form, permitting arbitrary subsets of transactions to be removed from the linearization. Its purpose is adding the Intersect function, which is a crucial operation that will be used in a further commit to make Linearize improve existing linearizations.	2024-07-25 10:16:40 -04:00
Pieter Wuille	d5918dc3c6	clusterlin: randomize the SearchCandidateFinder search order To make search non-deterministic, change the BFS logic from always picking the first queue item to randomly picking the first or second queue item.	2024-07-25 10:16:40 -04:00
Pieter Wuille	46aad9b099	clusterlin: add Linearize function This adds a first version of the overall linearization interface, which given a DepGraph constructs a good linearization, by incrementally including good candidate sets (found using AncestorCandidateFinder and SearchCandidateFinder).	2024-07-25 10:16:37 -04:00
Pieter Wuille	ee0ddfe4f6	clusterlin: add chunking algorithm A fuzz test is added which verifies various of its expected properties, including correctness	2024-07-25 10:16:37 -04:00
Pieter Wuille	2a41f151af	clusterlin: add SearchCandidateFinder class Similar to AncestorCandidateFinder, this encapsulates the state needed for finding good candidate sets using a search algorithm.	2024-07-25 10:16:37 -04:00
Pieter Wuille	4828079db3	clusterlin: add AncestorCandidateFinder class This is a class that encapsulates precomputed ancestor set feerates, and presents an interface for getting the best remaining ancestor set.	2024-07-25 10:16:37 -04:00
Pieter Wuille	58f7e01db4	tests: framework for testing DepGraph class This introduces a bespoke fuzzing-focused serialization format for DepGraphs, and then tests that this format can represent any graph, roundtrips, and then uses that to test the correctness of DepGraph itself. This forms the basis for future fuzz tests that need to work with interesting graphs.	2024-07-25 10:16:37 -04:00
merge-script	bee23ce9ec	Merge bitcoin/bitcoin#30399 : test: Add arguments for creating a slimmer TestingSetup `f46b220256` fuzz: Use BasicTestingSetup for coins_view target (TheCharlatan) `9e2a723d5d` test: Add arguments for creating a slimmer setup (TheCharlatan) Pull request description: This adds arguments to some of the testing setup constructors for creating an environment without networking and a validation interface. This is useful for improving the performance of the utxo snapshot fuzz test, which constructs a new TestingSetup on each iteration. Using this slimmed down `TestingSetup` in future might also make the tests a bit faster when run in aggregate. ACKs for top commit: maflcko: review ACK `f46b220256` dergoegge: utACK `f46b220256` Tree-SHA512: 9dc62512b127b781fc9e2d8ef2b5a9b06ebb927a8294b6d872001c553984a7eb1f348e0257b32435b34b5505b5d0323f73bdd572a673da272d3e1e8538ab49d6	2024-07-25 13:53:50 +01:00
merge-script	30e8a79aef	Merge bitcoin/bitcoin#30482 : rest: Reject truncated hex txid early in getutxos parsing `fac0c3d4bf` doc: Add release notes for two pull requests (MarcoFalke) `fa7b57e5f5` refactor: Replace ParseHashStr with FromHex (MarcoFalke) `fa90777245` rest: Reject truncated hex txid early in getutxos parsing (MarcoFalke) `fab6ddbee6` refactor: Expose FromHex in transaction_identifier (MarcoFalke) `fad2991ba0` refactor: Implement strict uint256::FromHex() (MarcoFalke) `fa103db2bb` scripted-diff: Rename SetHex to SetHexDeprecated (MarcoFalke) `fafe4b8051` test: refactor: Replace SetHex with uint256 constructor directly (MarcoFalke) Pull request description: In `rest_getutxos` truncated txids such as `aa` or `ff` are accepted. This is brittle at best. Fix it by rejecting any truncated (or overlarge) input. ---- Review note: This also starts a major refactor to rework hex parsing in Bitcoin Core, meaning that a few refactor commits are included as well. They are explained individually in the commit message and the work will be continued in the future. ACKs for top commit: stickies-v: re-ACK `fac0c3d4bf` - only doc and test updates to address review comments, thanks! hodlinator: ACK `fac0c3d4bf` Tree-SHA512: 473feb3fcf6118443435d1dd321006135b0b54689bfbbcb1697bb5811a449bef51f475c715de6911ff3c4ea3bdb75f601861ff93347bc4414d6b9e5298105dd7	2024-07-25 13:49:21 +01:00
MarcoFalke	fa7b57e5f5	refactor: Replace ParseHashStr with FromHex No need to have two functions with different names that achieve the exact same thing.	2024-07-24 17:40:18 +02:00
MarcoFalke	fad2991ba0	refactor: Implement strict uint256::FromHex() This is a safe replacement of the previous SetHex, which now returns an optional to indicate success or failure. The code is similar to the ParseHashStr helper, which will be removed in a later commit.	2024-07-24 17:38:06 +02:00
merge-script	9607277032	Merge bitcoin/bitcoin#30111 : locks: introduce mutex for tx download, flush rejection filters once per tip change `c85accecaf` [refactor] delete EraseTxNoLock, just use EraseTx (glozow) `6ff84069a5` remove obsoleted TxOrphanage::m_mutex (glozow) `61745c7451` lock m_recent_confirmed_transactions using m_tx_download_mutex (glozow) `723ea0f9a5` remove obsoleted hashRecentRejectsChainTip (glozow) `18a4355250` update recent_rejects filters on ActiveTipChange (glozow) `36f170d879` add ValidationInterface::ActiveTipChange (glozow) `3eb1307df0` guard TxRequest and rejection caches with new mutex (glozow) Pull request description: See #27463 for full project tracking. This contains the first few commits of #30110, which require some thinking about thread safety in review. - Introduce a new `m_tx_download_mutex` which guards the transaction download data structures including `m_txrequest`, the rolling bloom filters, and `m_orphanage`. Later this should become the mutex guarding `TxDownloadManager`. - `m_txrequest` doesn't need to be guarded using `cs_main` anymore - `m_recent_confirmed_transactions` doesn't need its own lock anymore - `m_orphanage` doesn't need its own lock anymore - Adds a new `ValidationInterface` event, `ActiveTipChanged`, which is a synchronous callback whenever the tip of the active chainstate changes. - Flush `m_recent_rejects` and `m_recent_rejects_reconsiderable` on `ActiveTipChanged` just once instead of checking the tip every time `AlreadyHaveTx` is called. This should speed up calls to that function (no longer comparing a block hash each time) and removes the need to lock `cs_main` every time it is called. Motivation: - These data structures need synchronization. While we are holding `m_tx_download_mutex`, these should hold: - a tx hash in `m_txrequest` is not also in `m_orphanage` - a tx hash in `m_txrequest` is not also in `m_recent_rejects` or `m_recent_confirmed_transactions` - In the future, orphan resolution tracking should also be synchronized. If a tx has an entry in the orphan resolution tracker, it is also in `m_orphanage`, and not in `m_txrequest`, etc. - Currently, `cs_main` is used to e.g. sync accesses to `m_txrequest`. We should not broaden the scope of things it locks. - Currently, we need to know the current chainstate every time we call `AlreadyHaveTx` so we can decide whether we should update it. Every call compares the current tip hash with `hashRecentRejectsChainTip`. It is more efficient to have a validation interface callback that updates the rejection filters whenever the chain tip changes. ACKs for top commit: instagibbs: reACK `c85accecaf` dergoegge: Code review ACK `c85accecaf` theStack: Light code-review ACK `c85accecaf` hebasto: ACK `c85accecaf`, I have reviewed the code and it looks OK. Tree-SHA512: c3bd524b5de1cafc9a10770dadb484cc479d6d4c687d80dd0f176d339fd95f73b85cb44cb3b6b464d38a52e20feda00aa2a1da5a73339e31831687e4bd0aa0c5	2024-07-24 09:30:28 +01:00
MarcoFalke	fa103db2bb	scripted-diff: Rename SetHex to SetHexDeprecated SetHex is fragile, because it accepts any non-hex input or any length of input, without error feedback. This can lead to issues when the input is truncated or otherwise corrupted. Document the problem by renaming the method. In the future, the fragile method should be removed from the public interface. -BEGIN VERIFY SCRIPT- sed -i 's/SetHex/SetHexDeprecated/g' $( git grep -l SetHex ./src ) -END VERIFY SCRIPT-	2024-07-24 09:15:34 +02:00
MarcoFalke	fafe4b8051	test: refactor: Replace SetHex with uint256 constructor directly This avoids a hex-decoding and makes the next commit smaller.	2024-07-24 09:14:57 +02:00
Ryan Ofsky	7cc00bfc86	Merge bitcoin/bitcoin#30436 : fix: Make TxidFromString() respect string_view length `09ce3501fa` fix: Make TxidFromString() respect string_view length (Hodlinator) `01e314ce0a` refactor: Change base_blob::SetHex() to take std::string_view (Hodlinator) `2f5577dc2e` test: uint256 - Garbage suffixes and zero padding (Hodlinator) `f11f816800` refactor: Make uint256_tests no longer use deprecated BOOST_CHECK() (Hodlinator) `f0eeee2dc1` test: Add test for TxidFromString() behavior (Ryan Ofsky) Pull request description: ### Problem Prior to this, `TxidFromString()` was passing `string_view::data()` into `uint256S()` which meant it would only receive the a naked `char` pointer and potentially scan past the `string_view::length()` until it found a null terminator (or some other non-hex character). Appears to have been a fully dormant bug as callers were either passing a string literal or `std::string` directly to `TxidFromFromString()`, meaning a null terminator always existed at `pointer[length()]`. Bug existed since original merge of `TxidFromString()`. ### Solution Make `uint256S()` (and `base_blob::SetHex()`) take and operate on `std::string_view` instead of `const char` and have `TxidFromString()` pass that in. (PR was prompted by comment in https://github.com/bitcoin/bitcoin/pull/30377#issuecomment-2208857200 (referring to https://github.com/bitcoin/bitcoin/pull/28922#discussion_r1404437378)). ACKs for top commit: maflcko: re-ACK `09ce3501fa` 🕓 paplorinc: ACK `09ce3501fa` ryanofsky: Code review ACK `09ce3501fa`. I think the current code changes are about as small as you could make to fix the bug without introducing a string copy, and the surrounding test improvements are all very nice and welcome. Tree-SHA512: c2c10551785fb6688d1e2492ba42a8eee4c19abbe8461bb0774d56a70c23cd6b0718d2641632890bee880c06202dee148126447dd2264eaed4f5fee7e1bcb581	2024-07-23 14:19:27 -04:00
Hodlinator	09ce3501fa	fix: Make TxidFromString() respect string_view length Prior to this, passing string_view::data() into uint256S() meant the latter would only receive the a naked char* pointer and potentially scan past the string_view::length() until it found a null terminator (or some other non-hex character). Appears to have been a fully dormant bug as callers were either passing a string literal or std::string directly to TxidFromFromString(), meaning null terminator always existed at pointer[length()]. Bug existed since original merge of TxidFromString(), discussed in https://github.com/bitcoin/bitcoin/pull/28922#discussion_r1404437378.	2024-07-23 14:51:39 +02:00
Hodlinator	01e314ce0a	refactor: Change base_blob::SetHex() to take std::string_view Clarify that hex strings are parsed as little-endian.	2024-07-23 14:51:36 +02:00
Hodlinator	2f5577dc2e	test: uint256 - Garbage suffixes and zero padding	2024-07-23 14:44:30 +02:00

... 3 4 5 6 7 ...

5446 commits