< bitcoin-git> [bitcoin] MarcoFalke pushed 4 commits to master: https://github.com/bitcoin/bitcoin/compare/bf5e6a7771b3...e2d4e67a8fcc
< bitcoin-git> bitcoin/master fa2197c MarcoFalke: test: Use loop to register RPCs
< bitcoin-git> bitcoin/master 000098f MarcoFalke: test: Use throwing variant accessor
< bitcoin-git> bitcoin/master fa8a888 MarcoFalke: bench: Remove duplicate constants
< bitcoin-git> [bitcoin] MarcoFalke merged pull request #21840: test: Misc refactor to get rid of &foo[0] raw pointers (master...2105-testRefactor) https://github.com/bitcoin/bitcoin/pull/21840
< bitcoin-git> [bitcoin] MarcoFalke opened pull request #21848: refactor: Make CFeeRate constructor arch-independent (master...2105-feerate) https://github.com/bitcoin/bitcoin/pull/21848
< bitcoin-git> [bitcoin] MarcoFalke opened pull request #21849: fuzz: Limit toxic test globals to their respective scope (master...2105-fuzzToxic) https://github.com/bitcoin/bitcoin/pull/21849
< bitcoin-git> [bitcoin] laanwj pushed 4 commits to master: https://github.com/bitcoin/bitcoin/compare/e2d4e67a8fcc...ab9a566ab333
< bitcoin-git> bitcoin/master ea269c7 Jon Atack: contrib: parse I2P addresses in generate-seeds.py
< bitcoin-git> bitcoin/master e01f173 Jon Atack: contrib: add a few I2P seed nodes
< bitcoin-git> bitcoin/master 142e2da Jon Atack: net: add I2P seeds to chainparamsseeds
< bitcoin-git> [bitcoin] laanwj merged pull request #21825: net: add I2P hardcoded seeds (master...i2p-hardcoded-seeds) https://github.com/bitcoin/bitcoin/pull/21825
< wumpus> ryanofsky: done
< gmaxwell> Antpool taproot block, thats over 50% hashpower now.
< aj> \o/
< hugohn> hi, I'm curious what else needs to be done before https://github.com/bitcoin/bips/pull/1097 can get a BIP number assigned?
< hugohn> I saw some chatter earlier about folks wanting to change the BIP process, but until that becomes more clear I assume the process stays the same.
< hugohn> (also yay on Taproot signaling progress)
< wumpus> hugohn: nothing, it should just get a BIP number assigned
< wumpus> ping @luke-jr ^
< aj> hugohn: https://lists.linuxfoundation.org/pipermail/bitcoin-dev/2021-April/018868.html suggests assigning a name in the meantime, i guess like hugohn-multisig-setup; otherwise what wumpus said
< hugohn> @wumpus @aj thanks !
< wumpus> yes, moving to names might make sense too, in any case please don't let bitcoin innovation be stuck on having to centrally assign numbers, this shouldn't be an issue
< hugohn> I've already named the branch "bip-hugonguyen-bsms". Does it need to be included in the spec itself?
< hugohn> on a side note: I think using BIP numbers or acronyms like BSMS / PSBT as unique identifiers for a spec is still preferable. Using personal names as identifiers for a standard seems... weird :)
< hugohn> not to mention the long ID is a mouthful / harder to share
< hugohn> although the idea of a more decentralized process does sound good
< hugohn> maybe the process could be decoupled, but the ID aspect could stay centralized (e.g. there could be a BIP on BIP number generation). just thinking out loud.
< wumpus> an idea we had early was to simply use the PR number as BIP number
< wumpus> some projects (rust, I think?) use this approach
< wumpus> but also there was some effort to make BIP numbers meaningful, certain ranges map to certain things, it is not simply a sequence generator
< wumpus> agree that having an author name in it is weird, on the other hand, namespacing is hard, many projects eventually settle on organization/pseudonym based namespacing to prevent name squatting etc
< hugohn> right. if it's not a simple sequence generator, a BIP on BIPs probably makes more sense.
< wumpus> this is mostly a problem if it were managed by a script bot
< wumpus> you mean like BIP1 and 2?
< hugohn> yes
< wumpus> in any case, this discussion is kind of off-topic here, the bitcoin core project is one implementation, that the current BIPs repository is in the same organization is a historical artifact, it does not mean BIPs 'belong' to bitcoin core or something like that
< michaelfolkson> I think there will be future discussions on a revised BIP process once (hopefully) Taproot activation is completed hugohn if you'd like to engage with that https://github.com/bitcoin/bips/pull/1015
< michaelfolkson> A few things I think are probably in need of a revamp (or at least a discussion)
< hugohn> thanks @michaelfolkson . I'll take a look.
< wumpus> ideally it should be a fast and low-friction process
< michaelfolkson> Right, there are a few edge cases that need ironing out too eg what "Rejected" means and how you get out of "Rejected" status (if indeed you ever do)
< michaelfolkson> I'm assuming GUI translation issues should be in the GUI repo and not the main repo? https://github.com/bitcoin/bitcoin/issues/21847
< wumpus> correct, that is what the GUI repo is for
< wumpus> i think we should add an entry to the "https://github.com/bitcoin/bitcoin/issues/new/choose" list for GUI issues, redirecting people to the appropriate repository
< hugohn> @wumpus I understand. I think we should continue with this "historical artifact", as unfortunate as it is, until there's a better way.
< hugohn> As the Bitcoin protocol slowly ossifies, the bulk of new specs will likely be in the Application layer. It does make less sense to bundle them with the Core project.
< michaelfolkson> hugohn: They aren't really bundled with the Core project. It is true the BIPs repo is under the Bitcoin Core GitHub org but no Bitcoin Core maintainers merge BIP PRs afaik so I think it is fine under the Core org
< michaelfolkson> hugohn: I personally think it would be overkill to give the BIPs repo its own org
< michaelfolkson> wumpus: There already is "An issue or feature request related to the GUI" option when you open an issue. Is that what you mean?
< michaelfolkson> wumpus: You click on that option and you get "Any report, issue or feature request related to the GUI should be reported at
< wumpus> michaelfolkson: yes, except it should send you to the GUI repo
< wumpus> oh okay, that's a bit weird but i guess it works
< michaelfolkson> wumpus: Ah ok I see what you mean
< michaelfolkson> I agree that would be optimal but not sure if you can be redirected to a different repo when you click a new issue option
< wumpus> i guess the problem is that "gui" will have the same options, by definition, because it has the same underlying repository
< michaelfolkson> Right
< wumpus> i thought it could be like the "view policy" link for security reports, but apparently that one is a special edge case handled by github
< bitcoin-git> [bitcoin] fanquake pushed 4 commits to master: https://github.com/bitcoin/bitcoin/compare/ab9a566ab333...0ca8b7e7ecd5
< bitcoin-git> bitcoin/master faeabef MarcoFalke: ci: Enable D_GLIBCXX_DEBUG for multiprocess task
< bitcoin-git> bitcoin/master fad0f21 MarcoFalke: ci: Use clang in multiprocess task to avoid OOM
< bitcoin-git> bitcoin/master fa44f51 MarcoFalke: ci: Clarify that previous_releases task is using DEBUG
< bitcoin-git> [bitcoin] fanquake merged pull request #21812: ci: Enable D_GLIBCXX_DEBUG for multiprocess task (master...2104-ciDEBUG) https://github.com/bitcoin/bitcoin/pull/21812
< jonatack> MarcoFalke: question regarding fuzzed_data_provider.ConsumeBool(), which exists but isn't used yet...is it ok to add a line to an enum to use it? e.g. adding "kMaxValue = NET_MAX," at the end of netaddress.h::enum Network
< fanquake> I see 169 uses of fuzzed_data_provider.ConsumeBool() ?
< jonatack> er, ConsumeEnum, mistyped...or do we prefer to avoid changing enums to use it
< jonatack> will propose using ConsumeEnum() and see what reviewers say
< bitcoin-git> [bitcoin] kiminuo opened pull request #21850: Remove `GetDataDir(net_specific)` function (master...feature/2021-05-get-data-dir-step-2) https://github.com/bitcoin/bitcoin/pull/21850
< bitcoin-git> [bitcoin] fanquake opened pull request #21851: [WIP] build: support cross-compiling for arm64-apple-darwin20 (Apple M1) in depends (master...m1_support_depends) https://github.com/bitcoin/bitcoin/pull/21851
< bitcoin-git> [bitcoin] MarcoFalke opened pull request #21852: ci: Add msan fuzz config (master...2105-ci12Msan) https://github.com/bitcoin/bitcoin/pull/21852
< bitcoin-git> [bitcoin] fanquake closed pull request #21414: doc: add arm macOS depends platform triplet (master...macOS-platform-triplets) https://github.com/bitcoin/bitcoin/pull/21414
< vasild> sdaftuar: https://github.com/bitcoin/bitcoin/pull/20685#discussion_r625856311 -- maybe the i2p router was restarted or the connection to it was interrupted in some way and the i2p thread did RemoveLocal() (here: https://github.com/bitcoin/bitcoin/blob/0ca8b7e7ecd5bc537fbc1e372f6755a34a136f7f/src/net.cpp#L2232-L2234) which undid the initial AddLocal(MANUAL) due to externalip
< vasild> later when the connection was reestablished the i2p thread does AddLocal(BIND) which is not "strong" enough to overcome fDiscover==false
< vasild> sounds plausible?
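The interaction vasild describes can be sketched roughly as follows; this is a simplified stand-in for the AddLocal()/RemoveLocal() score handling in src/net.cpp, not the literal code. With -discover=0 a score below LOCAL_MANUAL is refused outright, so once the I2P error path has called RemoveLocal(), the reconnect path's AddLocal() at LOCAL_BIND strength can no longer re-add the address.

```cpp
// Minimal sketch (stand-in types, not the real src/net.cpp) of the score
// precedence described above.
#include <map>

enum LocalScore { LOCAL_NONE, LOCAL_IF, LOCAL_BIND, LOCAL_MANUAL };

struct Service {
    int id;
    bool operator<(const Service& other) const { return id < other.id; }
};

std::map<Service, int> g_map_local; // stand-in for mapLocalHost
bool g_discover{false};             // stand-in for fDiscover (-discover=0)

bool AddLocal(const Service& addr, int score)
{
    if (!g_discover && score < LOCAL_MANUAL) return false; // LOCAL_BIND is rejected here
    int& current = g_map_local[addr];
    if (score > current) current = score;                  // higher scores win, never downgrade
    return true;
}

void RemoveLocal(const Service& addr) { g_map_local.erase(addr); }
```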
< bitcoin-git> [bitcoin] MarcoFalke pushed 4 commits to master: https://github.com/bitcoin/bitcoin/compare/0ca8b7e7ecd5...a1c6434e19bc
< bitcoin-git> bitcoin/master fab3017 MarcoFalke: ci: Set BASE_SCRATCH_DIR early, so that it can be used in test configs
< bitcoin-git> bitcoin/master fa399a7 MarcoFalke: ci: Use clang-12 in msan task
< bitcoin-git> bitcoin/master fa0422c MarcoFalke: ci: Add msan fuzz config
< bitcoin-git> [bitcoin] MarcoFalke merged pull request #21852: ci: Add msan fuzz config (master...2105-ci12Msan) https://github.com/bitcoin/bitcoin/pull/21852
< jonatack> update on FuzzedDataProvider::ConsumeEnum(), proposed to allow passing it a std::optional<T> max_value, so ConsumeEnum() may be used without needing to change existing enums.
< vasild> does it assume contiguous values starting from 0?
< jonatack> yes, same as before
< jonatack> only avoids having to add an alias to the enum in the codebase
< vasild> yeah, can't be otherwise :/
< jonatack> otherwise, there's the existing ConsumeWeakEnum(values list...
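For reference, a sketch of the two options under discussion, using the stock LLVM FuzzedDataProvider API. The enum values roughly mirror Network in netaddress.h; the kMaxValue alias is the hypothetical addition jonatack mentioned, while the ConsumeIntegralInRange() variant bounds the range by hand without touching the enum (roughly the convenience #21855 aims to wrap).

```cpp
// Sketch only: Network roughly mirrors netaddress.h; kMaxValue is the
// hypothetical alias required by FuzzedDataProvider::ConsumeEnum().
#include <fuzzer/FuzzedDataProvider.h>
#include <cstdint>

enum Network {
    NET_UNROUTABLE = 0,
    NET_IPV4,
    NET_IPV6,
    NET_ONION,
    NET_I2P,
    NET_CJDNS,
    NET_INTERNAL,
    NET_MAX,
    kMaxValue = NET_MAX, // hypothetical addition so ConsumeEnum() compiles
};

void FuzzNetwork(FuzzedDataProvider& provider)
{
    // Option (a): annotate the enum with kMaxValue and use ConsumeEnum() directly.
    const Network a = provider.ConsumeEnum<Network>();

    // Option (b): leave the enum untouched and bound the range explicitly.
    const Network b = static_cast<Network>(
        provider.ConsumeIntegralInRange<uint8_t>(0, NET_MAX));
    (void)a;
    (void)b;
}
```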
< bitcoin-git> [bitcoin] MarcoFalke pushed 2 commits to master: https://github.com/bitcoin/bitcoin/compare/a1c6434e19bc...3f8f238deb5a
< bitcoin-git> bitcoin/master cf83b82 MarcoFalke: fuzz: Limit toxic test globals to their respective scope
< bitcoin-git> bitcoin/master 3f8f238 MarcoFalke: Merge bitcoin/bitcoin#21849: fuzz: Limit toxic test globals to their respe...
< bitcoin-git> [bitcoin] MarcoFalke merged pull request #21849: fuzz: Limit toxic test globals to their respective scope (master...2105-fuzzToxic) https://github.com/bitcoin/bitcoin/pull/21849
< luke-jr> hugohn: I think what it is missing, is something like "This BIP is not compatible with existing multisig software/hardware at all."
< luke-jr> (unless it is)
< bitcoin-git> [bitcoin] Relaxo143 opened pull request #21854: doc: make small corrections (v0.21.1 release notes) (master...patch-1) https://github.com/bitcoin/bitcoin/pull/21854
< bitcoin-git> [bitcoin] jonatack opened pull request #21855: fuzz: enable passing a max value to FuzzedDataProvider::ConsumeEnum() (master...ConsumeEnum-enable-passing-max-value) https://github.com/bitcoin/bitcoin/pull/21855
< hugohn> @luke-jr the inclusion of things like NO_ENCRYPTION mode and BIP39-like PBKDF2 function parameters in the spec is to help with backward compatibility. IMO the main thing that might not be backward compatible with all vendors is the stateful nature of the Signers, which I have stated in the Compatibility section.
< bitcoin-git> [bitcoin] adamjonas opened pull request #21856: doc: add OSS-Fuzz section to fuzzing.md doc (master...add-oss-fuzz) https://github.com/bitcoin/bitcoin/pull/21856
< robert_spigler> luke-jr: I agree with hugohn's analysis
< bitcoin-git> [bitcoin] Rqcker opened pull request #21857: Pull request (master...master) https://github.com/bitcoin/bitcoin/pull/21857
< bitcoin-git> [bitcoin] Rqcker closed pull request #21857: Pull request (master...master) https://github.com/bitcoin/bitcoin/pull/21857
< luke-jr> robert_spigler: I'm not saying it should or shouldn't be a certain answer - but it needs to be in the BIP, whatever it is :p
< jnewbery> Hi folks. We have a p2p meeting scheduled in 10 minutes. Currently there aren't any proposed topics at https://github.com/bitcoin-core/bitcoin-devwiki/wiki/P2P-IRC-meetings.
< gmaxwell> Current bitcoin core causes really old nodes to be unable to sync and to dos attack you. The issue is that prior to 0.7-ish the size of headers messages wasn't limited, and the node requests headers from everyone. They blast you with a multimegabyte header and disconnect and go do it to someone else. These old nodes seem to have also formed a fork at an early height, nodes that have this
< gmaxwell> fork will send it to you and get rejected for sending a fork before the highest checkpoint-- another product of requesting headers.
< gmaxwell> Fixing this appears to be trivial: Just don't ever request headers from peers that aren't NODE_WITNESS-- the node will never request blocks from them anyways. But I changed this and it breaks like a dozen tests, some of which didn't seem trivial to fix.
< gmaxwell> Someone who cares about p2p might want to fix this.
< gmaxwell> (until I stopped sending headers requests to non-node-witness peers, this garbage was a few percent of my (pruned) node's bandwidth usage)
< gmaxwell> (my correct behavior might be slowly fixing the second forked-chain problem since I should be healing the partition)
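A sketch of the shape of the gate gmaxwell describes, not the actual patch: only start headers sync from peers that signal both NODE_NETWORK and NODE_WITNESS. The service-flag values follow protocol.h; the surrounding structure is invented for illustration.

```cpp
// Illustrative only: skip the initial GETHEADERS for peers that don't signal
// NODE_WITNESS, since blocks will never be requested from them anyway.
#include <cstdint>

using ServiceFlags = uint64_t;
constexpr ServiceFlags NODE_NETWORK = (1 << 0); // values as in protocol.h
constexpr ServiceFlags NODE_WITNESS = (1 << 3);

struct PeerState {
    ServiceFlags services{0};
    bool sync_started{false}; // analogue of CNodeState::fSyncStarted
};

// Would we pick this peer to start headers sync from?
bool ShouldRequestHeaders(const PeerState& peer)
{
    if (peer.sync_started) return false;
    if (!(peer.services & NODE_NETWORK)) return false; // roughly the existing !fClient check
    if (!(peer.services & NODE_WITNESS)) return false; // the proposed extra gate
    return true;
}
```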
< jnewbery> #startmeeting
< core-meetingbot> Meeting started Tue May 4 21:00:29 2021 UTC. The chair is jnewbery. Information about MeetBot at https://bitcoin.jonasschnelli.ch/ircmeetings.
< core-meetingbot> Available commands: action commands idea info link nick
< amiti> hi
< jnewbery> #bitcoin-core-dev Meeting: achow101 aj amiti ariard bluematt cfields Chris_Stewart_5 digi_james dongcarl elichai2 emilengler fanquake fjahr gleb glozow gmaxwell gwillen hebasto instagibbs jamesob jb55 jeremyrubin jl2012 jnewbery jonasschnelli jonatack jtimon kallewoof kanzure kvaciral lightlike luke-jr maaku marcofalke meshcollider michagogo moneyball morcos nehan NicolasDorier paveljanik
< gleb> hi
< glozow> hi
< jnewbery> petertodd phantomcircuit promag provoostenator ryanofsky sdaftuar sipa vasild wumpus
< sipa> hi
< ajonas> hi
< jnewbery> There aren't any proposed topics in https://github.com/bitcoin-core/bitcoin-devwiki/wiki/P2P-IRC-meetings
< jnewbery> Does anyone have any topics to add?
< sipa> i'm working on PR'ing minisketch
< sipa> i think it's in a good enough state now
< jnewbery> \o/
< gleb> sipa: thank you! I'm working on more or less finalizing erlay to invite people running nodes. Fully went through antoine's review, gonna make one protocol tweak it seems.
< gleb> *to invite people to run nodes.
< sipa> gleb: cool, will run
< glozow> \o/
< jnewbery> sipa: are you looking for any specific help?
< sipa> not in particular, but review is of course always welcome
< sipa> with this PR https://github.com/sipa/minisketch/pull/20 (making us able to just build field size 32 instead of everything) the binary size goes from 3 MB to 64 KiB
< sipa> also way faster compilation
< jnewbery> that seems good
< jnewbery> ok, any other topics?
< jnewbery> alright!
< jnewbery> #endmeeting
< core-meetingbot> topic: Bitcoin Core development discussion and commit log | Feel free to watch, but please take commentary and usage questions to #bitcoin | Channel logs: http://www.erisian.com.au/bitcoin-core-dev/, http://gnusha.org/bitcoin-core-dev/ | Meeting topics http://gnusha.org/bitcoin-core-dev/proposedmeetingtopics.txt / http://gnusha.org/bitcoin-core-dev/proposedwalletmeetingtopics.txt
< core-meetingbot> Meeting ended Tue May 4 21:07:03 2021 UTC.
< jnewbery> gmaxwell: I'm surprised there are a dozen tests with nodes that aren't signalling NODE_WITNESS. Do you have a branch?
< jnewbery> as far as I'm aware, the only functional test with a node that doesn't signal NODE_WITNESS is p2p_segwit.py (and that should be removed in #21090
< gribble> https://github.com/bitcoin/bitcoin/issues/21090 | Default to NODE_WITNESS in nLocalServices by dhruv · Pull Request #21090 · bitcoin/bitcoin · GitHub
< gmaxwell> Well dozen is probably an exaggeration, it was more than p2p_segwit. This might implement the expected functionality: https://0bin.net/paste/1ugjq4jz#XcT5A29tp0YlpDnYDs9NRD92Z9myByRpsJOMzwButl4
< gmaxwell> (I say might because the patch I run locally is much larger, but it looks like I previously saved out that chunk while trying to run the tests.)
< gmaxwell> gleb: what protocol change?
< jnewbery> Did you try gating on fHaveWitness instead of fClient?
< gmaxwell> No, though we shouldn't be sending headers requests to non-node-network peers either. But ah, I see your point wrt the implicated tests.
< gleb> gmaxwell: antoine found this inconsistency, let me explain here in short (full convo is here: https://github.com/bitcoin/bitcoin/pull/21515#discussion_r624952992)
< sipa> gleb: was it needed to include minisketch in bitcoin-tx?
< gmaxwell> (my sequence was that I just fixed it for myself to see that it fixed the behavior... then just tried running the tests to see if it looked like it would be easy to submit upstream; it failed a number of tests, so I just dropped it into the patch I carry.)
< gleb> gmaxwell: Ah hold on, maybe ignore that.
< gleb> I just realized maybe it's fine all the way.
< gleb> sipa: sorry, are you referring to the way I integrate minisketch in the erlay PR?
< sipa> gleb: yeah, cherry picking that
< sipa> just wonder if there was a reason for that
< gleb> gmaxwell: yeah indeed, I withdraw my comment, no protocol tweaks. Will finish last touches per antoine feedback tomorrow, and freeze the version to run nodes.
< gmaxwell> gleb: good because I was about to respond to you here after reading that, saying I didn't understand the issue there. :P
< gleb> gmaxwell: I'm glad you're following the work anyway.
< gmaxwell> gleb: I think in general when we talk about erlay (e.g. in the paper and elsewhere) we use this "non-reachable peer" terminology, but non-reachability doesn't exist explicitly anywhere in the implementation of bitcoin core or the bitcoin protocol... so although non-reachable peers were a good concept for modeling, it can be a little confusing when it comes to the implementation.
< gleb> gmaxwell: Agree, I should think if I could gracefully disconnect the "general intuition" from protocol language.
< sipa> right, the protocol should only concern itself with "inbound" and "outbound" and "recon sender" and "recon receiver"
< gleb> Also, I should think about non-reachable nodes with a node behind the same NAT or something, I just realized our current approach may be vulnerable in this case?
< gleb> Say my "non-reachable" node A (as the peers can easily tell by probing IP) has my own trusted inbound node B and that's the only connection for B, then the peers can easily tell that transactions flooded from B are either from B or from A.
< gleb> I should admit I never considered this corner case.
< sipa> hmm, do we just want to disable reconciliation on connections to/from non-public IPs?
< gmaxwell> that shouldn't be necessary.
< sipa> i think the problem is: you have internal node A, only connected to bridge node B, which is connected to the public network
< sipa> if A relays a tx to B, B will treat this as being a reachable node, and use erlay to relay it further... which it would never do if B were just a non-reachable node without internal nodes behind it?
< gleb> sipa: right. Or maybe if there are N bridges, but A is still generally unavailable.
< sipa> sorry, i swapped notation; my A and B are the B and A in your explanation
< gmaxwell> I don't want to comment much because I've probably forgotten everything but this seems confused to me. The privacy loss comes from conditional flooding; using the reconciliation itself is just additive and harmless.
< sipa> gleb: is the logic "only use recon for transaction received from inbound connections" now?
< gleb> sipa: only use flood in that case.
< gleb> Ok, there are 2 relevant flags.
< sipa> gmaxwell: the choice whether to relay through recon or flood reveals information
< gmaxwell> sipa: It should always relay through recon (otherwise it's potentially just adding missed items to the sets), it might choose to additionally relay through flooding.
< sipa> gmaxwell: that's not what erlay proposes
< gmaxwell> I don't remember it being broken. :P
< gleb> sipa is correct, it was always either or
< sipa> come on
< gmaxwell> sipa: failing to add it to recon strictly wastes bandwidth, unless I'm confused.
< gmaxwell> Quite possible I'm confused.
< gmaxwell> If you don't add it to recon but your remote peer does, you just end up with a fake difference.
< gmaxwell> As your remote peer won't make the same flood/no-flood decision you do.
< gmaxwell> Am I confused?
< sipa> i'm trying to swap in the whole idea again :)
< gleb> Yeah same here, gimme a second.
< sipa> gleb: what happens with locally generated transactions?
< gmaxwell> I don't think this changes the fundamental issue gleb is looking at though.
< gmaxwell> But I'm trying to remember how any of this works so every piece of confusion is standing out for me.
< gleb> sipa: The idea was to always reconcile local transactions. It's done to preserve privacy of non-reachable, but you won't understand why unless you know the full protocol...
< sipa> gleb: so in which cases do you only flood a tx?
< gleb> sipa: if the transaction is received from inbound AND the peer is 1 of 8 outbound flooding peers.
< gleb> The idea is: non-reachable nodes have no inbounds, so they never flood (so local txs should also not be flooded)
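Put as a toy decision function (invented names, not the erlay PR code), the rule gleb states is: flood a transaction to a peer only if it was learned from an inbound connection and the peer is one of the ~8 chosen outbound flooding peers; locally generated transactions, and everything else, go through reconciliation.

```cpp
// Toy restatement of the relay rule described above; not the erlay implementation.
struct TxSource {
    bool from_inbound{false};      // learned via an inbound connection
    bool locally_generated{false}; // our own transaction
};

struct RelayPeer {
    bool outbound{false};
    bool flood_target{false};      // one of the ~8 chosen outbound flooding peers
};

enum class RelayMethod { FLOOD, RECONCILE };

RelayMethod ChooseRelay(const TxSource& tx, const RelayPeer& peer)
{
    // Local transactions are always reconciled, so the node doesn't mark
    // itself as the likely originator by flooding them.
    if (tx.locally_generated) return RelayMethod::RECONCILE;
    if (tx.from_inbound && peer.outbound && peer.flood_target) return RelayMethod::FLOOD;
    return RelayMethod::RECONCILE;
}
```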
< sipa> which for now is effectively all outbound peers?
< gleb> sipa: yes, now. And my confusion was the case after bump, but now I think that case is fine.
< gleb> The bridge case is not fine though. When this non-reachable is a bridge.
< gmaxwell> If a transaction that comes in from an inbound peer is only flooded out and not reconciled, how will those txn ever end up in reconciliation? :P
< sipa> i believe the solution is just changing the criterion to "received from inbound *with public IP* AND ..."
< gmaxwell> ehhh
< sipa> but i still want to understand this again
< sipa> because i don't remember why this flooding is done in the first place
< gmaxwell> I still don't believe that it floods instead of reconciling rather than in addition to, I think that is either transparently wrong or I'm badly corrupted. :)
< sipa> gleb: you say "received", is that both when received through flooding and through recon?
< gleb> sipa: currently there is no distinction, since we added an extra round for INV (just non-duplicate INV). It's possible to compare by short id, but currently I don't.
< gleb> It all looks like inbound inv.
< sipa> ok, just trying to reconcile my memory
< gleb> gmaxwell: I have to think why I didn't do additive
< lightlike> isn't recon/flooding a property of the connection instead of the transaction? So that a tx received from an inbound would be flooded to outbounds, but reconciled with other inbound peers?
< sipa> lightlike: no
< sipa> (well, both)
< gmaxwell> My stored approximation of erlay is that you reconcile with peers, and also flood each transaction you receive, regardless of how you receive them, with low fanout to outbound peers (obeying the normal rules about not sending a peer a txn you already know they have). I know this isn't exact in the details.
< gleb> yeah, both.
< sipa> so the decision to flood and not reconcile should correspond to a prediction that you will have that tx while your peer won't
< sipa> how does "received from an inbound connection" lead to that prediction?
< gleb> gmaxwell: The problem with that is that non-reachable nodes will do too much useless flooding. They learn a transaction from 1 reconciliation, and then they flood it to all the rest of their outbound peers?
< gmaxwell> gleb: If nodeA and nodeB both receive tx X, and nodeA decides to recon it, and nodeB decides to flood it, then the recon wastes bandwidth. The process I understood you describing above wouldn't have peers making the same recon decision for the same txn...
< gleb> gmaxwell: ok, my approach works, and let me explain.
< gmaxwell> gleb: that is why the flooded relay is low fanout.
< gleb> Node A doesn't add tx to the set_B because it flooded the tx to B. Node B doesn't add it to the set_A because it was received from A.
< gleb> That's why both sets won't have the tx at the end
< gmaxwell> gleb: But node_b also could have received it from someone else first.
< sipa> yes, it's clear to me you don't want to both flood and add to recon set, because the peer won't relay the tx back to you, so adding it to the set would hurt reconciliation
< gmaxwell> sipa: Why wouldn't they add the flooded txn to it, if was new to them? It's free if both sides have it.
< sipa> the question is why is there an assumption in this received-from-inbound-and-relay-to-outbound case that there won't be a relay in the other direction (as in: why can't the peer have learned the tx from elsewhere)
< sipa> gmaxwell: any tx you actually receive announced through flooding you can delete from that peer's recon set (not sure if the current implementation does that), but if it does, that seems strictly better than adding
< sipa> (in case you flood)
< gmaxwell> Consider, nodeb gets a txn from c, nodea also gets a txn from c. Nodea elects to flood it to b, nodeb elects to recon it to a. Now there is an excess set difference. If instead nodea flooded it and added it to recon, and B did as well (including for flooded txn received from a) the only time there would be a recon difference is during a race with the flooding.
< sipa> delete if it was in that set in the first place, of course
< gmaxwell> okay I agree that would also work.
< gleb> Yeah, I see the point indeed. We should get rid of this extra assumption
< gmaxwell> I think they're ... the same? in both cases you get an extra item in recon during a race between a flood and a recon.
< sipa> gleb: do you do this currently in the code? if i INV a tx to you for whatever reason, and that tx was in your recon set with me, do you delete it from that set?
< gleb> sipa: no, but I'm realizing now I should.
< sipa> gmaxwell: yeah, you either want to both add it, or both delete it when flooded- deleting seems slightly more efficient
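A small sketch of the bookkeeping sipa is suggesting (invented names, not the PR's data structures): when a transaction is announced to or received from a peer via flooding, erase it from that peer's reconciliation set so the flooded item can't later surface as a spurious set difference.

```cpp
// Illustrative per-peer state; a real implementation would key on wtxids.
#include <cstdint>
#include <set>

using Wtxid = uint64_t; // stand-in for the 256-bit wtxid

struct ReconPeerState {
    std::set<Wtxid> recon_set; // items queued for the next reconciliation round

    void QueueForReconciliation(Wtxid wtxid) { recon_set.insert(wtxid); }

    // Flooded in either direction: the peer is now guaranteed to know the tx,
    // so keeping it in the set would only inflate the sketch difference.
    void OnFloodedInvSent(Wtxid wtxid) { recon_set.erase(wtxid); }
    void OnFloodedInvReceived(Wtxid wtxid) { recon_set.erase(wtxid); }
};
```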
< gmaxwell> In any case, the model where everything is recon and some flooding occurs at low order doesn't have the issue that outbound only nodes behave differently, other than they just happened to not receive any txn via flooding so they'll get txn later. I'm not arguing that it should work that way, just trying to remember the motivation for it not working that way.
< gmaxwell> sipa: Agreed. Though I'm not clear why deleting is slightly more efficient? just that never adding it on one side takes less computation?
< sipa> gmaxwell: yes, it's a ~ns scale difference
< sipa> not adding on both sides
< gleb> gmaxwell: well, I chose to reconcile in a queue-manner every 2 seconds (16 seconds per peer). And the queue consists of outbounds.
< gleb> If we stick to these rules, we also want to flood some to make relay across the "backbone" faster.
< gleb> gmaxwell: how would you suggest to flood?
< gleb> My suggestion is to flood to 8 fixed outbounds, and only what was received inbound. In that case, it just gets relayed through the network of reachable nodes super fast, without non-reachable nodes ever doing useless stuff.
< gmaxwell> sipa: GOOD. I am glad to agree!
< gmaxwell> gleb: okay, I don't understand how I was ever okay with that. lol
< gmaxwell> The issue is that it imposes a strong assumption on the existence of a "non-reachable node" -- which as you are realizing now (and maybe did before?) -- no such clean distinction really exists.
< gmaxwell> (e.g. the NATed node with some lan peers)
< gleb> gmaxwell: not really, I never have "if non-reachable". It just makes this implicit categorization by looking at "txs received inbound"
< gleb> Which is, yeah, not true for NAT stuff.
< gmaxwell> I don't have any of our old discussions-- lost the server :( -- it might be useful to see how we changed from "flood each new txn to a few outbound peers" to "flood to 8 outbound peers (currently all) but only things learned from inbound peers"?
< sipa> gmaxwell: by new do you mean "locally created", or "any new tx learned through whatever means at all" ?
< gleb> gmaxwell: despite us not liking it, non-reachable nodes are the majority of the network. If they flood, it's very likely to be useless, because their public peers likely already know the tx.
< sipa> (i assume the second, as the first is a pretty bad privacy leak)
< gmaxwell> newly learned, however we learned it.. flooding, local, recon.
< gmaxwell> Anything AcceptToMemoryPool accepts.
< gmaxwell> gleb: yes, its often useless. But it is still a constant amount of data per accepted transaction.
< gmaxwell> (and not an amount that grows with the number of peers it has)
< gmaxwell> if fanout > 1 the vast majority of flooding will always be useless.
< gleb> A reconciliation happens every 16 seconds, the flood trigger is every 1/2 seconds or something. If we flood only to outbound, non-reachable nodes will flood almost all transactions to their reachable peers. So the recon set will be empty?
< gleb> You suggest trying fanout=2/3? If fanout=8, we end up getting 0 gain today (just better scaling)
< sipa> the fact that flooding is only outbound does imply that more well-reachable nodes will learn about transactions faster than less-reachable nodes
< gmaxwell> Right, at least at one point I know I was thinking in terms of fanout=2.
< gmaxwell> perhaps we should reconsider that asymmetry, and instead do something like flooding only happens in one direction on each link, and whichever side knew more transactions in the last recon is the potential flooder.
< sipa> so in that sense i can see how deciding to not flood outbound-received transactions may be a useful optimization... the network structure implies that you're receiving generally from a group of peers who likely are better connected than you are
< gmaxwell> sipa: yes I am starting to see how things ended up here.
< sipa> i don't really see the problem with it
< gleb> And we can always switch to something different if at some point NAT landscape changes?
< gmaxwell> Basically nowhere else in the bitcoin protocol is there this in/out distinction -- just in tx relay privacy timers (which were 'recently' introduced relatively speaking) and in peer eviction.
< gmaxwell> for one thing it takes 80% or whatever of bitcoin nodes out of participation in rapidly forwarding transactions, which seems ... like a really big step to take without an obvious reason.
< gmaxwell> I'm not sure why we'd intentionally do that.
< gleb> gmaxwell: the timings were 1s for 95% reachable, 5s for 95% non-reachable. Something along those numbers.
< gleb> (the time it takes to spread a transaction). Saying "rapidly forwarding" is not so different
< gleb> Again, assuming the topology.
< gmaxwell> Like, have we made a series of single steps that were logical but give a weird outcome? Step 1. relay faster to outbound peers because we control them, so they're less likely to be spies monitoring our txn. Step 2. Introduce reconciliation, but make flooding only happen for reachable nodes because it has some moral similarity to step 1. Conclusion, 80% of bitcoin nodes no longer participate
< gmaxwell> in fast propagation? But that wasn't the original goal, it seems like a side-effect though perhaps a benign one?
< gmaxwell> but not completely benign since now we have problems with txn originated 'near' these flooding non-participants.
< gleb> gmaxwell: I believe I made those steps, I even had 1-3 steps above. I always tried to keep you and pieter in sync, but you might have got lost at some point.
< sipa> but even without the asymmetry in flood relay speed between outbound/inbound, it is the case that better connected nodes (which is correlated with connections to them) will learn about transactions faster
< gmaxwell> sipa: indeed, that was why the strawman alternative I suggested above elected whatever direction that sourced more txn to be the flooding direction.
< sipa> figure 19 in the paper shows bandwidth/latency simulations for a number of configurations
< gmaxwell> gleb: I probably wasn't lost, I probably was right there with you at the time, following along with whatever reasoning got us there.
< gmaxwell> :P
< gmaxwell> but I don't really get the reasoning now. :(
< sipa> "flooding is not as useful to peers which are better connected than you are" is the summary i think
< gmaxwell> sipa: re: better connected, esp if outbound connections are increased, it's not even unlikely that a NATed node could end up better connected than some arbitrarily selected inbound only node.
< gleb> sipa: in those experiments, I never tried to pick the flood peers in a smart way. It was always N, M% random across inbound/outbound.
< sipa> gleb: it is surprising to me that in fig 19 in the paper, increasing the number of outbound flood peers improves both bandwidth and latency
< sipa> gmaxwell: that's fair
< gleb> sipa: I can't explain that now.
< sipa> gleb: is that graph using the same logic (only flood transactions receiving on incoming connections)?
< gleb> sipa: yes I think so.
< sipa> that suggests that that decision (relay tx received via inbound through flooding to outbounds) is (in your simulation model) a very good predictor of who won't have a tx already
< sipa> basically you want to use flooding when you have reasons to believe you had a tx earlier than the recipient, while recon is for when you and your peer are equals
< gmaxwell> maybe I should step back and also try to explain my general unease with the hard in/out split. Essentially, it makes a strong assumption on the global structure of the network. Recently in dogecoin there were network collapse issues related to a similar asymmetry during IBD... where almost the entire network was all in IBD because it had fallen behind, and as a result wouldn't serve blocks
< gmaxwell> and only requested them from peers they connected out to (which were more themselves in the same broken state)
< sipa> and i guess "receiving through inbound" is a very good predictor for that (again, in your model, which may not actually correspond to reality)
< gmaxwell> sipa: If you flood txn received through flooding, then indeed, those are going to be the txn that are new. I don't actually know that in/out really matters for that to be true, except when only flooding to outbound makes it true.
< gleb> sipa: yeah, and that's achieved in the following way. Flooding is mainly used across reachable nodes. We cap at 8 outbounds, so even though "having it earlier" is only true 50% of the time, that's fine because it's capped.
< gleb> sipa: For reconciliation it matters mainly for non-reachable, and efficiency is achieved by reconciling every 2 seconds via queue. When you hit 2*8=16 seconds interval per peer, you already reconciled with 7 other peers since last time, so you are almost equal to that 8th peer.
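Roughly, the schedule described above could look like this: illustrative constants and names only, assuming ~8 outbound reconciliation peers serviced round-robin every ~2 seconds, so each peer is reconciled with about every 16 seconds.

```cpp
// Round-robin reconciliation scheduler sketch; not the PR's implementation.
#include <chrono>
#include <deque>

using NodeId = int;
using namespace std::chrono_literals;

struct ReconScheduler {
    std::deque<NodeId> queue;          // outbound reconciliation peers, in rotation order
    std::chrono::seconds interval{2s}; // one reconciliation started per interval
    std::chrono::steady_clock::time_point next_recon{};

    // Returns the peer to reconcile with now, or -1 if it is not time yet.
    NodeId MaybeNextPeer(std::chrono::steady_clock::time_point now)
    {
        if (queue.empty() || now < next_recon) return -1;
        next_recon = now + interval;
        NodeId peer = queue.front();
        queue.pop_front();
        queue.push_back(peer);         // ~8 peers in the queue => ~16s per peer
        return peer;
    }
};
```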
< gleb> Sorry for keeping the reachable terminology, I'm just explaining how sipa logic maps to my design.
< gleb> Yeah, if the topology changes, it would no longer be as efficient. We might see more bandwidth (unlikely more than currently)
< gmaxwell> thats fine. my comment about the term being confusing was because it looked like it was causing confusion in the review.
< gleb> well, i mean you also would prefer to avoid reasoning about it in the design i thought?
< gmaxwell> gleb: like I think with the currently proposed design, the bandwidth per node on reachable nodes may go up a lot... because non-reachable nodes that previously participated actively in forwarding turn into passive participants and don't forward much txn at all.
< gleb> gmaxwell: you mean sending tx bodies?
< gmaxwell> gleb: maybe, that's my impression now but it could just be that I forgot why I used to think it was okay. But I'm not confused by its current existence in the design
< gmaxwell> yes
< sipa> perhaps it is the case that even today most relay done by non-reachable nodes doesn't actually contribute much to tx propagation?
< gmaxwell> nah, it's like a 2:1 asymmetry as seen on my nodes.
< sipa> by "contribute" in don't mean in terms of bandwidth, but in terms of how much propagation speed would suffer if they stopped
< gmaxwell> (if you go through your IRC logs with me you'll see maybe a year ago I saw that and thought something was wrong that it was so asymmetric, then realized it was finally that nodes updated to the most recent broadcast timing logic)
< gmaxwell> sipa: I actually mean bandwidth though both might be true.
< gleb> gmaxwell: yes, I think this is very true. Reachable will take more work on forwarding tx bodies, making it even more asymmetrical. I think that's fair to expect.
< sipa> i don't see why that would be the case
< gmaxwell> just thinking through the implications of taking the majority of nodes out of the general forwarding process.
< gmaxwell> sipa: for discussion, assume 80% of nodes are behind nat. Right now, they probably carry the majority of the work transmitting txs (yes, they're slower, but there are many more of them). Post erlay, they won't relay tx bodies except fairly rarely.
< gleb> sipa: consider an average non-reachable node. The time between any reconciliations for it is 2 seconds. And the time of relaying across *all reachable* is 1 second. I think this implies most tx body work is done by reachable.
< gmaxwell> ^
< sipa> ok let me be more specific about what i'm not convinced of
< gmaxwell> that doesn't seem desirable to me, ... some asymmetry is unavoidable because they have fewer connections each.
< gleb> I probably was like "but they will still save a lot of announcements, so it's fine to increase tx body work"
< gmaxwell> gleb: I don't think I ever considered that particular aspect before.
< gmaxwell> (lol well, I really don't know what I thought before, it's been too long)
< sipa> i believe it is possible (but don't know) that if we could today change the code so that non-reachable nodes drastically reduce how much they perform tx announcements and that when doing so (a) bandwidth usage of reachable nodes won't be affected much and (b) propagation delays won't suffer much
< sipa> basically i'm saying that it could be the case that effectively a lot of tx propagation work performed today by non-reachable nodes is redundant
< gleb> I think greg's 2:1 doesn't agree with you.
< gleb> I assume, non-reachable sends 0.5x of reachable's tx body traffic?
< gmaxwell> sipa: sending tx _bodies_ is never redundant.
< gmaxwell> lemme go get current figures one sec.
< sipa> gmaxwell: ah good point, every body only arrives at every node once
< sipa> whatever non-reachable nodes are doing in that regard must be compensated by others
< sipa> ok, that's convincing
< gmaxwell> crap I lost my shell history on my node and now need to remember how to use JQ... I want to look at the ratio tx message sizes on inbound peers.
< gmaxwell> (it's just data in getpeerinfo)
< gleb> So potentially 2 problems with current design: 1) NAT privacy (could probably be fixed) 2) deepening the asymmetry
< gmaxwell> my very bad informal memory says that it was asymmetric, but except for bognon peers not THAT asymmetric.
< gmaxwell> bogon*
< gmaxwell> also: fwiw, we created most of this asymmetry 'recently' with the relay grouping stuff... and it wasn't an intended effect. When I did notice it I was confused/surprised. Though I don't think the current level is a problem.
< gmaxwell> I had automation that was identifying spies to ban based on part of that ratio and I noticed when it started flagging lots more peers.
< gleb> gmaxwell: was it symmetrical before groups? Even when we just had 2s/5s independent timers?
< gmaxwell> gleb: So, it wasn't asymmetrical enough for me to notice, but I think I was triggering on 2:1 as unusual. It's a little hard to say for absolute sure because deployments happen over time.
< gmaxwell> It's not impossible that the independent 2/5sec change was the cause, and it just took a long time to be deployed enough for me to notice (or some other factor made it more visible). But I think it's likely that making all inbounds share a group was the cause.
< gmaxwell> I didn't *really* realize something was going on that I didn't understand until I saw a big asymmetry on a connection between sipa's node and mine.
< gmaxwell> which I couldn't dismiss as being some weird peer.
< gmaxwell> well, sipa is weird but his bitcoin node isn't. :P
< gleb> gmaxwell: sad we didn't notice that during the shared timer design, that's the stuff i like to model... but I probably didn't know how to at the time
< gleb> I think that could have been my first PR to core in 2018.
< gmaxwell> Well you don't know what you don't know.
< gmaxwell> You could easily model the txdata burden, I just never thought of it.
< gmaxwell> The tx serving burden has come up before, but in a different context-- like if you get multiple INVs for the same tx at almost the same time, it's better to pick the source more randomly rather than always go to the first.
< gmaxwell> sorry.
< gleb> But that stuff we also changed in the context of invblock or tx overhaul or something recently? :)
< gleb> like, this exact suggestion, making it random and not first received
< gleb> So, if we want to make it more symmetrical, we indeed could direct some flooding into inbound, and independently from the tx source (maaaybe based on prev performance). I expect we lose some overall efficiency that way, but if that's desired.
< gleb> Yeah, that's exactly figure 19, except it chooses the flood peers randomly.
< bitcoin-git> [bitcoin] fanquake closed pull request #21854: doc: make small corrections (v0.21.1 release notes) (master...patch-1) https://github.com/bitcoin/bitcoin/pull/21854
< sipa> ./src/bitcoin-cli getpeerinfo | jq -r 'map("\(if .inbound then "I" else "O" end) \((.bytessent_per_msg.tx // 0) / ((.bytessent_per_msg.tx // 0) + (1 + .bytesrecv_per_msg.tx // 0)))") | .[]' | sort -g
< sipa> gmaxwell: something like that?
< gmaxwell> ./bitcoin-cli getpeerinfo | jq -c '.[] | select(."bytessent_per_msg"."tx">0) | select(."bytesrecv_per_msg"."tx">0) | select(."inbound"==false) | ."bytessent_per_msg"."tx"/."bytesrecv_per_msg"."tx"'
< gmaxwell> is what I ended up with.
< gmaxwell> but yea yours is better.
< gmaxwell> these need to be normalized though because naturally you'll only receive once.
< gleb> sipa: I'm looking at my simulator, and I'm realizing I had no "received from inbound" policy at the time. Same in the paper. It was always just the "flood to 8 outbounds" policy.
< sipa> gleb: that's surprising
< gmaxwell> that is anti-surprising to me! :P
< sipa> it conflicts with my intuition for why increasing flooding reduces bandwidth
< gleb> oh, hold on, sorry, the simulator is old code and hard to understand now :(
< gleb> It seems like at the time the idea was to only flood what's announced by reconciliation initiator to reconciliation responder (not backwards). Sorry again for this thought bouncing. Yeah, it seems i always had this idea in mind: try not to flood from non-reachable nodes.
< gleb> And since inbound always initiates, we effectively announce "what is learned inbound". The latest simulator has that it seems.
< gleb> Anyway, so should we try other configurations again and see how much bandwidth gain we lose by not making topology assumptions?
< sipa> just trying "flood relay everything to 2-3-4-5-6-7-8 outbound peers" ?
< gmaxwell> sipa: so my node has 65 peers that I've exchanged txn with, and overall has sent 3.26x the tx data it has received. I think that ratio is pretty low, considering it only receives once but sends up to 65 times.
< gmaxwell> gleb: also maybe add to the simulator something to get information on how much tx body data will be sent/received? I'm pretty sure the current scheme will behave pretty poorly in that respect.
< gleb> gmaxwell: you mean poor in the sense of asymmetry?
< gmaxwell> gleb: yeah, poor in the sense that total traffic on reachable nodes would increase a lot, while traffic at each non-reachable node decreases a bit.
< gmaxwell> (because the latter outnumber the former)
< gleb> Okay, maybe it's time to re-implement the simulator so that other people can actually review and use it easier. I'll start looking into it tomorrow.
< gmaxwell> :( I feel bad.
< sipa> this is a fairly simple policy change, so i don't think this needs to hold up code review much
< sipa> (if we'd make it)
< gmaxwell> yea I don't think it really changes the implementation much except for a few lines here and there.
< sipa> oh, and don't forget the "remove from recon sets whatever is sent/received through flooding" - i think that matters
< gleb> sipa: i agree, most of the module remains the same, this stuff is even outside the module.
< gleb> sipa: i noted it, don't worry
< sipa> gleb: thanks!
< gmaxwell> in the meantime, minisketch could get merged, and dropped from the erlay PR. That would be nice progress.
< gleb> okay, thank you, i'll come back once i have the results. I agree on the plan ^
< gmaxwell> sipa: I have this vague idea that since my tx body ratio is 3.26x that my inv data ratio should be also 3.26x, ideally. (of course it won't be, it'll be between 32 and 64x because I have 64 peers and invs are flooded)
< gmaxwell> oh, no it won't the invs ratio will be near 1 but both sides will be 64 times larger. duh.