#bitcoin-core-dev on 2018-09-21 — searchable irc log

00:23 < gmaxwell> https://bitcoincore.org/en/2018/09/20/notice/

00:25 < luke-jr> this seems way too premature; only 2% of the network's upgraded :/

00:27 < gmaxwell> unfortunately, the issue was made public.

00:27 < luke-jr> :/

00:28 < gmaxwell> it wasn't clear if it would propagate or die out, but since it was moderatly easy to discover on your own, even just the rumors of it risked someone exploiting it.

00:31 < achow101> gmaxwell: luke-jr: there's an r/btc thread about someone finding the bug

00:32 < gmaxwell> achow101: it's referenced in the message. 'circulating'

00:34 < jamesob> Wow, that was fast

00:40 < promag> jnewbery: fyi #14283

01:11 < gmaxwell> theymos has a new announcement up https://www.reddit.com/r/Bitcoin/comments/9hkoo6/new_info_escalates_importance_upgrading_to_0163/

01:20 < gmaxwell> should I PR the extended test case?

01:38 < luke-jr> gmaxwell: I'd wait

01:39 < gmaxwell> OK.

01:56 < luke-jr> did anyone mail the announcement ML about 0.16.3? I don't think I saw one..

01:58 < achow101> luke-jr: there was an announcement for it

02:00 < aj> is bitcoin-core-dev the announcement list? https://lists.linuxfoundation.org/pipermail/bitcoin-core-dev/2018-September/000060.html

02:00 < luke-jr> no

02:00 < luke-jr> this one https://bitcoincore.org/en/list/announcements/join/

02:03 < nanotube> would it make sense to propose a 'contact' page on bitcoin.org similar to the one on bitcoincore.org? it appears it is non-trivial to find where to privately report security issues unless one knows to go to bitcoincore.org, since earlz had to come asking on #bitcoin-dev and -core-dev for a way to report.

02:05 < gmaxwell> nanotube: :( I feel really uncomfortable with people going to bitcoin.org for that kind of information.

02:05 < harding> nanotube: any page on Bitcoin.org, top menu, Participate, Development, "to report an issue, please see the Bug Reporting page", Responsible Disclosure.

02:05 < gmaxwell> but its there

02:05 < gmaxwell> yea

02:11 < nanotube> yes not ideal, but people probably still go there unless they know to check bitcoincore.org... so, good that it has the bug reporting page in there somewhere.

02:58 < echeveria> there used to be bitcoin-security, but that was handled sort of poorly.

02:59 < echeveria> the contents of it ended up being published when someone stole the old satoshi email address.

04:17 < kanzure> hi. mailing list admin has hit a bug and is unusable at the moment.

04:18 < kanzure> We're sorry, we hit a bug!

04:18 < kanzure> Please inform the webmaster for this site of this problem. Printing of traceback and other system information has been explicitly inhibited, but the webmaster can find this information in the Mailman error logs.

04:22 < sipa> great.

04:33 < echeveria> I was looking at one of the public internet mapping tools for bitcoin core versions. there's a pretty disturbing number of hosts that have 8332 open.

04:36 < echeveria> is there some tool or setup guide that is telling people to open this port? I thought it was pretty difficult (not a single switch) to get Bitcoin Core to bind the RPC interface to 0.0.0.0.

04:46 < echeveria> of 8000 IPv4 nodes, 1142 have a RPC port 8332 that respond to a SYN.

05:10 < gmaxwell> maybe honeypots?

05:10 < gmaxwell> as you note, you must take extra steps to bind..

05:52 < echeveria> I don't think so. they're over a huge number of different hosts, old and new.

07:06 < ken2812221_> MarcoFalke: Your gpg signing key has expired

12:05 < Jmabsd> the code that qualifies and validates a segwit transaction starts on what code locations?

12:10 < Jmabsd> mostly validation.cpp's AcceptToMemoryPoolWorker

12:21 < Jmabsd> Where is the code that checks the witness merkle root in the coinbase transaction?

15:33 < emilengler> If I download the linux tarball, will I be able to select a download path for the blockchain ?

15:33 < luke-jr> yes, if you know how or use the GUI

15:34 < luke-jr> better topic for #Bitcoin

15:34 < emilengler> Ok I will keep this in mind excuse me

17:31 < provoostenator> I used invalidateblock on a remote node to go back to ~ 475000, but lost the connection after a few hours. The last debug message is from an hour ago, an updatetip down to 508954. It's in a weird state.

17:32 < provoostenator> Memory usage was swinging between 5GB and 8GB. I was able to shut it down via rpc, though the last message was "net thread exit" which sounds like an unclean exit.

17:34 < provoostenator> Restarting the node, now it's "Replaying blocks", "Rolling back ... 542229" and down from there.

17:59 < sdaftuar> provoostenator: what version bitcoind was it?

18:00 < provoostenator> sdaftuar: v0.17.0rc2 (I was actually dumb enough to not upgrade it before doing this)

18:01 < provoostenator> I also don't know if invalidateblock is supposed to work for such a huge rollback. Though if not, then perhaps the documentation should warn against that.

18:02 < sdaftuar> well, i think we do want it to work

18:02 < provoostenator> Also, the RPC call is blocking. Does getting disconnected have any bearing on that?

18:02 < sdaftuar> no, the invalidateblock function should continue even after the rpc client disconnects, i think

18:03 < sdaftuar> i believe if you had waited long enough, it probably would have finished?

18:03 < sdaftuar> but it might be several hours

18:03 < provoostenator> The logs also suggest it continued for about 30 minutes after disconnecting.

18:04 < sdaftuar> disconnecting blocks is heavily disk-bound. when i last looked at it (on different hardware than i use today), i think i noticed i could disconnect on the order of 3-5 blocks/second, on average

18:04 < provoostenator> It got about halfway in 2-3 hours, so indeed it looked like it would have made it.

18:04 < provoostenator> This is an iMac with SSD and plenty of memory.

18:05 < sdaftuar> we used to have an issue where the memory usage could grow sort of unbounded, as disconnected blocks would have their transactions added to the mempool

18:05 < sdaftuar> but that was fixed

18:05 < provoostenator> Yeah the weird thing I noticed is how dbcache kept growing as it was disconnecting.

18:05 < sdaftuar> but your comment about 5-8GB of memory has me slightly concerned

18:05 < provoostenator> The machine has 64 GB so it didn't run out.

18:05 < sdaftuar> what is -dbcache set to?

18:06 < provoostenator> 5000, so that's bad

18:07 < provoostenator> The mempool is just the default, so that shouldn't have grown so much, right?

18:07 < sdaftuar> yeah assuming the code works correctly, the mempool's memory usage would have been bounded pretty well

18:08 < provoostenator> Last log entry had cache=4650.7MiB

18:09 < sdaftuar> oh so that seems good then

18:10 < provoostenator> The 5-8 GB RAM usage was an hour after the last log entry, when I reconnected, found through "top".

18:14 < sdaftuar> alright well maybe this is all expected (crappy) behavior. i don't know of any clever ideas to speed up block disconnection, unfortunately.

18:14 < sdaftuar> maybe someone could implement https://github.com/bitcoin/bitcoin/issues/8037

18:17 < provoostenator> Maybe invalidateblock could have a "don't bother adding to the mempool" option?

18:19 < provoostenator> I just noticed I have txindex=1, so that could be another issue.

18:19 < sdaftuar> provoostenator: yeah that's fair but i suspect it would still take hours

18:19 < gmaxwell> provoostenator: it's not clear to me what you're saying you saw

18:19 < gmaxwell> provoostenator: was it still rolling back when you stopped it?

18:20 < gmaxwell> if it was then it just sounds like expected behavior.

18:20 < provoostenator> gmaxwell: rolling back was after I restarted (it's still doing that now).

18:20 < gmaxwell> I've rolled back all the way to block 0 many times, though not recently.

18:21 < gmaxwell> provoostenator: yea, it'll keep going until it finishes.

18:21 < kanzure> mailing list bug has been reoslved; can someone send the post mortem link to the mailing list subscribers plzkthx? like https://bitcoincore.org/en/2018/09/20/notice/

18:21 < sipa> kanzure: didn't BlueMatt send one?

18:21 < provoostenator> Before I disconnected (a few hours ago) it was doing "UpdateTip: new best ..." in reverse order, as expcted from doing invalidateblock

18:21 < kanzure> i don't see one in the mod queue

18:22 < kanzure> and i don't see it on https://lists.linuxfoundation.org/pipermail/bitcoin-dev/2018-September/thread.html

18:22 < provoostenator> The logs shows it kept doing that for 30 mins after I disconnected from the machine.

18:23 < provoostenator> When I logged back into the machine, bitcoind was still running, using 5-8 GB of RAM (it was actually going up and down in the space of minutes), but log wasn't updating. I then stopped it via rpc and restarted.

18:23 < provoostenator> So it seems it was still doing _something_, despite not logging.

18:23 < luke-jr> FYI, I still didn't get anything for 0.6.3/CVE from https://bitcoincore.org/en/list/announcements/join/ yet

18:24 < gmaxwell> provoostenator: what you're seeing without the logs is the atomic flush roll forward probably.

18:24 < luke-jr> sipa: ^ since you are one of the 3 who can apparently send those

18:24 < sdaftuar> provoostenator: gmaxwell: it does seem surprising i guess that an unclean shutdown happened?

18:24 < sdaftuar> how would that be possible if you just use rpc to stop the node?

18:25 < provoostenator> gmaxwell: what is a "atomic flush roll forward"?

18:25 < sdaftuar> provoostenator: on startup, we detect if the utxo state wasn't finished being written as of what we think our tip is.

18:26 < sdaftuar> in that situation, we have a rollback / rollforward mechanism to fix the utxo

18:26 < sdaftuar> by disconnecting blocks that are no longer on our chain, and replaying the blocks that might need applying to the utxo state

18:27 < sdaftuar> that should only happen after an unclean shutdown though

18:27 < gmaxwell> not to change subject but anyone know what this is? https://www.reddit.com/r/Bitcoin/comments/9hrusk/orhpan_blocks/e6e4zhk/?context=3

18:28 < gmaxwell> sdaftuar: indeed. I missed that the shutdown was supposted to be clean.

18:38 < provoostenator> gmaxwell: that warning is triggered here: https://github.com/bitcoinj/bitcoinj/blob/master/core/src/main/java/org/bitcoinj/core/AbstractBlockChain.java#L584

18:45 < provoostenator> "duplicate block" seems to mean that it already processed it, nothing to do with orphans.

18:53 < kanzure> *poke* postmortem email plzkthx

19:12 < provoostenator> Ok, so roll back made it to 509650 and all seems well. Except the node seems to have forgotten I invalidated block 485000, because it jumped right into IBD and is moving forward again.

19:16 < provoostenator> I guess that's because the node doesn't check the full block index at launch.

19:19 < provoostenator> Restarted, now using v0.17.0rc4, doing nanother invalidateblock. Memory usage is almost 2GB higher than cache= shown in the logs, and seems to outpace it.

19:19 < provoostenator> I've also turned off the index.

19:21 < gmaxwell> what are the actual dirty page counts?

19:21 < gmaxwell> we recently realized that OS cached pages in mmaped files show up in res.

19:22 < provoostenator> Also, it's still going even though bitcoin-cli stop said it would stop. I'll look at the dirty page counts...

19:22 < provoostenator> (note to self: do not google "top dirty pages")

19:23 < sipa> lol

19:23 < gmaxwell> oh sorry, pmap -x $(pidof bitcoind) | tail -n 1 | tr -s ' ' | cut -d' ' -f 4

19:26 < provoostenator> macOS doesn't have pmap, but vmmap gives me this summary: https://gist.github.com/Sjors/6b01711ccd0f96128c7db5230c85ae8f

19:28 < provoostenator> Also a long list of "mapped file", e.g. many "locks/index/*.ldb"

19:29 < gmaxwell> k, so ~2GB of your resident size is mapped files.

19:29 < gmaxwell> whats your dbcache setting?

19:30 < provoostenator> dbcache=5000 MB, the log currently says cache=2300 MiB, so that part makes sense?

19:30 < provoostenator> It's the just the other 6 GB that needs explaining. Memory usage is now 10 GB. 38 more and the machine is going to OOM, which I'm not going to allow.

19:32 < provoostenator> MALLOC_TINY is now at 8.7, so that seems to be the thing that's mooning.

19:36 < provoostenator> (actually this is still v0.17.0rc2, sorry, though hopefully that doesn't matter here)

19:36 < provoostenator> (no, it is v0.17.0rc4)

19:37 < provoostenator> kill has no effect, kill -9 did

19:46 < provoostenator> Getting fairly consistent behavior now, even with disablewallet=1. bitcoin-cli stop seems to stop the RPC server, but not the invalidation process. Curious if anyone can reproduce. I'll let it sync to the tip before trying again.

19:47 < sipa> if invalidateblock does not succeed, its state isn't writte

19:48 < sipa> it first disconnects the blocks, and then marks them as invalid

19:48 < sipa> so at startup they will be connected again if bitcoind was killed in the middle

19:50 < provoostenator> That makes sense. I wonder if it matter that I was essentially interrupting IBD with that invalidateblock call. Memory usage seemed way worse than what I saw earlier today.

19:50 < sipa> invalidateblock also keeps a list of transactions to re-add to the mempool after the invalidation completes

19:51 < sipa> i assume that's the memory usage you see

19:51 < provoostenator> Is it also not abortable once in progress?

19:52 < sipa> no

19:53 < provoostenator> Ok, so in that case the way to roll back a long way would be to do it in smaller increments.

19:54 < sipa> right

19:54 < sipa> that should work

19:55 < gmaxwell> sipa: the mempool usage is limited.

19:55 < sipa> gmaxwell: how?

19:58 < gmaxwell> https://github.com/bitcoin/bitcoin/pull/9208

20:01 < sipa> gmaxwell: it doesn't look like DisconnectedBlockTransactions enforces any memory limits

20:01 < gmaxwell> MAX_DISCONNECTED_TX_POOL_SIZE

20:02 < sipa> oh

20:02 < provoostenator> There's also this open issue: #9027

20:02 < gribble> https://github.com/bitcoin/bitcoin/issues/9027 | Unbounded reorg memory usage · Issue #9027 · bitcoin/bitcoin · GitHub

20:02 < sipa> yup

20:03 < sipa> i was expecting the code to be elsewhere, my bad

20:05 < provoostenator> I take great pride in doing stupid things that lead to a new release candidate, so hopefully you'll find something :-)

20:08 < gmaxwell> provoostenator: are you running txindex?

20:08 < provoostenator> No, I did the first time today, but turned that off in more recent attempts.

20:13 < provoostenator> Re incremental approach: I rolled back ~10,000 blocks using about 13GB of RAM, cache=2200 at the peak. Sounds like it's holding all transactions in memory.

20:16 < provoostenator> But then it gets weird. ERROR: AcceptToMemoryPoolWorker: Consensus::CheckTxInputs: ... bad-txns-premature-spend-of-coinbase, tried to spend coinbase at depth 92

20:17 < provoostenator> InvalidChainFound: invalid block [the block I invalidated]

20:17 < gmaxwell> thats normal.

20:17 < provoostenator> Yeah, but then it starts syncing again.

20:22 < provoostenator> Ok, now I think I destroyed my chain :-) At boot: "assertion failed: (!setBlockIndexCandidates.empty()), function PruneBlockIndexCandidates, file validation.cpp, line 2547"

20:32 < sipa> provoostenator: i found the issue

20:33 < sipa> it's specific to InvalidateBlock

20:33 < provoostenator> sipa: nice!

20:41 < provoostenator> sipa: is it because disconnectpool holds on to transactions which reference a shared_ptr<CBlock> pblock, so those don't get deallocated?

20:41 < sipa> provoostenator: no

20:44 < sipa> the event queue holds on to the shared_ptr<CBlock> objects in callbacks to DisconnectedBlock

20:44 < sipa> and InvalidateBlock doesn't limit the size of the queue

20:46 < sipa> provoostenator: could you check whether this issue also occurs when Rewinding?

20:47 < sipa> create a 0.13.0 node, sync it to tip, and then upgrade to 0.17+

20:47 < sipa> i suspect it is, and if that's the case, i would consider it a release blocker

20:48 < provoostenator> That's the rewind that happens if you upgrade a non-segwit node to a segwit node?

20:48 < sipa> yup

20:49 < provoostenator> I'll give it a try this weekend or early next week. Getting a bit late here. Maybe someone else gets to it first.

20:49 < sipa> thanks!

20:50 < provoostenator> I'm not looking forward to doing another release notes for 0.14 and 0.15 backports :-)

21:42 < MarcoFalke> About #14289, was it ever supported to call invalidateblock on a block very far back?

21:42 < gribble> https://github.com/bitcoin/bitcoin/issues/14289 | Unbounded growth of scheduler queue · Issue #14289 · bitcoin/bitcoin · GitHub

21:43 < sipa> MarcoFalke: i would say no, but it'd be a nice-to-have if it worked

21:43 < sipa> having invalidateblock 100000 blocks deep use a massive amount of memory is not a blocker, i think

21:43 < MarcoFalke> Ok, that was my impression because every time I tried that it would deadlock the node a bit until I got impatient and CTRL+C out

21:44 < MarcoFalke> If that is supported with reasonable memory gurantees, we should add a test/benchmark so it doesn't randomly regress

21:45 < MarcoFalke> Also my key is de-expired, but I am having issues uploading it to keyservers.

21:45 < MarcoFalke> All of them return some obscure proxy error or timeout or ...

21:46 < gmaxwell> MarcoFalke: yes worked fine since 9208.

21:46 < gmaxwell> the rpc will disconnect, because the rpc timeout isn't long enough for it to finish, but a node will happly work its way back to block 0.

22:02 < Murch> MarcoFalke: Luckily Keyservers may soonish be a thing of the past: https://wiki.gnupg.org/WKD