< bitcoin-git> [bitcoin] sipa closed pull request #19695: [do not merge] Test impact of secp256k1 endianness detection change (master...202008_test_appveyer_secp256k1) https://github.com/bitcoin/bitcoin/pull/19695
< vasild> Next step to TORv3 is at #19845, waiting for some reviewers to shoot it down :)
< gribble> https://github.com/bitcoin/bitcoin/issues/19845 | net: CNetAddr: add support to (un)serialize as ADDRv2 by vasild · Pull Request #19845 · bitcoin/bitcoin · GitHub
< jonatack> 👍
< bitcoin-git> [bitcoin] MarcoFalke pushed 3 commits to master: https://github.com/bitcoin/bitcoin/compare/4f229d8904f8...564e1ab0f3dc
< bitcoin-git> bitcoin/master fa39c62 MarcoFalke: test: inline hashToHex
< bitcoin-git> bitcoin/master fa188c9 MarcoFalke: test: Use MiniWallet in p2p_feefilter
< bitcoin-git> bitcoin/master 564e1ab MarcoFalke: Merge #19800: test: Mockwallet
< bitcoin-git> [bitcoin] MarcoFalke merged pull request #19800: test: Mockwallet (master...2008-testMiniWallet) https://github.com/bitcoin/bitcoin/pull/19800
< jonasschnelli> What is the process for merging "back" the state from the GUI repository? Is there a planned timeframe? Will there just be one PR to the main repository that includes all changes (like a backport PR)? Is that process documented already?
< jonasschnelli> ^ MarcoFalke
< promag> jonasschnelli: +1
< jnewbery> #proposed meeting topic: strategies for removing recursive locking in the mempool (https://github.com/bitcoin/bitcoin/pull/19872#issuecomment-688852261)
< bitcoin-git> [bitcoin] MarcoFalke opened pull request #19922: test: Run rpc_txoutproof.py even with wallet disabled (master...2009-testMoreMiniWallet) https://github.com/bitcoin/bitcoin/pull/19922
< promag> jnewbery: +1
< elichai2> MarcoFalke: I can't manage to reproduce the error in #19920, instead I'm getting a really weird bug without the full details, any idea why? (I'm getting this: https://pastebin.com/raw/MBzJ0ixB)
< gribble> https://github.com/bitcoin/bitcoin/issues/19920 | test: Fuzzing siphash against reference implementation [Request for feedback] by elichai · Pull Request #19920 · bitcoin/bitcoin · GitHub
< elichai2> found the bug :) you're not allowed to do `&*it` on null 😅
< bitcoin-git> [bitcoin] hebasto opened pull request #19926: gui: Add Tor icon (master...200909-tor) https://github.com/bitcoin/bitcoin/pull/19926
< bitcoin-git> [bitcoin] dongcarl opened pull request #19927: validation: Reduce direct g_chainman usage (master...2020-09-reduce-g_chainman-usage) https://github.com/bitcoin/bitcoin/pull/19927
< ariard> jonasschnelli: just replied on bip324, AFAICT real-or-random and I favor MACing the length a la Noise, even if, as Lloyd points out, we don't have a concrete exploitation of it
< achow101> jonasschnelli: IIRC PRs merged to the GUI repo are also pushed to the main repo simultaneously
< achow101> luke-jr: O.o
< achow101> the filenames are hardcoded...
< luke-jr> achow101: isn't this the directory name, actually?
< achow101> https://twitter.com/2btc10000pizzas/status/1303767335258542085 would indicate it affects the wallet.dat file
< achow101> although the previous tweet also suggests the directory name
< luke-jr> ackup.dat doesn't
< bitcoin-git> [bitcoin] vasild opened pull request #19929: sync: use proper TSA attributes (master...use_proper_tsa_attributes) https://github.com/bitcoin/bitcoin/pull/19929
< achow101> right
< achow101> allet.dat does but maybe he specified wallet.dat and not just ""
< achow101> so that would mean it's a wallet name handling thing
< luke-jr> "" is never on disk :p
< luke-jr> you saw that apparently the actual on-disk filenames are being renamed, right?
< achow101> yes
< luke-jr> I'm not 100% sure they know what they're talking about in that regard, but it's weird
< gwillen> off the top of my head, one way to eat the first character of a path component would be issues with quoting and backslash as a path separator on Windows
< luke-jr> hmm
< luke-jr> "It would have been Gentoo Linux with the wallet files on an NTFS partition." lol totally unexpected
< luke-jr> I doubt the other guy has the same setup tho
< gwillen> iiiiiinteresting
< gwillen> could be an issue in the NTFS driver, that thing was always marked 'experimental'
< luke-jr> what's the chance the other guy had Linux+NTFS tho
< achow101> luke-jr: yeah, other guy is Win 10
< achow101> could be an NTFS issue
< luke-jr> seems unlikely
< luke-jr> it's not like Windows and Linux share the same NTFS code
< achow101> if he only sees the problem on knots, then we could probably find the problem by looking at the diff?
< luke-jr> achow101: the second guy had it on Core
< luke-jr> I think
< gwillen> any chance the win 10 guy is using WSL or something weird like that?
< luke-jr> maybe if the Linux guy was using Captive NTFS.. he did say it was a long time ago
< gwillen> in that case why doesn't every windows user see it, though
< luke-jr> even the same user couldn't reproduce :/
< phantomcircuit> anybody know how many transaction outputs are in the chain? (not utxo, txo)
< sipa> years ago it was half a billion iirc
< andytoshi> i have a simple script i used for my mimblewimble presentation, i can get this number in a couple hours
< andytoshi> it seems to be taking 2-3 seconds per 100 blocks to scan, i don't remember it being so slow
< phantomcircuit> andytoshi, i've rigged up rescanblockchain to tell me
< andytoshi> ok cool. i had rigged the `getblock` rpc to dump the number of txouts per block and was using bash from there, but this is pretty brutal ... in the 40 minutes since i last spoke i'm up to block 200k. so it'll finish tonight :P
< sipa> andytoshi: it may not... there are barely any transactions before 200k i think
< aj> maybe update the coin stats index from #19521 and use that?
< andytoshi> hmm, so, i definitely did this in fall 2016 for scaling bitcoin milan and it only took a few hours
< andytoshi> i guess it's been 4 years :P
< gribble> https://github.com/bitcoin/bitcoin/issues/19521 | Coinstats Index (without UTXO set hash) by fjahr · Pull Request #19521 · bitcoin/bitcoin · GitHub
< aj> andytoshi: (maybe rusty's bitcoin-iterate is faster?)
< sipa> 7316308 transactions up to block 200000
< sipa> out of 566745810 in total
< sipa> phantomcircuit: given that there have now been more transactions than the total-txout figure i claimed earlier, you can safely disregard it
< yanmaani> luke-jr: Maybe worth adding a check for it?
< yanmaani> "if wallet.dat doesn't exist and allet.dat does, show a message box"
< yanmaani> "Hi, a very rare bug has occured. We would be happy if you could email us at asd@asd.com and tell us what filesystem drivers you're using. To fix it, open that folder and rename allet.dat again."
< yanmaani> bit hacky though
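A minimal sketch of the check yanmaani is suggesting, assuming Bitcoin Core's fs wrapper (src/fs.h); the function name is hypothetical and no such check exists in any PR:

    // Hypothetical sketch of the suggested "allet.dat" check.
    #include <fs.h>

    bool SuspectedTruncatedWalletName(const fs::path& walletdir)
    {
        // The reported bug renames "wallet.dat" to "allet.dat" (first
        // character eaten), so warn only on exactly that pattern.
        return !fs::exists(walletdir / "wallet.dat") &&
                fs::exists(walletdir / "allet.dat");
    }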
< phantomcircuit> i think we've regressed on IBD somewhere, i have a server that's comically overpowered and there doesn't seem to be anything bottlenecking it
< phantomcircuit> running steady at about 100 Mbps, cpu and disk basically idle, on a 1 Gbps connection
< sipa> phantomcircuit: do you have good peers to sync from?
< sipa> the stalling detection logic can kick out the worst peers, but whether you get actually good ones can be hit or miss
< yanmaani> what's usually the bottleneck for IBD? DB sync?
< aj> phantomcircuit: i often find i'm stuck on block X from a slow peer, while the other peers are on block X+500 or so
< sipa> yanmaani: depends... with lots of cache it's either network or (in-memory) utxo datastructure maintenance; with low cache it can be disk I/O
< phantomcircuit> sipa, it must be the eviction logic because i'm sure it would otherwise be network limited
< yanmaani> how much disk I/O do you need? It's just 300gb or so right?
< aj> yanmaani: disk io is mostly updating the utxo set, which is mitigated by cache
< yanmaani> Can't you disable disk IO during IBD for utxo set?
< sipa> yanmaani: yes, by making your cache big enough for the entire utxo set :)
< sipa> which is 8 GB or so
< yanmaani> No I mean can't you turn off DB sync and so?
< yanmaani> Or will it just need to spill to disk regardless?
< sipa> well the UTXOs need to be stored somewhere!
< sipa> how will you validate transactions otherwise?
< yanmaani> yeah, but there's no need to sync the database
< yanmaani> You can have MongoDB tier safety
< yanmaani> (during IBD)
< aj> you have to have a database, it can be in memory or on disk; if it's in memory, it's in cache
< sipa> if you set the cache big enough to keep the entire utxo set in memory, there will be no database I/O whatsoever during IBD
< sipa> and it'll be flushed once at the end
< phantomcircuit> yanmaani, if you set the dbcache high enough you will only write to disk once when you shutdown the node
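For reference, dbcache is configured in MiB; a minimal bitcoin.conf snippet along the lines phantomcircuit describes, assuming sipa's ~8 GB estimate above covers the full UTXO set:

    # Keep (roughly) the whole UTXO set in memory during IBD, so the
    # chainstate is only flushed to disk once, at shutdown. Value in MiB.
    dbcache=8000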
< yanmaani> If I set it say 99% of the way, will I notice a sharp slowdown, or is it smart enough to cache as much as possible?
< sipa> it's a sawtooth function; our cache is kind of a weird mix between a buffer and a cache
< sipa> once it fills up, it's written entirely to disk, and cleared
< sipa> (the reason for this is an unusual design that lets us remove entries from the cache if they're created and deleted between flushes, without them ever hitting disk)
< sipa> and at least years ago, we tried several alternative designs that kept some part in memory when flushing, but this turned out to be always worse
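A rough sketch of the sawtooth behaviour sipa describes, with hypothetical names; the real logic lives in CCoinsViewCache and the flush code:

    #include <cstddef>
    #include <unordered_map>

    struct Coin { long long value; /* scriptPubKey etc. */ };

    class UtxoCacheSketch {
        std::unordered_map<long long, Coin> entries; // keyed by outpoint
        std::size_t budget;                          // dbcache-style limit

        void WriteAllToDisk() { /* batch-write every entry to the db */ }

    public:
        explicit UtxoCacheSketch(std::size_t b) : budget(b) {}

        void AddCoin(long long outpoint, Coin c) {
            entries.emplace(outpoint, c);
            if (entries.size() * sizeof(Coin) > budget) { // crude estimate
                WriteAllToDisk(); // once full, flush *everything*...
                entries.clear();  // ...and continue with an empty cache
            }
        }
    };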
< yanmaani> And I'm guessing the database is being interacted with by fwrite() rather than mmap
< sipa> it's leveldb
< sipa> so whatever leveldb uses, which is a mix (iirc it's all fwrite on 32-bit platforms, and a combination of fwrite and mmap on 64-bit ones)
< yanmaani> So it's leveldb but with a custom cache?
< sipa> it's probably better to call it an in-memory database, backed by an on-disk leveldb database
< sipa> leveldb has its own caching too
< yanmaani> wouldn't a file-backed mmap be better?
< yanmaani> it's like malloc but with explicit swap handled by the OS
< sipa> you're welcome to try, but we're really talking about different layers
< sipa> the on-disk caching layer is a byte array
< yanmaani> No, I mean instead of the RAM blob being used for in-ram caching
< sipa> there is no RAM blob
< sipa> there is an in-memory database, with expanded, efficient, data structures
< sipa> not serialized bytes
< yanmaani> Isn't the in-memory database in RAM????
< sipa> yes, but it's not a blob
< sipa> no need to yell
< yanmaani> It's several mallocs?
< sipa> yes
< yanmaani> so, wouldn't it make more sense to replace them with backed mmaps that you never flush? Then the OS would have a lot more liberty to optimize
< yanmaani> than if you force it into RAM
< sipa> seriously, you're welcome to try
< sipa> i've spent months on optimizing that stuff
< sipa> it's a highly unusual design, but yes, based on the experiments we did back then, it works very well
< yanmaani> huh
< sipa> the unusual part is that the UTXOs really have a create / look-up-once / delete-immediately cycle
< sipa> which is very strange for databases
< sipa> typical databases aren't designed to take advantage of the degree to which looked-up entries are immediately deleted
< sipa> (and they'll instead create some sort of log that contains the creation and deletion, which still get written to disk at flush time)
< sipa> by having an allocation per entry, you can just throw it away instantly when spent, and forget about its existence entirely
< sipa> if you have a few hundred MB or more of cache, it means most UTXOs never hit disk at all
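Bitcoin Core tracks this with FRESH/DIRTY flags on each CCoinsCacheEntry; below is a simplified, hypothetical rendering of the erase-on-spend idea:

    #include <unordered_map>

    struct Entry {
        bool fresh; // created since the last flush; the db never saw it
        bool spent;
    };

    std::unordered_map<long long, Entry> g_cache; // keyed by outpoint

    void SpendCoin(long long outpoint)
    {
        auto it = g_cache.find(outpoint);
        if (it == g_cache.end()) return; // real code would load from disk
        if (it->second.fresh) {
            g_cache.erase(it);       // never hit disk: forget it entirely
        } else {
            it->second.spent = true; // deletion recorded at the next flush
        }
    }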
< yanmaani> Wouldn't mmaps do this nearly as efficiently? Or is the OS too eager to flush changes?
< sipa> sigh
< sipa> you're talking about a different layer
< yanmaani> No, I mean that the malloc is replaced by a mmap
< yanmaani> And the mmap'd file is then treated like a RAM buffer of 8 GB
< sipa> then you'd get inconsistent state on disk in case of a crash
< yanmaani> Yeah, is that a problem?
< sipa> yes
< yanmaani> Can't you just remove the UTXO state in case of a crash?
< yanmaani> at least during IBD
< sipa> and sync from scratch? :o
< yanmaani> if you make it fast enough it should be a gain on net
< yanmaani> and dropping ACID guarantees and giving it the MongoDB treatment seems like it would make things faster
< sipa> in pruned mode, you'd need to start over redownloading even
< yanmaani> yeah, that's true. For pruned mode, you'd need to make sure it was synced properly.
< yanmaani> Although if you're substituting malloc() for mmap() of a temporary file, isn't the persistence as good? "The synced stuff stays, the stuff in RAM doesn't"
< sipa> there is no guarantee that mmap flushing happens in the same order as writes
< sipa> is there?
< yanmaani> no
< yanmaani> if it crashes, your mmap will be garbage
< yanmaani> but if you're using it to substitute malloc it should be fine
< yanmaani> since there's no expectation malloc persists on crash
< sipa> ah, i see
< sipa> what advantages would this have? if used with the same cache size as you'd use now, it wouldn't be any faster or have other advantages i think
< sipa> it'd permit you to make a cache larger than your ram, which may or may not be better
< sipa> depending on how fast disk is etc
< yanmaani> If used with the same cache size as you have now, it's roughly identical, but uses less RAM/is more fair
< yanmaani> (OS can swap it out as it needs if there's a deficit of RAM)
< yanmaani> if used with the max cache size*
< sipa> that assumes the OS can predict better what's useful to have cached
< yanmaani> it has some caching algorithm on a block level, yes
< yanmaani> and users who are using zram/zswap will benefit from compression
< yanmaani> it'll avoid sync disk writes in the cases where cache is too small
< sipa> sure, but it doesn't know for example that after deleting some UTXO entry it's no longer useful to keep it around
< yanmaani> after deleting the utxo entry, the ram is filled with something else surely?
< yanmaani> (or it's never touched again, in which case the OS won't give it a very high priority)
< sipa> at some point, sure
< sipa> anyway, you're welcome to try and benchmark :)
< yanmaani> yeah. Where is the cache?
< yanmaani> i.e. what file
< yanmaani> is it src/index/*/
< sipa> CCoinsViewCache in src/coins.h
< yanmaani> right, thanks!
< luke-jr> mallocs can get swapped out too..
< yanmaani> Only if you have swap enabled
< yanmaani> Otherwise it'll go straight to thrashing
< yanmaani> With a file-backed mmap, it can flush the pages to disk without consuming your swap
< sipa> yanmaani: it would add I/O though, because the OS will start writing dirty pages from the mmap to disk, and then you'll read them again and write them again when flushing to the "real" database on disk
< sipa> though that wouldn't be I/O on the critical latency path
< yanmaani> yes, but so does normal thrashing
< sipa> if the cache is so large that it gets swapped out to disk, you're better off picking a smaller cache
< luke-jr> yanmaani: did you see my recent PR?
< yanmaani> there are two options with malloc: either swap (if that's enabled), or thrashing (swap out libc). With mmap, you can also flush it
< yanmaani> sipa: not necessarily - it might figure out which bits aren't so useful, and swap out those, for a net gain
< luke-jr> #19873
< gribble> https://github.com/bitcoin/bitcoin/issues/19873 | [WIP] Flush dbcache early if system is under memory pressure by luke-jr · Pull Request #19873 · bitcoin/bitcoin · GitHub
< yanmaani> That might also work. I don't know which approach is better.
< sipa> yanmaani: given that every piece of data in the cache is accessed exactly once - when it's spent, i don't see how the OS could predict what is useful and what isn't
< yanmaani> I suppose I'll have to benchmark it
< sipa> yeah, it'd be interesting to know
< sipa> you'll need some mmap-backed allocator i guess
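A minimal sketch of such a file-backed arena on POSIX; the mapping's contents are garbage after a crash, matching the no-persistence expectation above. Error handling and the actual allocator interface on top are omitted:

    #include <cstddef>
    #include <fcntl.h>
    #include <sys/mman.h>
    #include <unistd.h>

    // Map a scratch file as writable shared memory; the kernel can page
    // dirty pages out to the file instead of to anonymous swap.
    void* CreateArena(const char* path, std::size_t bytes)
    {
        int fd = open(path, O_RDWR | O_CREAT, 0600);
        if (fd < 0) return nullptr;
        if (ftruncate(fd, (off_t)bytes) != 0) { close(fd); return nullptr; }
        void* p = mmap(nullptr, bytes, PROT_READ | PROT_WRITE,
                       MAP_SHARED, fd, 0);
        close(fd); // the mapping keeps the file contents accessible
        return p == MAP_FAILED ? nullptr : p;
    }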
< luke-jr> I think a more likely improvement would be to flag cache entries rather than delete them, when writing to db
< sipa> luke-jr: ?
< yanmaani> I wonder if it'd make more sense to write everything to DB and have it in some extremely lax sync mode
< yanmaani> so, take out the cache, and set the DB to MongoDB mode
< luke-jr> sipa: after flushing changes to db, keep them in memory in case they're read soon
< yanmaani> write during IBD, then flush when synced
< sipa> luke-jr: i tried that
< luke-jr> flag them so you know they don't need to be written anymore
< luke-jr> sipa: why didn't it work?
< sipa> luke-jr: at least as of a few years ago, it was never a win; the reason is that there is less memory available to exploit the "newly created entries that get deleted before ever hitting disk" case
< luke-jr> sipa: you'd delete the flagged entries when you need more space?
< sipa> luke-jr: yes, i believe i tried something like that
< luke-jr> I don't see how this can be a lose :/
< sipa> where the flushing is done in two tiers; in one, you'd flush everything, but keep the most recently created half around
< sipa> and in the second tier, when the memory is full, delete all non-dirty entries
< sipa> luke-jr: because of extra CPU to walk the cache and find things to delete
< luke-jr> std::move it to a second cache? :x
< sipa> there are definitely more combinations that could be tried, not claiming it's a certain loss
< sipa> but after trying half a dozen things, i think it was time to give up :)
< sipa> it may also depend on relative speeds of RAM/CPU/disk
< sipa> this was also before the per-txout chainstate that was added in 0.15; that may have changed things
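A toy rendering of the two-tier scheme sipa describes having tried (hypothetical names; per the discussion it lost to the simple clear-on-flush design, largely due to the extra CPU spent walking the cache):

    #include <unordered_map>

    struct CacheEntry {
        bool dirty; // has changes the on-disk db hasn't seen yet
    };

    std::unordered_map<long long, CacheEntry> g_cache;

    void FlushKeepClean() // tier one: write everything, keep it cached
    {
        for (auto& [outpoint, e] : g_cache) {
            if (e.dirty) { /* write to db */ e.dirty = false; }
        }
    }

    void EvictClean() // tier two: memory full, drop the non-dirty entries
    {
        for (auto it = g_cache.begin(); it != g_cache.end();) {
            if (!it->second.dirty) it = g_cache.erase(it);
            else ++it;
        }
    }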
< luke-jr> hmm
< sipa> i did have a design a few years ago that i'd like to get back to at some point, which would permit flushing in the background without invalidating on-disk cache
< phantomcircuit> sipa, nvm, for some reason my gateway<->modem link was at 100 not 1000
< phantomcircuit> <.<
< sipa> eh, on-disk storage
< phantomcircuit> >.>
< luke-jr> hmmmm
< sipa> so you could be continuously writing the oldest entries, outside of the latency critical path
< sipa> though it'd need extra memory to keep things ordered
< yanmaani> Is std::unordered_map really the fastest in-memory kv store around?
< sipa> it's nontrivial to make it work correctly with reorgs etc, but not impossible, if i remember
< sipa> yanmaani: it's not
< yanmaani> But it's advantageous for some other reason? Or is it just being used for some small part?
< sipa> i think people have tried some variations
< sipa> the biggest difference would come from using different allocation strategies, i think
< sipa> this was tried before though: https://github.com/bitcoin/bitcoin/pull/16801
< sipa> #16801
< gribble> https://github.com/bitcoin/bitcoin/issues/16801 | faster & less memory for sync: bulk pool allocator for node based containers by martinus · Pull Request #16801 · bitcoin/bitcoin · GitHub
< achow101> how do I make a const unsigned char* into a span?
< sipa> achow101: do you have its length?
< achow101> yes
< sipa> Span<const unsigned char>(ptr, len) should work
< achow101> ah, thanks
< sipa> post c++17 we can add type inference, and you can use Span(ptr, len)
< luke-jr> doesn't C++17 include std::span anyway? :P
< sipa> luke-jr: no, that's only in c++20
< sipa> achow101: if you're passing to a function that takes a Span<const unsigned char> already, you can use fn({ptr, len})
< luke-jr> oh :x
< achow101> sipa: even better
< sipa> or {beginptr, endptr}
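Collected in one place, the Span usages sipa lists, using Bitcoin Core's Span (src/span.h); Consume and Caller are hypothetical:

    #include <span.h>
    #include <cstddef>

    void Consume(Span<const unsigned char> s); // callee taking a Span

    void Caller(const unsigned char* ptr, std::size_t len,
                const unsigned char* begin, const unsigned char* end)
    {
        Span<const unsigned char> s(ptr, len); // explicit construction
        Consume(s);
        Consume({ptr, len});   // braced init: callee already takes a Span
        Consume({begin, end}); // begin/end pointer-pair form
    }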
< jb55> is there something in 0.20.0 -> 0.20.1 that would cause it to redownload the blockchain? trying to figure out why it's doing that after I upgraded my kernel+bitcoin. no configs changed ...
< sipa> jb55: there shouldn't be any changes related to that in minor releases
< phantomcircuit> sipa, can confirm i am dumb
< jb55> hmm yeah I figured, maybe weird io issue