#bitcoin-core-dev on 2018-09-01 — searchable irc log

00:31 < jimpo> gmaxwell: By "like the undo data", do you mean just that there are flat file storing the large values and disk positions stored in LevelDB or are you suggesting specifically that the filters be computed in validation code and referenced by the block index?

00:33 < jimpo> in #14121

00:33 < gribble> https://github.com/bitcoin/bitcoin/issues/14121 | Index for BIP 157 block filters by jimpo · Pull Request #14121 · bitcoin/bitcoin · GitHub

01:13 < gmaxwell> jimpo: the former.

01:13 < gmaxwell> Flat files with the filters, indexed by position.

01:33 < jimpo> Yeah, that makes sense to me. I'll code that up and compare read/write perf.

01:46 < sipa> i doubt it matters much here; we're not throughput limited

01:46 < sipa> leveldb writes all data twice, which is a reason against writing huge things like blocks and undo data

01:49 < gmaxwell> also has varrious caching behaviors, also stores it somewhat inefficiently. I think that for recent blocks the filters are about 30KB per block? in any case if you think its okay it probably is.

01:49 < gmaxwell> I was also thinking about future dependency on leveldb... since we got non-atomic flushing, there are many other things possible for the chainstate.

01:54 < echeveria> mongodb?

01:56 < gmaxwell> it has webscale

01:57 < gmaxwell> I don't think that even with non-atomic-flushing would mongo's consistency behavior be acceptable. :P

02:32 < jimpo> mongo write consistency could be a decent entropy source

17:02 < jimpo> hmm, seems the whole block tree db could be moved to flat files since it's all read into memory on startup anyway

18:35 < sipa> jimpo: i guess!

20:44 < wumpus> I think the idea is to not read it all into memory at some point

20:44 < wumpus> just like with the wallet, FWIW

20:45 < wumpus> for the block index, the pointers could be handles that prompt fetching some more specific data only on demand

20:46 < wumpus> ken2812221: yes, that is funny

20:46 < wumpus> ken2812221: the argument handling code is pretty weird in some regards, now

20:47 < wumpus> I tried to document it, but I guess I failed

20:48 < wumpus> I guess I'm going to untag #14105 and #14100 from 0.17.0

20:48 < gribble> https://github.com/bitcoin/bitcoin/issues/14105 | util: Report parse errors in configuration file by laanwj · Pull Request #14105 · bitcoin/bitcoin · GitHubAsset 1Asset 1

20:48 < gribble> https://github.com/bitcoin/bitcoin/issues/14100 | doc: Change documentation for =0 for non-boolean options by laanwj · Pull Request #14100 · bitcoin/bitcoin · GitHub

20:49 < wumpus> we're never going to do a release if we try to solve this first

20:53 < luke-jr> wumpus: aren't block indexes so small that it wouldn't be worth doing fetch-on-demand handles? (as opposed to fetch-on-demand map)

20:56 < wumpus> luke-jr: there is certainly some minimum state that would make no sense to fetch on deman

20:56 < wumpus> luke-jr: on the other hand, the structure per block is growing every release, I'm sure there are also things that don't make sense to read and store persistently

20:58 < wumpus> luke-jr: I just meant I don't want to commit to a flat file because of that; also for updates, that would be much harder to manage

20:59 < wumpus> having a block index database makes sense, no matter how exactly it's managed now

21:02 < luke-jr> sure

21:04 < wumpus> what is wrong with travis on master: https://travis-ci.org/bitcoin/bitcoin/builds/423430634?utm_source=email&utm_medium=notification

21:04 < wumpus> the linting stage is failing but there are no errors

21:11 < wumpus> of course it all passes perfectly locally

21:29 < gmaxwell> luke-jr: so sizeof(CBlockIndex) is 144 bytes, so thats 78MB (and slowly growing) of memory used for little particular purpose, excluding malloc overheads (which I guess are probably at least another 16 bytes per header). The fact that we also keep so many of them in memory means a longer start time, and constant pressure to not add things to those objects with a result of reducing

21:29 < gmaxwell> functionality.

21:29 < gmaxwell> so I think it would make sense to eventually not keep them in memory.

21:31 < gmaxwell> there should be no particular reason that someone couldn't run a fully functional bitcoin node using a few tens of MB of ram... though obviously not one with the lowest possible latency.

21:31 < luke-jr> gmaxwell: sure, I'm just saying, a handle wouldn't be a big improvement

21:32 < luke-jr> seems to make more sense to just create the indexx object itself on demand

21:32 < luke-jr> and not store anything in memory per-block

21:33 < gmaxwell> ah, I think I agree with that.

21:34 < gmaxwell> Well really the access to the block index could be intermediated through a caching layer, so that the policy of what is in memory vs not is hidden from the rest of the code.

21:34 < luke-jr> sure

21:47 < wumpus> "so I think it would make sense to eventually not keep them in memory" exactly

21:48 < wumpus> I just meant we shouldn't be making any code changes in the direction of making that more difficult

21:49 < wumpus> not so much 'we should be doing that now'

21:49 < wumpus> I'd agree it's certainly not the biggest memory sink at the moent

21:52 < gmaxwell> maybe one of the least useful ones, however.

21:55 < wumpus> true