< GitHub30> [bitcoin] sipa opened pull request #6895: Update to my new key (master...newkey) https://github.com/bitcoin/bitcoin/pull/6895
< GitHub19> [bitcoin] gmaxwell pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/8f3b3cdee497...d0badb916e51
< GitHub19> bitcoin/master 298e040 Pieter Wuille: Fix chainstate serialized_size computation
< GitHub19> bitcoin/master d0badb9 Gregory Maxwell: Merge pull request #6865...
< GitHub22> [bitcoin] gmaxwell closed pull request #6865: Fix chainstate serialized_size computation (master...fixchainsize) https://github.com/bitcoin/bitcoin/pull/6865
< GitHub43> [bitcoin] sipa pushed 4 new commits to master: https://github.com/bitcoin/bitcoin/compare/d0badb916e51...93521a4f56ce
< GitHub43> bitcoin/master 27252b7 Matt Corallo: Fix pre-push-hook regexes
< GitHub43> bitcoin/master 1d94b72 Matt Corallo: Whitelist commits signed with Pieter's now-revoked key
< GitHub43> bitcoin/master 6e800c2 Matt Corallo: Add Pieter's new PGP key to verify-commits/trusted-keys
< GitHub113> [bitcoin] sipa closed pull request #6875: Fix pre-push-hook regexes (master...verify-commits-fixes) https://github.com/bitcoin/bitcoin/pull/6875
< GitHub157> [bitcoin] sipa pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/93521a4f56ce...8756c986420c
< GitHub157> bitcoin/master 4252cd0 Pieter Wuille: Update to my new key
< GitHub157> bitcoin/master 8756c98 Pieter Wuille: Merge pull request #6895...
< GitHub179> [bitcoin] sipa closed pull request #6895: Update to my new key (master...newkey) https://github.com/bitcoin/bitcoin/pull/6895
< GitHub93> [bitcoin] sipa pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/8756c986420c...e06c14fb59ee
< GitHub93> bitcoin/master ab1f560 Pieter Wuille: Support -checkmempool=N, which runs checks on average once every N transactions
< GitHub93> bitcoin/master e06c14f Pieter Wuille: Merge pull request #6776...
< GitHub73> [bitcoin] sipa closed pull request #6776: Support -checkmempool=N, which runs checks once every N transactions (master...fraccheck) https://github.com/bitcoin/bitcoin/pull/6776
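
A minimal sketch of the 1-in-N mechanism behind #6776 (and of the int32 overflow later fixed in #6896): map the ratio N onto the 32-bit random range once, then a single comparison makes the expensive check fire with probability ~1/N. The helper names below are illustrative, not the exact code.

    #include <cstdint>

    static uint32_t nCheckFrequency = 0;  // 0 disables the check entirely

    void SetMempoolCheckRatio(int n)      // hypothetical setter
    {
        // The threshold must be computed in double/64-bit space: doing
        // (2^32 - 1) / n in a signed 32-bit int overflows for n == 1,
        // which is what #6896 addresses.
        nCheckFrequency = (n <= 0) ? 0 : (uint32_t)(4294967295.0 / n);
    }

    void MaybeCheckMempool(uint32_t rand32)  // rand32: uniform 32-bit draw
    {
        if (nCheckFrequency == 0) return;
        if (rand32 >= nCheckFrequency) return;  // skips ~(n-1)/n of calls
        // ... run the full, expensive mempool consistency check ...
    }
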
< GitHub67> [bitcoin] sipa pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/e06c14fb59ee...4764f5db9d2c
< GitHub67> bitcoin/master 214de7e Philip Kaufmann: [Trivial] ensure minimal header conventions...
< GitHub67> bitcoin/master 4764f5d Pieter Wuille: Merge pull request #6892...
< GitHub110> [bitcoin] sipa closed pull request #6892: [Trivial] ensure minimal header conventions (master...headers-new) https://github.com/bitcoin/bitcoin/pull/6892
< GitHub1> [bitcoin] sipa pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/4764f5db9d2c...8daffe227bc6
< GitHub1> bitcoin/master ad5aae1 Philip Kaufmann: constify missing catch cases...
< GitHub1> bitcoin/master 8daffe2 Pieter Wuille: Merge pull request #6891...
< GitHub130> [bitcoin] sipa closed pull request #6891: constify missing catch cases (master...const-ex) https://github.com/bitcoin/bitcoin/pull/6891
< gmaxwell> Oh I think I know one reason we see corruption reports on windows.
< gmaxwell> StopNode often takes an awfully long time; for example, on my laptop running defaults it just took 12 seconds. On Windows, on shutdown, tasks that don't stop pretty much immediately get killed.
< gmaxwell> that's probably a maximally bad case for leveldb, as it'll be in the middle of flushing out a bunch of cached updates.
< GitHub4> [bitcoin] sipa opened pull request #6896: Make -checkmempool=1 not fail through int32 overflow (master...fixchainsize) https://github.com/bitcoin/bitcoin/pull/6896
< tripleslash> gmaxwell: it's specifically HungAppTimeout, and it defaults to 5 seconds.
< morcos> test_bitcoin hangs for me a decent fraction of the time. It looks like it gets stuck at line 109 in scheduler_tests.cpp
< morcos> ah, i see it is an issue, #6540
< wumpus> gmaxwell: that will likely contribute to it, although most of the time corruption seems to happen on crashes / power failures, when there is no time to flush at all
< wumpus> but if that is the case too, then the flush+sync on windows is essentially not working at all
< gmaxwell> someone was commenting that we were writing via mmap on windows and that the sync we were using there didn't work on maps; which sounds like the mac problem. I didn't verify these claims at all.
< wumpus> didn't check that either
< dcousens> wumpus: aye, 1 OOM and my chain was broken
< jcorgan> cfields: did you ever make progress on #6681?
< jcorgan> i guess that would be #6819 now
< cfields> jcorgan: no, i stopped there. I just wanted to get it building so that someone who knows zmq could make it actually work
< jcorgan> got it
< morcos> gmaxwell: For this fast template generation on new block code. Are you envisioning you switch to a new empty template after a new most-work header? Even if you haven't validated the block you're connecting yet? And then set some sort of timeout, after which you'll switch back to not being willing to build off a headers-only chain?
< morcos> Once you've connected a new block, I'd say there is no reason not to wait the extra few ms to generate a block template with txs in it. Which can then be validated after it's already been served to you.
< gmaxwell> No I was not. This is less safe than many people think it is, and I think not needed if the other details are handled correctly.
< morcos> Well, that's where all the delay is, right? Receiving and connecting the new best block.
< sipa> how about building a new template, switching to working on it immediately, and then starting a validation for it
< morcos> sipa: yes, that's what i'm suggesting. but that's after you've connected the best block. if we're still required to wait for that, don't we think people will choose to short-circuit it less safely on their own?
< gmaxwell> morcos: No, CNB latency is tens of times slower than validating normally.
< morcos> gmaxwell: ok, i'll give you multiples, but probably less than 10x unless your mempool is really, really big. and that's just compared to validating, what about waiting to receive?
< morcos> i haven't looked at the time delay from receiving the most-work header to finishing connecting the block, any idea what that is typically?
< morcos> i bet it's long
< gmaxwell> When the relay network is working normally about 80% of blocks are transmitted in a single packet.
< morcos> ah yes, forgot about relay network
< morcos> that's why i ask questions
< sipa> it still makes sense to have numbers for the time between receiving the inv and CNB building a template on top
< gmaxwell> Most of the delays in mining right now appear to be from outside of bitcoin core, actually.
< morcos> sipa: in a relay-network-connected case, or a regular node, or both?
< sipa> morcos: in "reality"
< sipa> :)
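
A sketch of the flow sipa proposes above: build the template, hand it to miners immediately, and pay for the validation pass only afterwards. CreateNewBlock and TestBlockValidity are the existing functions under discussion; SwitchMiningWork and SwitchToEmptyTemplate are hypothetical stand-ins.

    // On a newly connected tip: serve the full template right away and
    // run TestBlockValidity only after miners are already working on it.
    void OnNewTip(CBlockIndex* pindexNew, const CScript& scriptPubKey)
    {
        CBlockTemplate* ptemplate = CreateNewBlock(scriptPubKey);
        SwitchMiningWork(ptemplate);              // hypothetical dispatch

        CValidationState state;
        if (!TestBlockValidity(state, ptemplate->block, pindexNew,
                               false, false)) {
            // A failure here means a bug somewhere (e.g. mempool
            // inconsistency); fall back to something safe and shout.
            LogPrintf("template failed validation: %s\n",
                      FormatStateMessage(state));
            SwitchToEmptyTemplate(pindexNew);     // hypothetical fallback
        }
    }
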
< morcos> ok, well i'm almost ready to push a WIP branch. it still doesn't do it in a thread, but the gain is really rather limited at this point, and i think i'll save that for a second pull
< morcos> but the question i want to resolve is what to do when TestBlockValidity fails
< morcos> right now it throws an error
< gmaxwell> hm. we've done something recently that slowed down connectblock (or the network behavior changes have)
< gmaxwell> oh dear
< morcos> connectblock has always been slow since i've measured it
< gmaxwell> debug.log:2015-10-28 16:11:38 - Connect block: 7256.55ms [7475.14s]
< sipa> how much is fetching inputs vs verifying inputs?
< gmaxwell> wtf.
< gmaxwell> since this node's last update to master, connect block's time has increased monotonically for every block.
< sipa> gmaxwell: coincache being thrown out by mempool bloat?
< morcos> gmaxwell, what block hash was that?
< morcos> gmaxwell: are you still running mempool.check? it runs inside that timer
< gmaxwell> morcos: before this latest rounds of attack this node was taking <100ms to connect block.
< gmaxwell> morcos: must be the mempool checks then.
< gmaxwell> morcos: it's all of them.
< sipa> gmaxwell: turning on mempool checks is a certain way to blow away your cache every block
< morcos> the blocks around that time for me took 500ms and then 1ms (only coinbase)
< gmaxwell> in any case, debug logs ran this thing out of space a couple hours ago, so I've restarted it. I'll run without mempool checks to get some good timings.
< gmaxwell> Numbers I posted from shortly before the MTL event were about 80ms.
< morcos> 80ms?? hmm...
< sipa> so with a 200 MB mempool i seem to need a 700-1200 MB coincache
< sipa> with matt's latest patch
< sipa> i wonder if we should (as a short term hack) treat some factor of the size-of-pulled-in-memory of a tx as its txsize
< morcos> gmaxwell: i looked at some old numbers of mine (from a couple of months ago) and they were like 500ms on average (during a fairly busy time, mid-July)
< gmaxwell> morcos: Interesting!
< morcos> sipa: so what do you mean "need"?
< morcos> a 200MB mempool isn't really the right measure, right?
< sipa> morcos: how do you mean?
< morcos> because as you go from 0 to 200MB you'll pull in a certain amount of txins. but as you keep running, your mempool stays at 200MB, but you'll, i think, tend to pull in more txins? or does matt's patch actually remove txins no longer requested when a tx gets mined?
< sipa> when a tx gets mined it goes into the cache with dirty bit on
< sipa> so it can't be removed from the cache anymore
< sipa> until a flush
< sipa> but just a 700 MB coincache sounds pretty painful already
< morcos> yes i think 700MB is pretty painful, but we need to think a bit about how to be smarter about it
< sipa> per-txout cache...
< morcos> yeah, if it was per-txout, then you'd solve that problem
< morcos> every so often you don't flush your cache, you just write out the dirty entries
< morcos> we could still do that now, except we don't know if that tx should still be in cache b/c of other mempool txs
< sipa> well an lru or random eviction of the utxo set could work too
< sipa> the cache would just become less effective
< morcos> so when we flush the cache, we have to write everything, correct? so it's consistent. i think if we could just get smarter about what we wiped from the cache at that point, then maybe we could just flush a lot more often, or do you think that would be bad.
< sipa> yes, a flush shouldn't wipe everything
< morcos> tell me if this would be too cumbersome. i think it might be pretty fast, how long does a flush take?
< morcos> after you write everything, you quickly scan the top 10MB of txs in the mempool, insert all of their txin.prevout.hash's into a set, and then iterate through the cachemap erasing anything that's not one of those
< morcos> so actually flushing takes over a second, right? i think you might be able to do something like i suggested on the order of tens of ms, but not sure
< morcos> i don't know if it's bad (or maybe it's even good) to flush more regularly. but if we did something like that, we wouldn't even need to worry about matt's patch. we could just "flush" every time the cache was getting too big.
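
A sketch of the trim morcos describes, under the assumption that dirty entries have just been written out. TopOfMempool() and Trim() are hypothetical helpers; the real mempool and coins-view APIs differ.

    // After writing out dirty entries, keep only the cache entries the
    // top ~10MB of the mempool will need again, and erase the rest.
    void TrimCoinsCacheToMempool(CCoinsViewCache& cache, const CTxMemPool& pool)
    {
        std::set<uint256> setKeep;
        size_t nScanned = 0;
        for (const CTxMemPoolEntry& entry : TopOfMempool(pool)) { // hypothetical
            for (const CTxIn& txin : entry.GetTx().vin)
                setKeep.insert(txin.prevout.hash);
            nScanned += entry.GetTxSize();
            if (nScanned > 10 * 1000 * 1000) break;  // top 10MB only
        }
        // Hypothetical: erase cached coins whose txid is not in setKeep.
        cache.Trim(setKeep);
    }
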
< morcos> re: TestBlockValidity failing. I think I'm going to log an error and return NULL. Seems better than throwing an error. I'd like to reuse FormatStateMessage (in main.cpp), should I move it to a different file, or just declare it in main.h?
< Luke-Jr> morcos: ?? if the block is invalid you do NOT want to use the template ever
< Luke-Jr> morcos: pretty sure you would entirely break proposals too
< morcos> Luke-Jr: I know you don't want to use it, the question is how to handle that case.
< morcos> It means code is broken somewhere, so human intervention is going to be required at some point
< morcos> The existing code would have thrown an error. I chose to return NULL and log the error (which will cause getblocktemplate to throw a now misnamed JSONRPC error)
< morcos> Another option would be to try to return a template with no tx's instead (since the likely bug is mempool consistency was broken)
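
A sketch of the handling morcos describes choosing: at the end of CreateNewBlock(), log via FormatStateMessage and return NULL rather than throw. The surrounding code is elided; treat this as illustrative, not the exact patch.

    // At the end of CreateNewBlock(), after assembling pblock:
    CValidationState state;
    if (!TestBlockValidity(state, *pblock, pindexPrev, false, false)) {
        // Don't throw: log the reason and return NULL, so
        // getblocktemplate surfaces an RPC error to the caller.
        LogPrintf("CreateNewBlock(): TestBlockValidity failed: %s\n",
                  FormatStateMessage(state));
        return NULL;
    }
    return pblocktemplate.release();
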
< morcos> I'm about to PR as a WIP... , would be great if you want to take a look
< gmaxwell> morcos: I vaguely recall something that if CreateNewBlock can fail there is a crash elsewhere.
< GitHub181> [bitcoin] morcos opened pull request #6898: [WIP] Rewrite CreateNewBlock (master...fasterCNB) https://github.com/bitcoin/bitcoin/pull/6898
< gmaxwell> or maybe I also fixed that in anticipation of people making it possible to fail again.
< morcos> It took a long time to make it produce the exact same blocks as the old code, but helped work out a couple of bugs.
< Luke-Jr> IMO human intervention is preferable to silently mining empty blocks
< Luke-Jr> and debug.log doesn't count as non-silent
< gmaxwell> Most miners will never see anything in debug.log.
< morcos> I really hate the ugliness around hacking in the ability to still do priority space in the blocks.
< Luke-Jr> morcos: if that requires ugliness, the old code was better? :\
< morcos> well, I'd call the old code ugly as well.
< gmaxwell> Luke-Jr: the old code is very ugly.
< Luke-Jr> slow != ugly
< morcos> Also I didn't clean up the whole thing, I just am putting this out there for proof of concept.
< Luke-Jr> but I can't see the new code to compare yet..
< morcos> Luke-Jr: the big problem is priority is very difficult to calculate
< Luke-Jr> ? no
< morcos> If there is a consensus that it's an important metric to keep, then I think we should do #6357, which would make it much faster to calculate the correct priority in the mining code
< morcos> however it'll still be impossible to keep a sort based on it (i think)
< morcos> it changes!
< Luke-Jr> you need to lookup the inputs anyway
< morcos> not anymore
< morcos> but even if you do, the biggest problem i see with the old code is that you have to look up the inputs for ALL the txs in your mempool
< morcos> not just the ones you're putting in a block
< Luke-Jr> hmm
< morcos> and if you're going to do a priority portion of the block, you have to keep doing that
< Luke-Jr> resorting once per block seems reasonable imo?
< morcos> although maybe the dynamic priority calculation would fix that, i haven't looked at it in a while.. and maybe it could be made even easier now with the concept of mempool children
< morcos> Luke-Jr: you're right, that would be the way to improve this code if priority isn't going away
< Luke-Jr> morcos: I still plan to redo all this btw :p
< morcos> keep an index (either part of the multi-index or separate) of all the priorities sorted, and only update it once per block
< morcos> yeah me too! :)
< Luke-Jr> so the mempool is a list of block templates
< morcos> but i was hoping we might be able to get something simple'ish done for 0.12, which would make GBT run a lot faster.
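
A self-contained toy of the once-per-block priority index morcos describes, using the same boost::multi_index machinery the mempool already uses (the Entry struct is a stand-in for CTxMemPoolEntry):

    #include <boost/multi_index_container.hpp>
    #include <boost/multi_index/ordered_index.hpp>
    #include <boost/multi_index/member.hpp>

    struct Entry {
        unsigned int id;         // stand-in for the txid
        double cachedPriority;   // refreshed once per block, not per query
    };

    typedef boost::multi_index_container<
        Entry,
        boost::multi_index::indexed_by<
            // primary lookup by id
            boost::multi_index::ordered_unique<
                boost::multi_index::member<Entry, unsigned int, &Entry::id> >,
            // secondary index kept sorted by the cached priority
            boost::multi_index::ordered_non_unique<
                boost::multi_index::member<Entry, double, &Entry::cachedPriority> >
        >
    > EntrySet;

    // On each new block: walk the container and modify() each entry's
    // cachedPriority; the secondary index re-sorts as entries change.
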
< gmaxwell> morcos: the input look up problem exists for fees too. :(
< morcos> gmaxwell: they are already stored in CTxMemPoolEntries
< morcos> and now i added sigops to that too
< gmaxwell> Oh I see right fees can be cached but priority cannot.
< Luke-Jr> right
< Luke-Jr> priority is our best metric right now I think, so I wouldn't want to lose it even temporarily
< morcos> best metric for what? why do you think it's better than fees?
< gmaxwell> Luke-Jr: I don't really agree there. Priority works fine for you and me, but I don't think it serves most users all that well.
< Luke-Jr> morcos: spammers are happy to pay fees
< morcos> Luke-Jr: yeah i actually agree it's a pretty good anti-spam mechanism
< morcos> but that's not how we use it now!
< Luke-Jr> we do both now
< Luke-Jr> gmaxwell: imo that's why it isn't *exclusively* priority
< gmaxwell> Luke-Jr: point was spam goes through currently.
< morcos> Luke-Jr: there is also the problem of incentives
< Luke-Jr> gmaxwell: not via priority…?
< morcos> how do you make miners prefer priority vs fee
< Luke-Jr> morcos: you don't
< phantomcircuit> wumpus, i haven't forgotten about getting you a copy of a corrupted datadir btw
< Luke-Jr> long-term, fees are the only realistic metric
< morcos> Luke-Jr: so you view priority as being like an HOV lane: at least some txs will sneak past, even if the spam is causing congestion in most of the block
< Luke-Jr> but for now we want to try to keep fees low
< gmaxwell> morcos: and it does have that effect.
< Luke-Jr> morcos: basically
< Luke-Jr> also a nice fallback
< morcos> gmaxwell: if we care about preserving that, why don't we just redefine priority to be your priority at the time the tx was accepted
< morcos> then it can be cached and it's easy to reason about, and who cares if different nodes/miners calculate it differently
< Luke-Jr> as long as we have priority, every tx can get confirmed *eventually*
< gmaxwell> morcos: we could expect then it'll turn into dead weight in the mempool.
< morcos> oh sorry, that's not what i meant
< morcos> i meant the priority only depends on your inputs that were confirmed at the time you were accepted
< morcos> so it's still a bit complicated, but way less than currently
< phantomcircuit> a better question is, why would miners use priority?
< gmaxwell> I think that would be fine. Then it only needs to update by some ratio of the size, I guess.
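
A sketch of the cheap aging this makes possible: with the input set frozen at acceptance, later priority is a closed-form bump per block, with no input lookups at all (field names here are illustrative):

    #include <cstdint>
    #include <cstddef>

    struct EntryPriority {
        double startPriority;       // computed once, at acceptance, from
                                    // inputs that were already confirmed
        int64_t inChainInputValue;  // total value of those inputs
        size_t nTxSize;
        unsigned int entryHeight;

        double GetPriority(unsigned int currentHeight) const {
            // priority = sum(input value * input age) / size, so each new
            // block simply adds inChainInputValue / nTxSize to the total.
            return startPriority +
                   (double)(currentHeight - entryHeight) * inChainInputValue / nTxSize;
        }
    };
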
< Luke-Jr> that may work well enough as a temporary thing
< Luke-Jr> phantomcircuit: we already had that mini-discussion, scroll up
< morcos> gmaxwell: but yeah, i mean even easier, let's just make it not age once it's in your mempool. if you guessed wrong and it doesn't get confirmed soonish, then you resubmit after it expires (currently 72 hours)
< Luke-Jr> morcos: maybe have a post-block resort for new confirmed inputs optionally enabled in conf file
< Luke-Jr> just in case it turns out bad
< Luke-Jr> re the re-sort, I mean
< gmaxwell> morcos: that's a little obnoxious, in that if it doesn't make it into the first block then immediately anything new added has an advantage. Maybe it's okay?
< morcos> gmaxwell: yeah i think it's a tradeoff. the annoying thing, however, is i'm not sure you can combine it into one score very easily and still have it serve the purpose you want it to serve
< Luke-Jr> afk
< gmaxwell> I suppose if we cared more we could have a background task that just goes and recosts them from time to time.
< gmaxwell> presumably we're doing some kind of linear scan for the expiration? (I haven't kept up with the latest changes)
< morcos> gmaxwell: expiration? oh no, there is an entry time index as well.
< phantomcircuit> gmaxwell, i'm seeing ~200ms on average for Connect total on my server for the last two months
< gmaxwell> morcos: so... perhaps another way to handle priority is to maintain a separate, very small mempool for it.
< gmaxwell> so then the cost of having to update and resort the 'whole mempool' is not very large.
< phantomcircuit> gmaxwell, why not just mark transactions in the mempool as dirty when there's a new block and update the priority in a background thread?
< sipa> I'm probably wasting my time, but I'm writing a more (space) efficient std::vector drop-in replacement
< sipa> a vector has an average overhead of 40 bytes currently...
< gmaxwell> sipa: probably; ... maybe a more useful thing to do is to find things using vectors for small amounts of data and make them not use vectors. :P
< gmaxwell> sipa: question; I can't figure out why we would need to cache failing signatures at all: rationale: constructing a new novel failing signature is free.
< sipa> gmaxwell: the idea is to use a union to either store elements directly, or store the size + pointer of a dynamic array
< sipa> gmaxwell: we cache failing transactions, not signatures
< sipa> to prevent downloading and verifying them over and over again
< gmaxwell> that was just a brainfart, for some reason I thought the sigcache type included a validity flag.
< gmaxwell> of course it doesn't.
< gmaxwell> sipa: I've wondered before why the normal std::vector didn't special case small numbers of objects by making one of the pointers null and storing the objects internally.
< gmaxwell> sipa: even if you superoptimize the vector, there is still malloc overhead and fragmentation and the pointer to the vector object in its parent object which can all be eliminated in cases where the vector can be avoided.
< sipa> gmaxwell: well for example almost every CScript (which is a std::vector<unsigned char>) in the utxo cache and mempool stores at least 25 bytes
< gmaxwell> Every vector<char> is probably suspect to begin with. :P
< sipa> i can make a vector that is exactly 32 bytes (4 bytes size + 28 bytes data, or 4 bytes size + 4 bytes alloced size + pointer to actual data)
< gmaxwell> It might be interesting to instrument std::vector so we can get a report of vectors in the codebase that have a constant number of objects in them.
< sipa> so it's 32B for up to 28 bytes of data, or 32B + N for N > 28 bytes of data
< sipa> instead of vector's 40B + N for everything
< gmaxwell> sipa: or you could make the parent object able to store up to 32 bytes internally, and only create a vector if there is more than that. (I suppose it could use a union with the char[32] to store the pointer to the vector when there is one).
< sipa> gmaxwell: that's what i'm doing
< sipa> when i say "vector" i mean parent type; i think you're interpreting that word as "malloced data"
< gmaxwell> Got it.
< gmaxwell> why 28? why not 31? you don't need a 4 byte length for only 31 bytes of range.
< sipa> it complicates things a bit if you can't share the size for one with the other
< sipa> but yes, indeed
< gmaxwell> ah I see, actually that also lets you avoid a flag, since you can switch the behavior on size.
< sipa> i thought about that first, but that's unfortunately not the case
< sipa> as vector guarantees that a resize that shrinks doesn't invalidate pointers
< sipa> so dropping from over the threshold to below the threshold cannot switch to in-object storage
< sipa> so i use 1 bit of the 4-byte size to determine what type of storage to use
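
A sketch of the layout being described (this design later landed in Bitcoin Core as "prevector", but the details below are simplified): one bit of the 4-byte size picks the storage mode, since, as sipa notes, shrinking must not move elements back in-object.

    #include <cstdint>

    template<unsigned int N, typename T>
    class smallvector {
        uint32_t _size;  // top bit: 0 = direct storage, 1 = indirect
        union {
            char direct[N * sizeof(T)];  // up to N elements in-object
            struct {
                uint32_t capacity;
                char* indirect;          // heap storage past N elements
            } ind;
        } _data;

        bool is_direct() const { return (_size & 0x80000000u) == 0; }
    public:
        uint32_t size() const { return _size & 0x7fffffffu; }
        // element access, push_back, etc. dispatch on is_direct() ...
    };
    // Per the arithmetic above: for N = 28 and T = unsigned char this is
    // roughly 4 bytes of size + 28 bytes of direct data (~32 bytes, modulo
    // padding), versus ~40 bytes of header plus a separate heap allocation
    // for every std::vector<unsigned char>.
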
< gmaxwell> what I was thinking about when I said changing the data structure was just flattening. e.g. one dynamically sized txout object that contains everything in it directly, and no pointers at all. It's not like we ever modify any of this except setting flags.
< gmaxwell> so having any pointers in it under any condition, is just waste and overhead.
< gmaxwell> but it means that there needs to be smart accessors for it.
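
And a sketch of the flattening gmaxwell describes: the txout becomes a single variable-length byte record with no interior pointers, and a thin accessor view computes offsets into it (entirely illustrative; no such type exists in the codebase):

    #include <cstdint>
    #include <cstring>

    // Record layout: [8-byte value][1-byte flags][script bytes ...]
    class FlatTxOutView {
        const unsigned char* m_data;
        size_t m_size;
    public:
        FlatTxOutView(const unsigned char* data, size_t size)
            : m_data(data), m_size(size) {}
        int64_t Value() const { int64_t v; std::memcpy(&v, m_data, 8); return v; }
        uint8_t Flags() const { return m_data[8]; }
        const unsigned char* Script() const { return m_data + 9; }
        size_t ScriptSize() const { return m_size - 9; }
        // "setting flags" is the only in-place mutation ever needed, so a
        // non-const variant would expose just that.
    };
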
< GitHub107> [bitcoin] mcelrath opened pull request #6899: Warnings clean with C++11 (master...cpp11) https://github.com/bitcoin/bitcoin/pull/6899