< Luke-Jr> a claim/report that running std::bad_alloc can cause a corrupt chainstate.. https://www.reddit.com/r/Bitcoin/comments/3rp0jb/another_dos_attack_how_to_safely_restart_node/
< fanquake> jonasschnelli How easy is it for your nightly builder to spin up builds that contain multiple PRs?
< fanquake> Thinking #6931/2 + #6918 + #6954 + #6851
< dcousens> what is "Work Queue depth exceeded" coming from the bitcoin rpc
< dcousens> Its not JSON encoded either
< dcousens> Guessing something we didn't catch in libevent?
< dcousens> Seems to happen if I smash the RPC with a few hundred requests
< jgarzik> an internal libevent queue of requests waiting to be serviced by the rpc thread(s)
< fanquake> Look at the http request callback function in httpserver
< dcousens> Where is that limit set?
< jgarzik> basically a bunch of HTTPWorkItem's
< dcousens> also, wouldn't it be better to reply using application logic (aka, the RPC) rather than just "Work queue depth exceeded"
< jgarzik> int workQueueDepth = std::max((long)GetArg("-rpcworkqueue", DEFAULT_HTTP_WORK
< dcousens> ta, why isn't my search finding this
< dcousens> lol
< fanquake> Need to reindex :p
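(For context: the limit dcousens hit lives in httpserver.cpp. Incoming HTTP requests are wrapped in HTTPWorkItem objects and pushed onto a bounded work queue whose depth comes from -rpcworkqueue (default 16); when the queue is full the server answers with a plain HTTP 500 "Work queue depth exceeded" before any REST/RPC dispatch, which is why the reply is not JSON-encoded. A minimal sketch of the mechanism, simplified from the code of that era rather than copied from it:)

```cpp
#include <cstddef>
#include <deque>
#include <memory>
#include <mutex>

static const int DEFAULT_HTTP_WORKQUEUE = 16; // overridden by -rpcworkqueue

// Simplified: the real WorkQueue also has a condition variable and a Run()
// loop that worker threads pull items from.
template <typename WorkItem>
class WorkQueue
{
    std::deque<std::unique_ptr<WorkItem>> queue;
    std::mutex cs;
    const size_t maxDepth;

public:
    explicit WorkQueue(size_t maxDepth_) : maxDepth(maxDepth_) {}

    // Returns false when the queue is already at maxDepth; the HTTP request
    // handler then replies 500 "Work queue depth exceeded" instead of queueing.
    bool Enqueue(WorkItem* item)
    {
        std::lock_guard<std::mutex> lock(cs);
        if (queue.size() >= maxDepth) return false;
        queue.emplace_back(std::unique_ptr<WorkItem>(item));
        return true;
    }
};
```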
< gmaxwell> sdaftuar: FWIW, in-memory UTXO during an IBD currently peaks out at 5486.7MB according to updatetip messages.
< sipa> gmaxwell: would be interested in what that number is with #6914
< gmaxwell> I'll benchmark a reindex next, and then a reindex with 6914
< sipa> great
< gmaxwell> currently waiting on stopnode.... turns out flushing a 5468 MB cache takes an awful long time.
< gmaxwell> about a minute and a half.
< gmaxwell> FWIW that sync from the network with no checkpoints took 3hrs 39 minutes.
< gmaxwell> :(
< dcousens> gmaxwell: IBD?
< phantomcircuit> gmaxwell, libsecp?
< gmaxwell> dcousens: initial block download
< gmaxwell> phantomcircuit: yes.
< dcousens> gmaxwell: and before libsecp?
< phantomcircuit> "more"
< gmaxwell> probably also about 3 hours.
< gmaxwell> also with no signature validation.. also about three hours.
< gmaxwell> can't really make use of all the cpus on this host.
< dcousens> IO bound?
< gmaxwell> no, it's cpu bound in all the things that aren't signature validation.
< dcousens> haha
< gmaxwell> but thats what you get with over 8 cores or so.
< gmaxwell> (in particular it gets bound on the leveldb, block handling, hashing, etc. and that inserts bubbles in the downloading pipeline)
< phantomcircuit> oh no signature validation
< phantomcircuit> heh
< gmaxwell> phantomcircuit: no, this is with signature validation, specifically I'm testing the libsecp256k1 pull.
< gmaxwell> But if I bench with none it won't be much faster.
< gmaxwell> or with openssl, much slower. (because it'll just manage to use more of the cpus)
< phantomcircuit> ohh
< phantomcircuit> gmaxwell, it'll be a bit faster but yeah
< wumpus> dcousens: the work queue is part of the HTTP server, it happens before requests are handed off to the application logic (REST or RPC)
< dcousens> wumpus: all good :), just hadn't received it before, caught me off guard
< wumpus> don't know if the default number is high enough, do you hit it under quasi-normal circumstances? at some point under heavy 'abuse' it's better to reject clients instead of adding them to an overlong queue (also to prevent the requests from filling up memory), but the default of 16 may well be too conservative
< GitHub62> [bitcoin] MarcoFalke opened pull request #6958: [trivial] Cleanup maxuploadtarget (doc & log) (master...MarcoFalke-2015-maxupload) https://github.com/bitcoin/bitcoin/pull/6958
< phantomcircuit> gmaxwell, bleh it's really annoying not having a MAC for the plaintext key
< phantomcircuit> any opposition to storing one?
< wumpus> what plaintext key and MAC are you talking about?
< * jonasschnelli> is also wondering...
< phantomcircuit> oh, the encrypted wallet entries
< phantomcircuit> the only way to tell which keys go to which master key (technically we support more than one... kind of) is to derive the public key
< wumpus> why do you want to change wallet encryption? it works fine
< phantomcircuit> it's very slow and corruption can only be detected when you unlock
< jonasschnelli> i think phantomcircuit's optimizations make sense
< phantomcircuit> we're basically using the private key -> public key derivation as a hash function right now
< phantomcircuit> which doesn't make a lot of sense
< wumpus> requiring an explicit pubkey derivation was on purpose, although due to the AES padding trick you don't actually need to do it
< wumpus> what do you want to optimize? what is not fast enough?
< wumpus> 'trying more passphrases per second'? :p
< gmaxwell> phantomcircuit: why do you need one for the plaintext key? you're checking the ciphertext.
< phantomcircuit> gmaxwell, the current check will detect whether you have multiple master keys mixed together into a single wallet file
< wumpus> what are we even trying to accomplish here? what is too slow that needs to be optimized?
< wumpus> I really don't like aimless optimization in the context of crypto code :)
< gmaxwell> wumpus: please see his existing PR, the performance is not the main focus (though it's quite slow with large wallets)
< gmaxwell> the existing behavior has no integrity checking at all for keys in an encrypted wallet, which is not very safe.
< phantomcircuit> wumpus, the main focus is detecting corruption on load instead of at the first unlock
< wumpus> what I want to avoid is to make it really fast but insecure
< gmaxwell> Nothing he's talking about would reduce security.
< wumpus> then hashing database records (eg, add a checksum to the data) will work well enough
< wumpus> no need to mess with the crypto
< gmaxwell> wumpus: Yes thats what his PR does.
< wumpus> ok, then what's the problem?
< gmaxwell> he's trying to also eliminate the need for the really slow check on unlock, it sounds like. (which is problematically slow for wallets with very large numbers of keys)
< gmaxwell> the same is avoided already for non-encrypted wallets.
< wumpus> that never used to be the case, we only have to check one key/value pair
< wumpus> if you can assure the database is correct some other way
< wumpus> at some point someone wanted to check *all* keys on first unlock, as an integrity check
< wumpus> but that's not necessary if you add one at the db level
< phantomcircuit> that's what it does now :)
< wumpus> yeah, I heavily disagreed with that
< gmaxwell> wumpus: We _do_ check all of them on first unlock, as there is no other integrity check.
< wumpus> that indeed makes the first unlock slow on big wallets, that was clear, I even commented that back then
< phantomcircuit> gmaxwell, if we're OK with saying all the encrypted keys in the wallet have to be from the same master key
< gmaxwell> Sure, and thats still better than having no integrity check.
< phantomcircuit> then we dont need the hash on the plaintext key
< wumpus> anyhow, adding an integrity mechanism at the db level sounds good to me
< phantomcircuit> but then we should assert if there's more than 1 master key entry
< wumpus> but please don't change this code unnecessarily
< gmaxwell> phantomcircuit: does it actually work correctly now if there is more than one?
< phantomcircuit> gmaxwell, it correctly fails
< phantomcircuit> with the hashes it will silently not fail
< phantomcircuit> which would be double plus not good
< gmaxwell> ah so without the exhaustive test it won't know that some are encrypted with another key.
< wumpus> if you just add a checksum to the data, how can it add paths where it will silently work? you just add more checks, so more failures
< gmaxwell> wumpus: if he adds the checksum and turns off the exhaustive test (now not so important with the checksum)
< gmaxwell> phantomcircuit: so make it fail the load if there is more than one master key and call it done.
< wumpus> right
< wumpus> so that was the same in most actual releases (before the exhaustive check was added)
< wumpus> agreed
< phantomcircuit> sounds reasonable to me :)
< wumpus> at this point having more master keys *means* corruption
< wumpus> are you protecting all records or just the keys?
< wumpus> in any case, an assert to only have one master key is a good sanity check
< wumpus> who knows what happens with more, did anyone ever test that part...
< phantomcircuit> wumpus, it's unfortunately a huge nuisance to protect all the records
< phantomcircuit> i should look at doing that more carefully though...
< wumpus> phantomcircuit: ok, then doing just the keys is a good compromise
< wumpus> it is the most important data in the wallet
< gmaxwell> key records are already protected, ckey are not. but yea, most concerning is keys, and I think its a reasonable compromise.
< wumpus> if this was starting from scratch I'd want to do it for all data, but I can believe you if you say that becomes too ugly now
< phantomcircuit> hmm maybe it's not as hard as i was thinking
< wumpus> we should keep in mind that older versions should still be able to open the wallet
< phantomcircuit> wumpus, yeah i need to look at the serialization format to be sure
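(The idea being discussed, for readers following along: store a cheap integrity checksum next to each encrypted key record so on-disk corruption is caught when the wallet is loaded, instead of at the first unlock, which currently has to derive a public key per entry. A hypothetical sketch of such a check; the record layout and function names are illustrative, not the actual PR:)

```cpp
// Hypothetical: keep Hash(pubkey || ciphertext) beside each "ckey" record and
// verify it at load time. No passphrase is needed, so corruption is detected
// before the first unlock. Hash() here is Bitcoin Core's double-SHA256.
#include "hash.h"
#include "pubkey.h"
#include <vector>

uint256 CryptedKeyChecksum(const CPubKey& vchPubKey,
                           const std::vector<unsigned char>& vchCryptedSecret)
{
    return Hash(vchPubKey.begin(), vchPubKey.end(),
                vchCryptedSecret.begin(), vchCryptedSecret.end());
}

// Called while reading the wallet: compare against the stored checksum.
bool CheckCryptedKeyRecord(const CPubKey& vchPubKey,
                           const std::vector<unsigned char>& vchCryptedSecret,
                           const uint256& checksumOnDisk)
{
    return CryptedKeyChecksum(vchPubKey, vchCryptedSecret) == checksumOnDisk;
}
```

(As discussed above, a checksum over the ciphertext alone cannot tell which master key an entry belongs to, hence the suggestion to refuse loading wallets with more than one master key record.)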
< jonasschnelli> does -debug=bench affect the performance in a recognizable way?
< wumpus> it shouldn't - I think the timings are always done, just not logged normally
< wumpus> so any performance overhead will be in generating more log data
< gmaxwell> really chatty log messages do, but the debug bench log data is not that chatty.
< jonasschnelli> okay. thanks... i'm now comparing the IBD time of the secp256k1 branch with master (with standard parameters)
< phantomcircuit> are there windows builds for 6954 ?
< gmaxwell> As far as protecting all the data goes --- assuming it could be made compatible it would be useful, though OTOH, it would be nice to keep the checksums for pubkeys around in memory so we could test against them at the point of use, to reduce the window for corruption further.
< jonasschnelli> phantomcircuit: sure: https://bitcoin.jonasschnelli.ch/pulls/6954/
< gmaxwell> phantomcircuit: 3h 16m for the reindex on that host.
< arowser> gmaxwell: I got the error ‘EVENT_LOG_WARN’ when trying to build master on Ubuntu 12.04, the libevent version is 2.0-5, and I found you got the same error on 10-21 https://botbot.me/freenode/bitcoin-core-dev/2015-10-21/?msg=52437191&page=2
< arowser> Is that a known issue?
< wumpus> yes, that is a known issue, should be easy to solve by adding some ifdefs
< GitHub98> [bitcoin] MarcoFalke opened pull request #6961: luke-jr constants (master...luke-jr-const) https://github.com/bitcoin/bitcoin/pull/6961
< GitHub13> [bitcoin] MarcoFalke opened pull request #6962: translations: Don't translate markup or force English grammar (master...MarcoFalke-2015-translations) https://github.com/bitcoin/bitcoin/pull/6962
< GitHub153> [bitcoin] laanwj closed pull request #6825: Backport bugfixes to 0.11 (2015-10-22 / f2c869a) (0.11...backport-bugfixes-to-0.11-20151014) https://github.com/bitcoin/bitcoin/pull/6825
< GitHub120> [bitcoin] laanwj pushed 18 new commits to 0.11: https://github.com/bitcoin/bitcoin/compare/df616ae43ed4...6c31ac019f1b
< GitHub120> bitcoin/0.11 9b9acc2 Diego Viola: Fix spelling of Qt
< GitHub120> bitcoin/0.11 01878c9 Alex Morcos: Fix locking in GetTransaction....
< GitHub120> bitcoin/0.11 b3eaa30 MarcoFalke: [Qt] Raise debug window when requested...
< GitHub30> [bitcoin] laanwj pushed 1 new commit to 0.11: https://github.com/bitcoin/bitcoin/commit/4e895b08da5dc2dfa94ccefb24632062481a8d97
< GitHub30> bitcoin/0.11 4e895b0 Pieter Wuille: Always flush block and undo when switching to new file...
< sdaftuar> gmaxwell: sipa: off the top of your head, do you have a sense of the relative speed of calculating a sha256 hash versus performing a signature validation using secp?
< jgarzik> 256x2 itym?
< sdaftuar> yeah probably that is what i mean
< sdaftuar> actually it might not be what i mean, i think #6918 (which i'm analyzing performance on now) does a single sha256 if i'm reading the code right
< morcos> wumpus: re my comment on flushing undo files. i think i didn't realize that a reindex could pick up from where it left off. so it still reads through all the block files, but if the hash is in mapBlockIndex it skips processing?
< wumpus> yes
< morcos> i was assuming the corruption happened after a reindex finished, but before the flush
< morcos> ok, thx
< wumpus> it used to remember where it was, but this is complicated by block files being possibly out of order, so now it scans from the beginning again after restart (but this goes pretty fast)
< jonasschnelli> IBD up to 382324 with secp256k1 branch took 9 hours...
< jonasschnelli> standard parameters
< jonasschnelli> Quad Core Intel(R) Xeon(R) CPU E31245 @ 3.30GHz
< jonasschnelli> 16GB ram (but i did not change -dbcache)
< jonasschnelli> (now doing the same with current master [openssl], just curious what the performance benefit is with standard args on a standard system)
< gmaxwell> sdaftuar: a verify with libsecp256k1 as it's used in bitcoin core is 280 times slower than a single run of the sha256 compression function. That PR ends up calling the sha256 compression function twice.
< gmaxwell> sdaftuar: I think at some point we should add a faster cryptographic hash to the source code base for these internal non-normative uses. (E.g. blake2 is a bit over 5x faster than sha2-256; though if the user has a not-yet-released cpu with a sha256 instruction, sha256 will be faster again).
< sdaftuar> gmaxwell: thanks. i was investigating whether the use of sha256 in the PR might cause an unexpected slowdown... after some more investigation i think my concern was misplaced, it looks like any "slowdown" is pretty minuscule, and i think i wasn't hitting the conditions under which the PR would provide expected speedups. will keep testing...
< gmaxwell> sdaftuar: figuring out if there was any tradeoff there is part of why pieter was measuring the cache performance with many sizes (because a small amount of hitrate increase offsets any constant cost). Also, the remove operation in the current code ends up being pretty slow compared to the hashing, in fact.
< sdaftuar> because of map versus unordered_map?
< gmaxwell> Yes.
< gmaxwell> sipa: 6954 results--- so reindex with libsecp256k1, without 6954 was 3hr 16 minutes, reindex with it was _2 hours_ 7 minutes. Going to benchmark without signature validation.
< gmaxwell> UTXO in memory at the end was 4012 MB.
< sipa> gmaxwell: so from 5486 MB to 4012 MB, because of 6914?
< sipa> gmaxwell: 6954 is libsecp validation, so i guess you mean 6914 instead?
< morcos> sipa: is 6914 something you think should be considered for 0.12? i haven't really looked at it much, but i'd happily test it out if you think it's a near-term consideration
< gmaxwell> sipa: oops yes.
< sipa> morcos: i'm not sure what i can do more for it than get review
< morcos> gmaxwell's test seems to indicate there is a HUGE speed improvement, that's from allocation overhead? i think that would be something nice to test in my ConnectBlock benching
< morcos> sipa: understood, but it wasn't clear to me from your previous conversation with gmaxwell whether you even wanted to do 6914 or wanted to wait and do a bigger change
< sipa> morcos: as far as my reading of the specification is concerned, i think it's fully compatible with std::vector
< sipa> morcos: i don't think a bigger change is in scope
< sipa> i mean... i'd like to change allocation of scripts to some pooled storage per block and tx
< gmaxwell> One third less sync time and 25% less memory consumption is pretty nice. These results don't surprise me -- well, so I'm a little surprised that _just_ fixing the scripts did this, I would be less surprised if the whole cache entry were made a single allocation.
< morcos> sipa: yes, that's what i was referring to, but that doesn't mean you don't want 6914 for now, which is what i'm clarifying: you still do want 6914
< morcos> gmaxwell, i was seeing a significant chunk of ConnectBlock time being just destroying the temporary cache at the end.
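(Background for these numbers: #6914 swaps the std::vector backing CScript for a vector type with a small inline buffer, so most scripts held in the UTXO cache need no separate heap allocation. A toy illustration of the small-buffer idea only; the real prevector is a full std::vector-compatible container:)

```cpp
#include <cstdlib>
#include <cstring>
#include <new>

// Toy small-buffer byte vector: stores up to N bytes inline, heap-allocates
// only for larger payloads. Illustrative, not the actual prevector from #6914.
template <unsigned int N>
class SmallByteVec
{
    unsigned char inlineBuf[N];
    unsigned char* heapBuf = nullptr;
    std::size_t sz = 0;

public:
    SmallByteVec() = default;
    SmallByteVec(const SmallByteVec&) = delete;            // keep the sketch simple
    SmallByteVec& operator=(const SmallByteVec&) = delete;
    ~SmallByteVec() { std::free(heapBuf); }

    void assign(const unsigned char* p, std::size_t len)
    {
        if (len <= N) {
            std::free(heapBuf);           // shrink back to the inline buffer
            heapBuf = nullptr;
            std::memcpy(inlineBuf, p, len);
        } else {
            unsigned char* q = static_cast<unsigned char*>(std::malloc(len));
            if (!q) throw std::bad_alloc();
            std::memcpy(q, p, len);
            std::free(heapBuf);
            heapBuf = q;
        }
        sz = len;
    }

    std::size_t size() const { return sz; }
    const unsigned char* data() const { return heapBuf ? heapBuf : inlineBuf; }
};
```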
< morcos> while i have you both here. do you have any thoughts on how we could get rid of the 1 remaining database lookup for the coinbase in 6932
< morcos> i hate that you have to access the DB for every block just in case you're processing the 2 historical violations of BIP 30
< sipa> looking
< jgarzik> morcos, agreed
< jgarzik> that wants fixing (as you've proposed)
< morcos> maybe i'm trying to overoptimize, but i wasn't able to eliminate checking the database for preexisting coins when you are creating the new outputs for a new coinbase
< sipa> morcos: how about writing them and just not marking them as fresh?
< sipa> make ModifyNewCoins take a boolean fPossiblePreexisting or something
< morcos> that was my first plan, but it assert fails
< morcos> the code assumes that if you're not fresh you must exist in the parent
< sipa> i see
< sipa> maybe that can be relaxed
< morcos> i think you could remove that assert, but i didn't like making that change
< morcos> so the idea i've been hesitant to suggest is to actually check if its one of those two hashes
< sipa> yuck
< morcos> hence the hesitancy, but its really more clear as to why you're doing what you're doing
< morcos> otherwise you have to have a long winded explanation of why you treat a coinbase differently anyway
< morcos> also i assume it's way faster than having to either not mark all coinbases as fresh or first check the database for them
< morcos> i don't know, maybe not faster than not marking them FRESH, i guess
< morcos> anyway, lunch.. i'm happy to do whatever you guys are most comfortable with
< sipa> morcos: the only advantage of fresh is that it's removed from memory altogether before hitting disk, if it's spent before flush
< morcos> sipa: right. so i guess that doesn't happen unless you have a big cache, with coinbases... but even if it happens a small fraction of the time, a DB write has to be orders of magnitude more expensive than checking 2 hashes on every coinbase
< sipa> morcos: i think removing the assertion is fine
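(The shape of the change being discussed, as a hedged sketch rather than the actual #6932 patch: give ModifyNewCoins a coinbase flag so coinbase outputs are written DIRTY but not FRESH. They can then overwrite a pre-existing entry, covering the two historical BIP30 violations, without a per-block database lookup; the batch-write assertion that a non-FRESH entry must exist in the parent is what would have to be relaxed. Memory-usage bookkeeping is elided here:)

```cpp
// Sketch only; flag handling and bookkeeping simplified.
CCoinsModifier CCoinsViewCache::ModifyNewCoins(const uint256& txid, bool fCoinbase)
{
    assert(!hasModifier);
    std::pair<CCoinsMap::iterator, bool> ret =
        cacheCoins.insert(std::make_pair(txid, CCoinsCacheEntry()));
    ret.first->second.coins.Clear();
    if (!fCoinbase) {
        // Non-coinbase outputs can safely be FRESH: BIP30 guarantees no
        // unspent duplicate can already exist, so the parent view need not
        // be consulted and a later spend can drop the entry entirely.
        ret.first->second.flags |= CCoinsCacheEntry::FRESH;
    }
    // Coinbase outputs stay non-FRESH so the two historical overwrites are
    // flushed to the parent rather than silently discarded.
    ret.first->second.flags |= CCoinsCacheEntry::DIRTY;
    return CCoinsModifier(*this, ret.first, 0);
}
```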
< sipa> gmaxwell: my sigcache performance at 320 MB (with validating everything) keeps improving (it's an exponentially decaying average with 0.99 factor per block, so ~half a day halflife), down to 6% now
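(For the curious: a per-block decay factor of 0.99 gives a half-life of ln(0.5)/ln(0.99) ≈ 69 blocks, i.e. roughly 11.5 hours at a 10-minute block interval, hence "about half a day".)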
< sipa> gmaxwell: feel like commenting on 6914?
< sipa> morcos: i guess in 6914 i can add more comments that refer to the specification and unit tests for the iterators
< morcos> sipa: what size in MB would be equivalent to a 1M sigcache of the old version
< morcos> i tried 80MB which was my guess from the pull, and verification seemed to get a little faster, but overall run time slowed down
< sipa> morcos: it should be around 3 times smaller
< sipa> between 70 and 85 bytes per entry now
< sipa> around 220 before
< gmaxwell> sipa: will comment when I'm through testing things with it.
< gmaxwell> reindex with 6914 and no signature checks, 1h 17m.
< morcos> sipa: did you benchmark adding things to the sigcache, or only looking them up? I'm wondering if calling GetRand() that often is too expensive. something slowed down for me for sure...
< gmaxwell> morcos: it GetRand's on eviction (and the initial initialization) but not otherwise.
< morcos> yeah i guess in the steady state it shouldn't be more than a couple evictions per new tx.. hmm.
< sipa> morcos: the eviction takes like a microsecond
< morcos> yeah i misremembered GetRand speed, but it's a tad under a micro, i thought it was a few micros. still maybe better to use insecure_rand(), but doesn't explain my performance issue
< sipa> morcos: in any case, the getrand is certainly the largest cpu time offender in the sigcache behaviour
< sipa> more than the hash
< morcos> i'm checking to see if i didn't match cache sizes, but i don't think thats the problem, since verification did speed up
< morcos> its the rest of the run time that slowed down...
< sipa> do you measure this by running two nodes in parallel?
< sipa> or just long period of time averaging?
< sipa> morcos: where exactly are you seeing slowdown?
< morcos> total time to run a simulation over 7 days, increased by about 5%
< sipa> interesting!
< morcos> time spent in Connect Block had about a 10% decrease (but this number i've found has a lot of noise, i think due to the vagaries of the level db cache and/or whatever else is happening with my hard disk)
< morcos> btw, have you thought about the effect on reorgs
< morcos> reorgs are already incredibly slow, and now you've just lost the signature cache for every tx in the block
< morcos> maybe we can skip signature checking on blocks added back from a reorg?
< morcos> txs from the disconnected block added to the mempool that is
< morcos> eh, that won't help since a bunch of them will need to be checked if they are included in the replacement block
< morcos> sipa: looks like it might be GetRand(). just ran with insecure_rand and it was a 5% overall speedup instead. i'm going to try again removing a bunch of extraneous debugging i have.
< sipa> morcos: oh, that's an optimization i've thought about before
< sipa> morcos: we actually store in the block index whether a block was verified already
< morcos> sipa: are you talking about something different? a re-re-org?
< sipa> right
< sipa> morcos: it's strange that getrand makes things slower... the earlier version used way more entropy
< morcos> i'm just talking about a regular reorg, where you add back all the txs from the disconnected block. so now you have to process on the order of a few thousand txs with literally none of their sigs in the cache
< sipa> by using GetRandBytes()
< sipa> morcos: one way to mitigate that would be to not remove sigcache entries during validation, but instead keep a list of txids added in each block in the past, and remove entries a few blocks deep
< sipa> eh, nevermind
< sipa> that would mean keeping a list of all signature checks
< morcos> wait, why wouldn't that work
< morcos> just keep a list of entries to remove
< sipa> it would work, but it's ugly, large, and a layer violation :)
< morcos> every new block add to the back of the list and remove from the front
< sipa> we could use greg's idea of annotating the sigcache entries with a sequence number
< sipa> and then after every block wipe things with a sequence number a few before
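(A hedged sketch of that sequence-number idea; the names are invented for illustration and this is not the #6918 code. Each sigcache entry remembers the block "generation" in which it was last used; after each connected block only entries a few generations old are evicted, so a reorg that re-validates the same transactions still hits the cache:)

```cpp
#include <cstdint>
#include <unordered_map>

// Illustrative delayed-eviction signature cache. The key stands in for a
// fingerprint of (sighash, pubkey, signature); real code would use the salted
// entry hash.
class DelayedEvictionSigCache
{
    std::unordered_map<uint64_t, uint64_t> entries; // entry -> last-used generation
    uint64_t generation = 0;
    static const uint64_t kDepth = 6;               // keep used entries ~6 blocks

public:
    bool Get(uint64_t entry)
    {
        auto it = entries.find(entry);
        if (it == entries.end()) return false;
        it->second = generation;                    // refresh instead of erasing on use
        return true;
    }

    void Set(uint64_t entry) { entries[entry] = generation; }

    // Called once per connected block.
    void NewBlock()
    {
        ++generation;
        for (auto it = entries.begin(); it != entries.end();) {
            if (generation - it->second > kDepth)
                it = entries.erase(it);
            else
                ++it;
        }
    }
};
```

(The obvious cost of this naive version is the full sweep per block; per-generation buckets would avoid that.)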
< morcos> as for GetRand(). you're right, that doesn't make sense that its slower
< morcos> so maybe the speedup from insecure_rand is real, but that still doesn't explain why it was slower in the first place
< sipa> can you try with a 5 MB cache?
< sipa> maybe the slowdown is just from larger working set
< cfields> morcos: something you might try, i discovered this last week when profiling new network code...
< morcos> the base case i was comparing against was a 1M entry cache
< sipa> 1M entries with old code?
< morcos> yes
< sipa> does it get filled?
< cfields> morcos: the secure_delete openssl allocator was taking ~25% of the time to process a transaction. Commenting that out made a massive difference in the profile
< cfields> not sure if that's artificial or if it actually surfaces in the real-world
< cfields> s/transaction/message/
< morcos> i didn't check, but over 7 days it accepts 1.3M txs to the mempool, so i would assume thats several million sigs
< sipa> cfields: wut
< cfields> sipa: take that with a grain of salt, it was a very artificial benchmark
< sipa> morcos: so an interesting thing i saw is that with master and a 300 M mempool, and delete-on-use-in-block sigcache, i never got higher than 944k sigcache entries
< sipa> cfields: what kind of profiling?
< morcos> sipa: that seems plausible, but the old code doesn't have delete on use
< cfields> sipa: gprof of echoing messages back/forth between nodes
< sipa> morcos: in fact, it stayed between 936k and 944k very consistently
< cfields> sipa: CDataStream is backed by the zeroafterfree alloc, so i suspect it'd be an issue in many other places
< sipa> cfields: yeah, that may be unnecessarily much
< cfields> i'd prefer if we were more conservative with where that was used. for non-sensitive data, it just seems wasteful
< sipa> indeed
< sipa> and network data should never be sensitive
< cfields> sipa: right. very likely that that particular case was highly inflated though. It was a worst-case of bouncing thousands of tiny messages around very quickly. Still goes to show the trend, though.
< morcos> sipa: apologies. that slowdown was an artifact. i've run it a couple times now, and it is faster overall with 6918. i'm still seeing a 10% speedup in ConnectBlock, but then also an overall speedup
< morcos> switching to insecure_rand offered incremental improvement over that, but nothing like what i thought i saw before
< sipa> morcos: ok, good to know!
< gmaxwell> 18 of the last 101 blocks have been v4.
< phantomcircuit> gmaxwell, on mainnet?
< phantomcircuit> that was fast
< gmaxwell> cfields: the zeroafterfree allocator has the benefit of making some use-after-free or use-of-uninitialized-memory vulnerabilities unexploitable.
< cfields> gmaxwell: mmm
< GitHub14> [bitcoin] TheBlueMatt opened pull request #6964: Benchmark sanity checks and fork checks in ConnectBlock (master...verify-commits-fixes) https://github.com/bitcoin/bitcoin/pull/6964
< gmaxwell> I dunno if its worth the cost, of course-- but it's not totally pointless.
< jgarzik> perhaps makes valgrind happier
< GitHub124> [bitcoin] TheBlueMatt opened pull request #6965: Benchmark sanity checks and fork checks in ConnectBlock (master...bench) https://github.com/bitcoin/bitcoin/pull/6965
< gmaxwell> nah. it's on free, won't make valgrind happier.
< phantomcircuit> gmaxwell, what's the cost?
< gmaxwell> phantomcircuit: memory bandwidth.
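(For reference, a minimal sketch of the zero-after-free idea being weighed here; simplified from Bitcoin Core's zero_after_free_allocator, which uses OPENSSL_cleanse/memory_cleanse so the compiler cannot elide the wipe. The wipe on deallocation is where the memory-bandwidth cost gmaxwell mentions comes from:)

```cpp
#include <cstddef>
#include <cstring>
#include <memory>
#include <vector>

// Simplified sketch: zero the buffer before handing it back to the heap.
template <typename T>
struct zero_after_free_allocator : public std::allocator<T>
{
    template <typename U>
    struct rebind { typedef zero_after_free_allocator<U> other; };

    void deallocate(T* p, std::size_t n)
    {
        if (p != nullptr)
            std::memset(p, 0, sizeof(T) * n); // the extra memory-bandwidth cost
        std::allocator<T>::deallocate(p, n);
    }
};

// A CDataStream-style byte vector whose backing store is wiped when freed.
typedef std::vector<char, zero_after_free_allocator<char>> WipedByteVector;
```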