< GitHub107> [bitcoin] gmaxwell closed pull request #7123: [WIP] Make trickle logic useful again, delay trickle when past upload limit. (master...actually_trickle) https://github.com/bitcoin/bitcoin/pull/7123
< GitHub27> [bitcoin] gmaxwell pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/61457c29d735...c894fbbb1dc0
< GitHub27> bitcoin/master a9f3d3d Pieter Wuille: Fix and improve relay from whitelisted peers...
< GitHub27> bitcoin/master c894fbb Gregory Maxwell: Merge pull request #7106...
< GitHub63> [bitcoin] gmaxwell closed pull request #7106: Fix and improve relay from whitelisted peers (master...realwhiterelay) https://github.com/bitcoin/bitcoin/pull/7106
< GitHub12> [bitcoin] gmaxwell closed pull request #7119: Add option to opt into full-RBF when sending funds (master...2015-11-opt-into-full-rbf-option) https://github.com/bitcoin/bitcoin/pull/7119
< phantomcircuit> gmaxwell, what do you think about removing RelayTransaction entirely and simply sending the top n MB of the mempool every m seconds?
< gmaxwell> phantomcircuit: I want to try a protocol which does an efficient set reconciliation of the top N MB of mempool. Which is like the pro version of what you're thinking.
< gmaxwell> Not the actual data, but set reconcile the TXids and then getdata the bodies you need.
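A rough illustration of the txid-level exchange being described, assuming each peer can enumerate the txids of the top N MB of its mempool. A real reconciliation protocol would exchange a compact sketch of the sets rather than full txid lists; the function and type names here are hypothetical, not Bitcoin Core code:

    // Hypothetical sketch: given the txids a peer announced for the top
    // N MB of its mempool, request only the bodies we are missing.
    #include <algorithm>
    #include <array>
    #include <cstdint>
    #include <iterator>
    #include <set>
    #include <vector>

    using TxId = std::array<uint8_t, 32>;

    std::vector<TxId> TxIdsToRequest(const std::set<TxId>& their_top,
                                     const std::set<TxId>& our_mempool)
    {
        std::vector<TxId> missing;
        // Txids in their top-of-mempool set that we don't have locally.
        std::set_difference(their_top.begin(), their_top.end(),
                            our_mempool.begin(), our_mempool.end(),
                            std::back_inserter(missing));
        return missing; // send getdata for each of these
    }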
< phantomcircuit> gmaxwell, yes long term goal for sure
< phantomcircuit> but i can do this version like
< phantomcircuit> today
< phantomcircuit> gmaxwell, can you rename setInventoryKnown to filterInventoryKnown
< phantomcircuit> there's a bunch of those now that are confusing because the names were never changed
< phantomcircuit> gmaxwell, im pretty sure there's a bug in 7100
< phantomcircuit> it looks like you can get a false positive inv for blocks
< gmaxwell> phantomcircuit: where?
< phantomcircuit> gmaxwell, "getblocks" calls pfrom->PushInventory which uses setInventoryKnown to decide whether to actually do that
< phantomcircuit> so a false positive could prevent a block inv as well as tx inv
< phantomcircuit> i dont much care about tx invs getting dropped but it would be bad for block invs to be dropped too
< gmaxwell> agreed.
< phantomcircuit> so im thinking there needs to be a setInventoryKnownBlocks and filterInventoryKnownTransactions
< sipa> with 6494 we can just drop the known blocks
< sipa> as we keep track much more efficiently what peers know
< phantomcircuit> sipa, yes i agree that's a better solution
< phantomcircuit> except it's optional so we would still need the setInventoryKnownBlocks
< phantomcircuit> (or maybe we simply always send block invs ?)
< sipa> yes, we should always send them
< sipa> i'll rebase #6494; it's overdue for merging
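A minimal sketch of the split being agreed on here: transactions go through a lossy known-inventory filter (a false positive only costs a skipped tx announcement), while block announcements are never filtered. The member names anticipate a rename like the one asked for above and are assumptions, not the code as it stood:

    #include <array>
    #include <cstdint>
    #include <set>
    #include <vector>

    using Hash = std::array<uint8_t, 32>;

    // Stand-in per-peer state for illustration only.
    struct Peer {
        std::set<Hash> filterInventoryKnown;     // in Core, a rolling bloom filter
        std::set<Hash> setInventoryTxToSend;     // tx invs queued for the next trickle
        std::vector<Hash> vInventoryBlockToSend; // block invs, never filtered
    };

    // Queue an announcement: blocks bypass the (possibly lossy) tx filter,
    // so a false positive can't suppress a block inv.
    void PushInventory(Peer& peer, const Hash& hash, bool is_block)
    {
        if (is_block) {
            peer.vInventoryBlockToSend.push_back(hash); // always announce blocks
        } else if (!peer.filterInventoryKnown.count(hash)) {
            peer.setInventoryTxToSend.insert(hash);
        }
    }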
< gmaxwell> woot, managed to apply that without loading the webpage.
< sipa> you have now passed the entry exam for the school for git wizardry and magic
< gmaxwell> well the git part was never a problem, just the github part. :P
< phantomcircuit> just gotta know the special pulls refspec
< phantomcircuit> sipa, in 7113 you calculate the optimal number of hash functions and then restrict it to 1-50
< phantomcircuit> why?
< phantomcircuit> an fp rate of 1/10^15 would get you nHashFuncs ~= 50
< phantomcircuit> i dont see this being an issue
< sipa> eh, i guess i wrote that part before realizing that the number of hash functions only depended on the fprate :)
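For context: once the filter's bit array is sized for its element count, the optimal number of hash functions depends only on the target false-positive rate p, via k = -log2(p). A small check of the figure quoted above (the function name is illustrative):

    #include <algorithm>
    #include <cmath>
    #include <cstdio>

    // Optimal hash-function count for a bloom filter with target
    // false-positive rate p: k = -log2(p), independent of element count.
    int OptimalHashFuncs(double fp_rate)
    {
        return std::max(1, (int)std::round(-std::log2(fp_rate)));
    }

    int main()
    {
        // fp rate of 1/10^15 -> k = 15 * log2(10) ~= 49.8, i.e. ~50,
        // right at the 1-50 clamp discussed above.
        std::printf("%d\n", OptimalHashFuncs(1e-15));
    }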
< phantomcircuit> anybody have an opinion of changing gbt to default not mine a transaction until it's been in the mempool for more than a few seconds
< sipa> phantomcircuit: do that on top of morcos' rewrite then :)
< phantomcircuit> sipa, it's a trivial 1 line change so i was mostly just wondering about the concept
< sipa> ok
< phantomcircuit> it seems like a good idea to me but i just thought of it soo
< sipa> among miners who are not in a position to exploit high hashrate or fast relay to a majority, that should improve their relay
< gmaxwell> phantomcircuit: I think it's a good idea. I thought of it before but figured it would screw up the recent improvements. :(
< sipa> should be trivial now that we remember time-in-mempool
< gmaxwell> hm. actually yea, you'd just skip them, so it doesn't change the sort.
< sipa> indeed
< phantomcircuit> sipa, it's been a trivial 1 loc patch for about a year now
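The change under discussion amounts to a guard like the following when selecting transactions for a block template; the constant and the names are illustrative, not the actual getblocktemplate code:

    #include <cstdint>

    // Assumed threshold for "a few seconds"; purely a placeholder.
    static const int64_t MIN_MEMPOOL_AGE_SECONDS = 10;

    // Skip transactions that entered the mempool too recently: a block
    // full of txs peers haven't seen yet relays more slowly, and simply
    // skipping entries doesn't change the selection sort order.
    bool SkipForBlockTemplate(int64_t time_entered_mempool, int64_t now)
    {
        return now - time_entered_mempool < MIN_MEMPOOL_AGE_SECONDS;
    }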
< sipa> orly?
< phantomcircuit> actually more than that
< phantomcircuit> Date: Mon Nov 11 17:35:14 2013 +1000
< sipa> ha, wow
< phantomcircuit> on a related note
< phantomcircuit> we want to maximally prime the sigcache for receiving a block
< phantomcircuit> while also limiting the mempool
< phantomcircuit> these are kind of at odds at the moment
< gmaxwell> I tried to have sipa verify rejects and it polluted the crap out of his cache.
< phantomcircuit> i've got a few ideas of how to deal with this but they are admittedly mostly insane
< gmaxwell> it actually makes sense to verify rejects for several reasons.
< gmaxwell> but I think we have to improve cache management first.
< phantomcircuit> gmaxwell, well for a miner you can just run with a huge sigcache
< phantomcircuit> i've got mine set to 4GB...
< gmaxwell> Sipa found it was polluting it even with a stupidly big one. (but not that stupidly big)
< phantomcircuit> yeah the definition of stupidly big varies here :)
< gmaxwell> phantomcircuit: we'd like to uh.. not centralize mining. "you can mine but if you want a low orphan rate you'll have to dedicate 16GB ram to it".. not great. :P
< sipa> hey let's use a rolling bloom filter for the sigcache!!!
< sipa> *ducks*
< gmaxwell> but I think ultimately that's the best thing to do, we could try to estimate "reject but still likely to get mined soon based on recent history" but I'd rather spend the complexity on making the cache smarter.
< phantomcircuit> the cache doesn't evict when a block is found does it?
< phantomcircuit> that would be an easy win
< sipa> it does
< gmaxwell> e.g. attach feerate and "tip change counter" to entries in the cache, and evict using them.
< phantomcircuit> oh
< phantomcircuit> nvm
< phantomcircuit> :|
< sipa> phantomcircuit: well, it evicts after use in a block
< gmaxwell> e.g. so when full it evicts the lowest feerate that went into the cache the most blocks ago.
< sipa> but not after use in mempool
< phantomcircuit> so there is also the issue that processing transactions into the mempool acquires cs_main
< phantomcircuit> which is bad for latency of gbt calls
< gmaxwell> (by lowest I don't mean a sort, I mean a N random draw...)
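A sketch of that eviction idea, assuming each cache entry is tagged at insert time. The field names and draw count are assumptions, and preferring age before feerate is one plausible tie-break order, not a spec:

    #include <cstdint>
    #include <random>
    #include <vector>

    struct SigCacheEntry {
        uint64_t feerate;  // satoshis/kB of the inserting transaction
        int tip_counter;   // global "tip changed" counter at insert time
    };

    // On overflow, evict the "worst" of N random draws rather than doing
    // a full sort: oldest tip counter first, lowest feerate as tie-break.
    // Assumes entries is non-empty.
    size_t PickEviction(const std::vector<SigCacheEntry>& entries, std::mt19937& rng)
    {
        std::uniform_int_distribution<size_t> dist(0, entries.size() - 1);
        size_t worst = dist(rng);
        for (int i = 1; i < 8; ++i) { // 8 draws; the count is a tuning knob
            size_t cand = dist(rng);
            const SigCacheEntry& c = entries[cand];
            const SigCacheEntry& w = entries[worst];
            if (c.tip_counter < w.tip_counter ||
                (c.tip_counter == w.tip_counter && c.feerate < w.feerate)) {
                worst = cand;
            }
        }
        return worst;
    }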
< phantomcircuit> soooo rpc "addsigcachentries"
< * phantomcircuit> runs away
< gmaxwell> die
< sipa> rpc "pollutecache"
< gmaxwell> should just hash random items in memory and add them, some might be signatures.
< phantomcircuit> :)
< phantomcircuit> <phantomcircuit> i've got a few ideas of how to deal with this but they are admittedly mostly insane
< phantomcircuit> i wasn't lying
< gmaxwell> phantomcircuit: the obvious thing to do is to just have an EWMA minimum feerate for blocks, and any transaction over that, you verify even if it's rejected.
< sipa> phantomcircuit: when you say insane, i should probably start worrying
< gmaxwell> (or over 0.95 * that)
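Sketched out, that policy reads roughly like this; the smoothing factor is an arbitrary placeholder, and the 0.95 threshold is the one suggested above:

    // Track an exponentially weighted moving average of each block's
    // minimum feerate, and verify a rejected transaction's signatures
    // anyway if it pays close to that average (it's likely to be mined
    // soon, so priming the sigcache pays off).
    struct EwmaMinFeerate {
        double avg = 0.0;
        double alpha = 0.1; // smoothing factor; an assumption, not tuned

        void OnBlock(double block_min_feerate)
        {
            avg = alpha * block_min_feerate + (1.0 - alpha) * avg;
        }

        bool ShouldVerifyReject(double tx_feerate) const
        {
            return tx_feerate > 0.95 * avg;
        }
    };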
< gmaxwell> sipa: dude, not like he's suggesting turning the G/2 nonce into a cryptosystem.
< tulip> would having two isolated sigcaches (rejects, accepts) do roughly the same job?
< gmaxwell> tulip: probably no, because the rejects would get polluted and then not be useful (why have it)
< tulip> people can perform an eviction attack against the rejects cache of course, but that doesn't completely destroy your validation time.
< sipa> we don't have a negative cache now
< tulip> gmaxwell: suppose.
< sipa> gmaxwell: European Momputer Manufacturers Organization minimum feerate (sorry, google was slow in telling me what ECMA stood for)
< sipa> Womputer, of course
< gmaxwell> exponentially weighted moving average.
< sipa> ah, of course
< phantomcircuit> gmaxwell, EWMA ?
< sipa> phantomcircuit: European Momputer Manufacturers Organization.
< phantomcircuit> oh
< phantomcircuit> lol derp
< phantomcircuit> i should have continued reading before asking :)
< GitHub47> [bitcoin] sipa opened pull request #7129: Direct headers announcement (rebase of #6494) (master...direct-headers-announcement) https://github.com/bitcoin/bitcoin/pull/7129
< gmaxwell> "that isn't merged yet"
< gmaxwell> "?"
< sipa> gmaxwell: i'll merge upon happy travis (though maybe someone should proofread my docs)
< gmaxwell> hm. so I wonder if the sigcache rejects stuff will have less pollution problems once limited mempool is more common?
< sipa> perhaps yes
< tulip> sipa: the docs read fine.
< phantomcircuit> gmaxwell, probably
< phantomcircuit> i actually like tulip's suggestion of having two sigcaches
< phantomcircuit> it handles the "i have lots of memory and dont care" case pretty well
< gmaxwell> why not have a separate sigcache for each band of feerate? :P and if the highest feerate cache is full it evicts to a lower feerate cache.
< gmaxwell> oh you could also put a neural network in it, and it could do unsupervised classification to decide which transactions will get confirmed... and ... :P
< sipa> and then skynet
< sipa> and a genisys block
< phantomcircuit> gmaxwell, har har
< phantomcircuit> but in all seriousness it's potentially a large win for miners and just annoying for everybody else
< gmaxwell> oh, and if your cache gets too big you can sign chunks of it and ship it off to peers, and they can send it back if you need it later...
< gmaxwell> phantomcircuit: not the kind of wins we should be hunting, because it presumes a more centralized world of mining where running a mining node takes a lot of resources.
< gmaxwell> Better to spend effort on optimizations that don't need that.
< phantomcircuit> gmaxwell, which reminds me, what kind of work would you want to see before revising the advice on mining with a pruned node?
< gmaxwell> I think it's fine already.
< gmaxwell> "You mean you don't mine exclusively on pruned nodes?" ... really the worse problem is that you're hosed in index corruption...
< phantomcircuit> gmaxwell, miners are hosed if their index is corrupt even without pruning
< phantomcircuit> hmm i seem to remember asking someone and getting a "wat dont do dat" response recently
< gmaxwell> it just means a three hour outage vs ... more.
< gmaxwell> I don't think you got that from me.
< sipa> you can always make a backup of the block chain data
< gmaxwell> like... half the reason I care about pruning is to try to rescue p2pool.
< phantomcircuit> gmaxwell, yes but it might mean running 5 nodes instead of 1
< sipa> and use that to recover a mining node
< gmaxwell> the main risk from pruning + mining is that you can't reorg past some depth, which means consensus fault; but we won't let you prune shallower than 288 blocks back, which makes that not much of a practical concern.
< gmaxwell> (or at least if there is a 288 block reorg, manual intervention is.. least of the worries)
< gmaxwell> though we should make sure that it cleanly fails (and importantly stops mining) if it tries a reorg beyond pruning.
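One way that clean failure could look, sketched with illustrative stand-in types: before disconnecting blocks for a reorg, confirm the data for every block back to the fork point is still on disk, and bail out (halting template creation) if anything has been pruned:

    #include <cstdio>

    struct BlockIndex {
        BlockIndex* pprev;
        bool have_data; // block and undo data not yet pruned
    };

    // Walk from the old tip back to the fork point; if any block on the
    // path has been pruned, the reorg is impossible and we must fail
    // loudly rather than keep mining on a chain we can't validate.
    bool CanReorgTo(const BlockIndex* fork_point, const BlockIndex* old_tip)
    {
        for (const BlockIndex* p = old_tip; p && p != fork_point; p = p->pprev) {
            if (!p->have_data) {
                std::fprintf(stderr,
                    "reorg needs pruned block data; stopping mining\n");
                return false; // caller should halt block template creation
            }
        }
        return true;
    }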
< gmaxwell> phantomcircuit: if you're looking for mining related improvements; bringing back the old lukejr patch to forward unverified blocks would be an obvious candidate.
< gmaxwell> E.g. a new protocol message activated like sendheaders where you can say "here is a block, I haven't verified it"; you're allowed to relay it to others (as a non-validated block) without validating, so long as the hash and headers check out, and so long as it extends the current tip.
< gmaxwell> when luke did it before, it didn't speed anything up, but that was presumably because of all the dumb sleeps in networking.
< gmaxwell> I think this would reduce your mining complex by one node, since those relays wouldn't need to lock the chainstate, and so they wouldn't compete with CreateNewBlock.
< sipa> checking whether it extends the current tip would need a lock
< phantomcircuit> gmaxwell, what happens generally if you try a reorg past the pruning depth?
< phantomcircuit> it should probably pull those blocks from peers
< gmaxwell> phantomcircuit: it can't. :( :(
< gmaxwell> undo data is gone too.
< sipa> though i guess you could have a pindexTipCopy which has an R/W lock, and is updated by the normal sync, but can be read (and used for verification) by other things
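A sketch of that pindexTipCopy idea, using C++17's std::shared_mutex for brevity (the codebase at the time would have used a Boost equivalent). Relay code takes the cheap read lock to check "does this block extend the current tip?" instead of contending on cs_main:

    #include <mutex>
    #include <shared_mutex>

    struct BlockIndex; // opaque here; only the pointer is used

    class TipSnapshot {
        mutable std::shared_mutex m_mutex;
        const BlockIndex* m_tip = nullptr;

    public:
        // Called by the normal sync path (writer) whenever the tip changes.
        void Update(const BlockIndex* new_tip)
        {
            std::unique_lock lock(m_mutex);
            m_tip = new_tip;
        }

        // Called by relay code (readers); many readers proceed in parallel.
        const BlockIndex* Get() const
        {
            std::shared_lock lock(m_mutex);
            return m_tip;
        }
    };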
< gmaxwell> someday we could perhaps make undo data normative and commit to it, and then you could pull it.
< sipa> gmaxwell: i think the biggest bottleneck is the fact that message processing is single threaded
< tulip> phantomcircuit: 288 blocks is the minimum, if you reorg that far all of your peers have pinged out as well, it gets super messy and everything is likely on fire anyway.
< gmaxwell> sipa: doesn't happen that all our messaging hashes the data...
< sipa> gmaxwell: parse error
< gmaxwell> I'm just saying that message handling is computationally expensive.
< sipa> gmaxwell: so?
< phantomcircuit> gmaxwell, iirc the undo data is much smaller than the blocks
< sipa> phantomcircuit: a factor of 9
< phantomcircuit> ok so probably we should keep it going back 9x deeper than we do blocks
< gmaxwell> phantomcircuit: yes, and we could prune it to different depths, but keeping it all removes a lot of the pruning gains.
< sipa> gmaxwell: the fact that we're processing a new incoming block for half a second is no reason why we couldn't respond to a ping from another peer
< gmaxwell> On earlier; though I guess if we introduced a new p2p message for block relay, it should do RNC like compression.
< phantomcircuit> gmaxwell, for the average pool i suspect relaying before validating anything but the header would be a win even without the new p2p message to prevent getting banned
< gmaxwell> ... and bring MRU sets back to track what transactions we've sent a peer? :P
< gmaxwell> phantomcircuit: it's really important to not hand unvalidated blocks to spv clients.
< tulip> how do you prevent unvalidated blocks from becoming a DoS vector? 25BTC is expensive, but if I can make a block which takes you 2 minutes to validate it when you see it that could be a problem.
< tulip> (two minutes of grinding SHA256, then you find you have to reject it)
< phantomcircuit> gmaxwell, iirc the message hashing is done in the networking thread so it's at least partially threaded
< sipa> phantomcircuit: no
< sipa> only the checksum
< sipa> i think?
< sipa> yes
< phantomcircuit> i thought that's what he was talking about
< sipa> not the txid's, for example
< sipa> or sighashes which are even more work
< tulip> me? I was talking about sighash hashing.
< phantomcircuit> tulip, no i was talking about what gmaxwell said
< tulip> right.
< sipa> message handling is of course expensive; it's where block validation (and a part of signature validation even) happens
< sipa> doesn't mean it needs to be done single threadedly
< GitHub182> [bitcoin] sipa pushed 3 new commits to master: https://github.com/bitcoin/bitcoin/compare/c894fbbb1dc0...5d5ef3a4cf8e
< GitHub182> bitcoin/master 50262d8 Suhas Daftuar: Allow block announcements with headers...
< GitHub182> bitcoin/master 49fb8e8 Pieter Wuille: Documentation updates for BIP 130
< GitHub182> bitcoin/master 5d5ef3a Pieter Wuille: Merge pull request #7129...
< GitHub38> [bitcoin] sipa closed pull request #7129: Direct headers announcement (rebase of #6494) (master...direct-headers-announcement) https://github.com/bitcoin/bitcoin/pull/7129
< GitHub105> [bitcoin] sipa closed pull request #6494: Allow block announcements with headers (master...direct-headers-announcement) https://github.com/bitcoin/bitcoin/pull/6494
< phantomcircuit> morcos, when a block is invalidated are we updating CTxMempoolEntry::hadNoDependencies properly?
< phantomcircuit> morcos, optimal is probably very hard
< phantomcircuit> morcos, knapsack problem with at least two optimization variables :|
< sipa> s/optimal/an approximation of optimal with reasonable computational limits/