< jimmysong>
storing that data in the db for a full node so it can be given to a peer that requests it
< sipa>
a bloomfilter (or gcs filter) can't store any data; it's just a probabilistic structure that lets you query for set elements
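For concreteness, a minimal sketch of that query-only idea as BIP158-style GCS filters use it: every element is hashed into a shared integer range, and membership is a lookup with a roughly 1/M false-positive rate. The hashing below is an illustrative stand-in (BIP158 itself uses SipHash keyed by the block hash and Golomb-Rice codes the sorted values):

```python
import hashlib

M = 784931  # BIP158's false-positive parameter for the basic filter

def hash_to_range(element: bytes, n: int, key: bytes) -> int:
    # Map an element uniformly onto [0, n*M). Illustrative only: the real
    # construction uses SipHash-2-4 with the block hash as the key.
    h = int.from_bytes(hashlib.sha256(key + element).digest()[:8], 'little')
    return (h * (n * M)) >> 64

def build_filter(elements, key: bytes):
    # The wire format Golomb-Rice codes the deltas between these sorted
    # values; a plain sorted list shows the same probabilistic structure.
    n = len(elements)
    return n, sorted(hash_to_range(e, n, key) for e in elements)

def matches(filt, element: bytes, key: bytes) -> bool:
    # A miss is definitive; a hit is either real or a ~1/M false positive.
    n, values = filt
    return hash_to_range(element, n, key) in values
```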
< sipa>
a full node has no need for that data
< jimmysong>
a full node doesn't, but it'd be useful for a light client wanting to verify a cfilter hash
< sipa>
it is stored on disk though, in the undo block data in bitcoin core, for as long as the block itself is stored
< sipa>
ah, for verification it should just be committed to
< jimmysong>
a coinbase commitment is better, but is that something that's got a proposal?
< sipa>
it's pretty trivial to do, but much easier to get agreement on once bip157 itself is deployed and used
< sipa>
i expect
< jimmysong>
makes sense
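For reference, the committed structure BIP157 already defines is a chain of filter headers, which is presumably what a coinbase commitment would anchor. A minimal sketch:

```python
import hashlib

def sha256d(b: bytes) -> bytes:
    return hashlib.sha256(hashlib.sha256(b).digest()).digest()

def next_filter_header(filter_bytes: bytes, prev_header: bytes) -> bytes:
    # BIP157: filter_header = double-SHA256(filter_hash || prev_header),
    # with 32 zero bytes as prev_header at the genesis block. A client
    # holding one trusted header can verify every filter beneath it by
    # replaying this chain.
    return sha256d(sha256d(filter_bytes) + prev_header)
```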
< jimmysong>
i'm asking because i'm trying to write a wallet that uses neutrino
< jimmysong>
the annoying part is the verification
< jimmysong>
but getting the filter, checking against my script pubkeys, etc is great
< jimmysong>
much better for privacy
< sipa>
the node giving you prevout scripts along with the filter doesn't let you verify anything
< sipa>
they can forge anything
< jimmysong>
it lets you verify that the filter bytes are what they said
< jimmysong>
and you can check that the scripts for the inputs verify
< sipa>
no, because the prevout scripts can be forged
< sipa>
there is nothing committing to them
< jimmysong>
wait, if you have a signature, you can make a pubkey that works with it?
< jimmysong>
err, rather, given a pubkey, you can make a hash preimage?
< jimmysong>
the scriptsig/witness for p2pkh/p2wpkh is sig, pubkey
< jimmysong>
so you'd have to forge a preimage of the pubkey
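A sketch of the consistency check jimmysong is describing, for the P2PKH case: the scriptSig reveals the pubkey, so serving a forged prevout script would require a hash160 preimage. Function and field names here are illustrative:

```python
import hashlib

def hash160(b: bytes) -> bytes:
    # ripemd160 availability depends on the local OpenSSL build
    return hashlib.new('ripemd160', hashlib.sha256(b).digest()).digest()

def p2pkh_prevout_consistent(script_pubkey: bytes, pubkey: bytes) -> bool:
    # P2PKH scriptPubKey layout:
    #   OP_DUP OP_HASH160 <20-byte hash> OP_EQUALVERIFY OP_CHECKSIG
    # A peer serving a forged prevout script for this input would need a
    # pubkey hashing to the forged 20 bytes, i.e. a preimage.
    return (len(script_pubkey) == 25
            and script_pubkey[:3] == b'\x76\xa9\x14'
            and script_pubkey[23:] == b'\x88\xac'
            and script_pubkey[3:23] == hash160(pubkey))
```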
< sipa>
oh, you mean full script validation?
< jimmysong>
yes
< sipa>
that shouldn't be something a light client cares about
< jimmysong>
if it helps verify the cfilter hash, why not?
< jimmysong>
commitment is obviously preferable
< jimmysong>
but if we don't have that?
< roasbeef>
jimmysong: yeh there's a half proposal on the ML to add that, the prior versions had the outpoint so they were fully verifiable; as it is now you can verify half of it
< sipa>
what is your attack model?
< roasbeef>
differentiate honest peers from dishonest peers
< sipa>
you don't want to get the prevout script for every block, right?
< roasbeef>
it's not full script validation, it's verifying the filter is correct
< sipa>
sure, but you'd need a full script interpreter for doing so
< roasbeef>
yeh you'd fetch the block with the prior scripts and values while you're at it; w/ the values you can verify fees to a degree as well
< roasbeef>
no need to verify the script, just to verify a filter actually fully matches a given block
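A sketch of that reconstruct-and-compare check, assuming hypothetical block/transaction objects and a `build_basic_filter` helper: a BIP158 basic filter covers the block's output scriptPubKeys (OP_RETURN excluded) plus every spent prevout's scriptPubKey. It catches a peer lying about the filter bytes, but nothing commits to the claimed prevout scripts themselves, which is sipa's objection:

```python
def verify_served_filter(block, claimed_prevout_scripts,
                         served_filter: bytes, build_basic_filter) -> bool:
    # Rebuild the filter from the block plus the peer's claimed prevout
    # scripts and compare byte-for-byte against what the peer served.
    scripts = set(claimed_prevout_scripts)
    for tx in block.transactions:
        for out in tx.outputs:
            # BIP158 excludes empty and OP_RETURN (0x6a) output scripts
            if out.script_pubkey and out.script_pubkey[:1] != b'\x6a':
                scripts.add(out.script_pubkey)
    rebuilt = build_basic_filter(scripts, key=block.hash[:16])
    return rebuilt == served_filter
```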
< jimmysong>
i was thinking full script interpreter
< jimmysong>
which isn't trivial
< sipa>
roasbeef: sure, that makes sense; but jimmysong was suggesting script validation, as without it, the prevout data can be trivially forged
< roasbeef>
txscript.VM ;)
< sipa>
jimmysong: but for which blocks would you get this prevout data?
< jimmysong>
the blocks you download when you search for your utxos
< roasbeef>
blocks for which you get conflicting advertisements
< jimmysong>
that too
< sipa>
jimmysong: a peer can still lie and give you a filter with no matches at all
< roasbeef>
you can fetch the block and verify the outputs are properly included, but not fully the inputs
< sipa>
yes, for conflict detection it makes sense, and you need no script validation etc
< roasbeef>
main purpose is conflict detection
< roasbeef>
harding: the prev outs are in the undo blocks
< roasbeef>
this was a concern during the switch over (that it would be slower to construct or possibly impossible), but the data is still around
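Roughly what that per-input undo record preserves, sketched as a dataclass; the field names are illustrative rather than Core's serialization format:

```python
from dataclasses import dataclass

@dataclass
class SpentCoin:
    # What disconnecting a block needs per spent input, and incidentally
    # exactly what a light client wants served alongside a block.
    script_pubkey: bytes     # the spent output's script
    amount_sats: int         # its value, enough for fee checks
    creation_height: int     # metadata kept for reorg handling
    is_coinbase: bool
```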
< jimmysong>
what's the objection to writing a script interpreter?
< jimmysong>
is it because of all the edge cases?
< roasbeef>
many already exist
< jimmysong>
i've written a partial one
< jimmysong>
i still don't get op_codeseparator
< roasbeef>
code sep should die, ppl should stop trying to save it lol
< jimmysong>
btw, roasbeef, those test vectors for neutrino broke my script interpreter =)
< sipa>
jimmysong: in my view script interpreting is so easy to get wrong, it should only be done by full nodes
< jimmysong>
it is easy to get wrong, but isn't there value to at least interpreting some subset?
< jimmysong>
p2wsh/p2sh stuff can get pretty complicated
< sipa>
i don't think so
< sipa>
a wallet can recognize its own outputs, but that doesn't need interpretation
< roasbeef>
you can't even verify the outputs fully exist, so script verification is w/e; the main reason for getting the prevouts is filter verification and fee awareness
< sipa>
for debugging purposes it can be useful of course, but imo that's it
< jimmysong>
so wallets should stick to standard scripts, or whatever has been analyzed beforehand
< sipa>
well they construct them themselves
< sipa>
so you don't need an interpreter to find their semantics
< jimmysong>
right, the ones analyzed beforehand by the programmer
< sipa>
and you won't trigger any edge cases unless you go out of your way to trigger them
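Sipa's point in sketch form: recognizing a wallet's own outputs is exact byte comparison against the scriptPubKeys the wallet derived itself, with no interpreter in sight. The object shapes are hypothetical:

```python
def scan_block_for_wallet(block, watched_scripts: set) -> list:
    # No script interpretation needed: the wallet constructed these
    # scriptPubKeys, so a match is a plain equality test.
    hits = []
    for tx in block.transactions:
        for vout, out in enumerate(tx.outputs):
            if out.script_pubkey in watched_scripts:
                hits.append((tx.txid, vout))
    return hits
```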
< jimmysong>
in any case, if the outpoint data is in the undo blocks, that can be made available?
< roasbeef>
yep, just add a new inv type, and message format
< jimmysong>
sure, you need the full tx's to verify against the hashes, which as harding pointed out can get pathologically large in edge cases
< jimmysong>
similar to jj's attack from breaking bitcoin last year
< sipa>
jimmysong: needing the full txn is unrealistic
< sipa>
that data isn't available in general, and expensive to index
< roasbeef>
why do you care about the hashes? get the script+value, verify the sig+fee, reconstruct filter, dunzo
< roasbeef>
ofc if the witness commitment was to a merkleized version of the transaction, you could fetch that proof as well, but more bytes there; what we have available atm is good enough to achieve the goal of cross checking
< sipa>
in general we should aim to only expose data that is verifiable or can be made verifiable
< sipa>
and adding commitments to undo data seems overkill, when all that's needed is a commitment to the filters
< jimmysong>
yes, that's the preferred solution
< roasbeef>
yeh not undo, just filters eventually; i mean as in if the wtxid was a merkle tree with leaves of the inputs/outputs etc
< roasbeef>
would also let you have a more compact proof of spentness, especially if things are also mega super coin-joiny in the future
< jimmysong>
wow, good point
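The primitive that compact proof of spentness would lean on is an ordinary merkle branch check; the merkleized transaction itself is hypothetical, but the verification is standard:

```python
import hashlib

def sha256d(b: bytes) -> bytes:
    return hashlib.sha256(hashlib.sha256(b).digest()).digest()

def verify_merkle_branch(leaf: bytes, branch, index: int, root: bytes) -> bool:
    # Hash up from the leaf, taking each sibling on the left or right
    # according to the index bits. If a wtxid were a merkle tree over
    # inputs/outputs, proving one input spent would cost one leaf plus
    # this log-sized branch instead of the whole transaction.
    h = leaf
    for sibling in branch:
        h = sha256d(sibling + h) if index & 1 else sha256d(h + sibling)
        index >>= 1
    return h == root
```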
< sipa>
i think all these adhoc methods of verifying filters break down if you look at them in an adversarial setting... you can use heuristics and download from multiple peers, and detect conflicts, and spot check here and there... but those approaches all add complexity and bandwidth proportional to the level of guarantee they give
< roasbeef>
yep those other approaches are just plain easier to mess up, i prefer dead simple methods
< sipa>
the only foolproof, cheap, ... way is a filter commitment in blocks
< jimmysong>
yep, which costs something for a miner, but not very much. but it is a soft fork
< jimmysong>
any idea how that would be received?
< gmaxwell>
the bigger limit there is just that there needs to be a higher level of confidence in the design, since removing the requirement is a hardfork. Fortunately, the design as it now stands has actually had a fair level of refinement.
< roasbeef>
jimmysong: filter commit in op return, message to fetch header along w/ path for coinbase transaction? i think the filter chain would prob still have some use in that case as well
< gmaxwell>
I've suggested in the past potentially doing a kind of rolling softfork, where the commitment is required to be valid only if it's provided and only until some height. And so long as people keep using it and it isn't replaced with something better, we just keep soft-forking in updated heights.. but if ever it gets replaced, it can just be allowed to expire.
< jimmysong>
that would be a nice escape clause
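One reading of that rolling-softfork rule, sketched with hypothetical names and an arbitrary expiry height: a commitment is optional, must be correct when present, and the requirement lapses unless a later soft fork extends it:

```python
EXPIRY_HEIGHT = 800_000  # hypothetical; a later soft fork would extend it

def rolling_commitment_rule(commitment, correct_commitment, height: int) -> bool:
    # commitment is None when the block omits it. Below the expiry height
    # a present commitment must be correct; past expiry the rule lapses,
    # so abandoning the scheme needs no hard fork.
    if height >= EXPIRY_HEIGHT:
        return True
    if commitment is None:
        return True
    return commitment == correct_commitment
```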
< roasbeef>
oooOOo, i guess the witness commitment is kinda like that for blocks w/o any witness data
< sipa>
jimmysong: i think it's necessary; without commitment to the filters you can basically only use it locally or with a trusted full node
< sipa>
which is pretty useful by itself, but not nearly the same
< gmaxwell>
I'm very glad, e.g. that nothing about BIP37 ended up softforked in... that protocol turned out to be a lemon in a number of ways, but it took a couple years of use to realize that.
< roasbeef>
well you can still use it on the open net, you have the same "one honest peer" assumption going on; the scripts just help you distinguish between zero and one honest peer
< gmaxwell>
roasbeef: even 1 honest peer requires you to have some kind of complex resolution to deal with disagreement, also-- because the p2p network is trivially sybilable, 'one honest peer' probably doesn't mean much unless you're doing something like manually configuring a trusted one.
< gmaxwell>
One should also consider the effect of incentivizing various kinds of trouble making. (like generally we've found that when we added vulnerabilities like BIP37, attackers emerged that didn't exist previously, and then made more trouble even for people that didn't care about using BIP37)
< roasbeef>
gmaxwell: you'd just fetch the blocks+scripts and reconstruct the filter; the commitment to the filters (the filter chain) helps you notice if something is funky at a glance
< roasbeef>
but then again, for smaller values you're prob not super concerned about this stuff
< gmaxwell>
roasbeef: matching the spent inputs too requires fetching the inputs, which as sipa pointed out above is intractable in the worst case (and also astonishingly expensive even in the normal case)
< roasbeef>
yeh the assumption is a new inv type to allow you to fetch the input scripts with the block, i guess i'm missing this worst case scenario?
< gmaxwell>
keeping also in mind that this stuff is saving you 30kbit/sec of bandwidth over downloading the whole blocks... in the ongoing case.
< gmaxwell>
roasbeef: we have no way to provide that, and even if we did it couldn't be validated without fetching potentially gigabytes of additional data (you actually have to fetch the whole transactions).
< roasbeef>
ahh ok yeh i was missing the outpoints in the equation
< sipa>
roasbeef: what if you receive two different filters, and request block/prevout data for both, and they are both correct? (and it turns out you received one true prevout data and one false prevout data, but with correct filters for that data)
< roasbeef>
well can still verify half of it today ;)
< gmaxwell>
roasbeef: yes, half indeed. and really the more useful half.
< gmaxwell>
but at the expense of fetching the whole block.
< sipa>
that's still detectable, but you're reduced to a majority vote kind of model, and at a possibly very high bandwidth cost
< gmaxwell>
which also means that it really only takes one clown with an aws instance to cause a lot of users to fetch the blocks anyways. I mean, perhaps not totally useless. But as I mentioned, vulnerabilities seem to attract attackers.
< gmaxwell>
so then you're left with the work of implementing the validation, testing it, dealing with vulnerabilities in it... and some clown spins up a bunch of sybils and everyone is downloading the entire blocks anyways. And-- the sybils themselves act as a lasting nuisance. I don't mean to argue against the non-committed usage, but considering the all-in effects it's not the obvious big win that it might seem at a glance.
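The detect-and-resolve loop under discussion, sketched with hypothetical peer objects: unanimity costs nothing extra, while any disagreement forces a full-block download, which is exactly how one sybil makes everyone fetch blocks anyway:

```python
def resolve_filter_conflict(peers, height, fetch_block, rebuild_filter):
    # Ask every peer for the filter at this height.
    answers = {peer: peer.get_cfilter(height) for peer in peers}
    if len(set(answers.values())) == 1:
        # Unanimous; still only as strong as the one-honest-peer assumption.
        return next(iter(answers.values()))
    # Disagreement: pay the bandwidth, reconstruct the truth, drop liars.
    block = fetch_block(height)
    truth = rebuild_filter(block)
    for peer, answer in answers.items():
        if answer != truth:
            peer.disconnect()
    return truth
```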
< gmaxwell>
and the historical rate of protocol additions introducing vulnerabilities (either in implementation or design) is really high...
< roasbeef>
it's a win for full nodes at least, serving the filters is much less intensive (and also stateless) compared to serving bip37, you also can't trigger worst case matching behavior over the entire chain
< gmaxwell>
roasbeef: quite a few nodes just disable BIP37 completely (which seemed to stop the BIP37 based attacks)
< roasbeef>
yeh i ended up doing that on my testnet nodes, seemed someone was practicing their attacks on testnet lol
< gmaxwell>
(I'm not disagreeing with your point though)
< gmaxwell>
roasbeef: they were doing it on mainnet for a while. Though they seemed to give up after even a small set of their targets started disabling them. To the extent that they might have been targeting miners to cause block orphaning, that makes sense, but otherwise it's not really clear why it stopped.
< gmaxwell>
Perhaps because they realized if they kept it up BIP37 was just going to end up removed and then they'd lose their toy. Who knows.
< fanquake>
Clearly not bothering to read my comment, or from what I can tell, anything related to actually contributing to the project.
< gmaxwell>
I'd recommend just closing and locking the PR. They aren't following the guidelines, and there is a good chance that its just trolling.
< gmaxwell>
no different than any other driveby pr
< gmaxwell>
and especially due to the principle that the project isn't a place that we'll tolerate other people turning into their performance art or battleground.
< gwillen>
fanquake: if you are able, it seems like preemptively locking might be a good idea.
< fanquake>
I was going to leave it for a reply, but have done it now.
< gwillen>
the quality of his previous comment doesn't make me sanguine
< gwillen>
also, the account that hit 'approve' looks like a sockpuppet
< fanquake>
wumpus are you around tonight?
< bitcoin-git>
[bitcoin] hebasto opened pull request #15038: docs: Get more info about GUI-related issue on Linux (master...20181226-issue-template-gui-linux) https://github.com/bitcoin/bitcoin/pull/15038
< wumpus>
fanquake: maybe a bit
< bitcoin-git>
[bitcoin] MarcoFalke opened pull request #15039: wallet: Avoid leaking nLockTime fingerprint when anti-fee-sniping (master...Mf1812-walletLocktimeFingerprint) https://github.com/bitcoin/bitcoin/pull/15039
< fanquake>
wumpus np, bunch of PRs mergeable, but can deal with em later
< wumpus>
PSA: there's no meeting today (there was confusion about this last week)
< hebasto>
wumpus: today is Wednesday. Do you mean tomorrow?
< wumpus>
oh sorry yes I mean tomorrow
< fanquake>
only Wednesday for another 41 minutes anyway
< sipa>
wumpus: no, you're right, no meeting today!
< andytoshi>
are there any guarantees about the order of `getrawmempool` output? in particular do ancestors always precede descendants?
< instagibbs>
from my cursory reading they are indexed in 4 different ways, none of which are related to ancestors/descendants
< andytoshi>
kk thanks, i won't rely on that then
< instagibbs>
salted txid, feerate (including descendants), entry time, and feerate (including ancestors)
< andytoshi>
i mean, somehow when creating blocks they have to wind up in ancestor order ... is there an explicit sorting step then?
< instagibbs>
they grab "packages" as they sort via package feerate
< instagibbs>
there are internal links between the entries, just not exposed here afaik
< andytoshi>
ah, yep, that makes sense
< andytoshi>
my goal here is to recreate the packages from the output of getrawmempool
< sipa>
andytoshi: i'm pretty sure they're sorted by increasing total number of (recursive) unconfirmed dependencies, and then by feerate
< sipa>
and then by txid as tiebreaker or so
< sipa>
which guarantees that dependencies always come before the dependents (?)
< sipa>
that code is not shared with the block creation code, btw
< instagibbs>
andytoshi, also try getmempoolentry which comes with more details?
< andytoshi>
instagibbs: oh, yeah (or `getrawmempool true` which gives me the same details). will take a look at that to see if it's useful. i suspect not, i think i need to manually recreate a lot of this data in my code because i need a bunch of extra details, like the set of yet-unspent inputs/outputs for the whole package
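For the package-reconstruction part, the verbose form may be enough on its own: each `getrawmempool true` entry carries a `depends` list of in-mempool parents, so ancestor-before-descendant order falls out of a topological sort. A sketch:

```python
from collections import deque

def ancestor_order(mempool: dict) -> list:
    # mempool: txid -> verbose entry from `getrawmempool true`.
    # Kahn's algorithm over the "depends" edges yields an order in which
    # every ancestor precedes its descendants.
    unmet = {txid: set(e["depends"]) for txid, e in mempool.items()}
    children = {txid: [] for txid in mempool}
    for txid, parents in unmet.items():
        for parent in parents:
            children[parent].append(txid)
    ready = deque(t for t, parents in unmet.items() if not parents)
    order = []
    while ready:
        txid = ready.popleft()
        order.append(txid)
        for child in children[txid]:
            unmet[child].discard(txid)
            if not unmet[child]:
                ready.append(child)
    return order
```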
< andytoshi>
sipa: cool, thanks. but i guess that's an implementation detail and if i were to write production software that depended on it, you'd be annoyed :)
< andytoshi>
i wish there was some way i could signal core that i don't want certain outputs to be 0conf-spendable (or if they are, that i don't want cpfp rules to be applied)
< gmaxwell>
andytoshi: encountering the RBF pinning problem?
< andytoshi>
the issue is that if my software is making a package A of transactions and doing all sorts of CPFP logic to make that sensible, and meanwhile some customer of mine is creating a package B with a massive low-fee sweep or whatever
< andytoshi>
and then that customer creates a tx spending an output of A alongside the output of B...
< andytoshi>
...those packages become merged and suddenly my logic has been blindsided
< gmaxwell>
that isn't how the implementation works.
< gmaxwell>
I think Russell is like... reasoning from the feature's name.
< andytoshi>
well, the code is not super straightforward to someone uninvolved with it. the mempool logic related to the descendant limit looks kinda like it would do stuff like what i described
< gmaxwell>
The descendant limit stuff can do things along those lines, but requires actually hitting the descendant limit.
< gmaxwell>
If the limits are getting hit by ordinary usage we should look into fixing that. They were set so they were never hit when established (except for some obviously dumb floody crap), and only exist to prevent a bad computational blowup in the tracking code.
< instagibbs>
one thing to note is that a single ~100kvB sweep can pretty much hose the descendant size limit
< gmaxwell>
so fix it?
< gmaxwell>
IIRC the reason for the size limits in the tracking is just so it doesn't falsely credit parents for feerate coming from transactions that are never going to fit in the same block.
< andytoshi>
maybe i'm confused about cpfp. my understanding is that if i make a tx with outputs controlled by other people, those other people are able to grief me and undermine my ability to use cpfp
< andytoshi>
by extending the package such that i'd be hitting limits
< gmaxwell>
yes, though also at the expense of delaying their own transaction confirmation.
< gmaxwell>
I don't believe we've ever seen cpfp 'griefing' reported.
< gmaxwell>
RBF pinning for sure, because a common usage pattern immediately causes it.
< gmaxwell>
The limits exist only because there are computational overheads in the tracking, e.g. when removing a transaction its ancestors and descendants need to be walked to update their tracking.
< andytoshi>
sorry for the dumb questions, but can you clarify - if i'm making cpfp packages, and one of my customers 0conf spends one of the outputs i create, will their transaction pull my effective feerate toward the feerate of that tx? if so, then i need logic to reason about that, and by nature that logic needs to know about the limits (even though in practice i don't expect anyone to pull me close to them)
< andytoshi>
or is it safe if i just make my own transactions that chain off each others' change outputs, and ignore everything else?
< andytoshi>
maybe i should just pester sipa in person in a couple of weeks :)
< andytoshi>
and i will try to write down what i'm learning as i do this
< gmaxwell>
No, it will not.
< gmaxwell>
The parent's effective feerate is the highest feerate you can construct with it.
< andytoshi>
ok, i think i've got it
< gmaxwell>
They can, if they spam out to the limits, prevent new descendants from being taken.
< gmaxwell>
But thats only in the case that the tracking limits are hit.
< andytoshi>
so is this a rough high-level view of cpfp in core?: (a) "packages" only exist during miners' transaction selection; in the case that a transaction might be in multiple packages, they're computed greedily to maximize feerate; (b) but when accepting to the mempool, Core checks whether a transaction might cause a limit-violating package to exist, and if it would, the tx is rejected
< gmaxwell>
close enough; the limits aren't really limits on 'packages', they're more limits on the tracking datastructures used to create packages.
< andytoshi>
yep, that's what i meant
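The (a) half of that summary, sketched: score each transaction by its ancestor package's aggregate feerate and greedily take the best, which is the max operation gmaxwell mentions; a low-fee child only ever adds a worse-scoring package, it never drags the parent's own score down. The dict shape is illustrative, not Core's data structures:

```python
def next_package(mempool: dict):
    # mempool: txid -> {"fee": sats, "vsize": vbytes,
    #                   "ancestors": set of unconfirmed ancestor txids}
    def ancestor_feerate(txid):
        group = mempool[txid]["ancestors"] | {txid}
        fee = sum(mempool[t]["fee"] for t in group)
        vsize = sum(mempool[t]["vsize"] for t in group)
        return fee / vsize, group
    best = max(mempool, key=lambda t: ancestor_feerate(t)[0])
    return ancestor_feerate(best)[1]  # include this set next, then repeat
```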
< bitcoin-git>
[bitcoin] hebasto opened pull request #15040: qt: Add workaround for QProgressDialog bug on macOS (master...20181226-fix-macos-qprogressdialog) https://github.com/bitcoin/bitcoin/pull/15040
< jnewbery>
andytoshi: I think that's mostly right. In (a), the miner is ordering by ancestor feerate (see BlockAssembler::addPackageTxs in miner.cpp). In (b), all of the {(ancestor|descendant) (count|size)} are taken into account (see CTxMemPool::CalculateMemPoolAncestors() in txmempool.cpp)
< gmaxwell>
jnewbery: your last statement sounds confusing, and plays into roconnor's misunderstanding.
< gmaxwell>
Basically what roconnor was thinking was that if there is an unconfirmed txn and then I add a gigantic low feerate child to it, I lower the feerate of the txn because the "package" has a lower feerate.. And that is not how it works, because of the max operation in the combining.
< jnewbery>
(b) is not looking at feerate. Just tx count and size