#bitcoin-core-dev on 2018-08-12 — searchable irc log

06:05 < jonasschnelli> gmaxwell: yes. rekeying is done after a fix amount of traffic in bytes. But re-hashing the secret would not change anything if ECDA of logged traffic can be broken?

06:33 < gmaxwell> jonasschnelli: right, we should also perhaps consider rekeying once an hour. What rekeying accomplishes, assuming the old key is deleted, is that if a system is compromised you can't extract the keys from memory to decrypt traffic you logged before compromise.

06:38 < gmaxwell> The reason I suggest triggering on time is because if you have e.g. a SPV client, it might be days until it has transfered 1GB of traffic. which might make it interesting to try to go seize other nodes a target under observation was connected to in order to decrypt their traffic. Admittedly a really fring risk, but it should be ~free to avoid.

06:39 < gmaxwell> (basically a similar motivation to why we don't log IPs by default)

06:40 < sipa> the only reason not to rekey on every message is performance, right?

06:41 < gmaxwell> Right.

06:41 < gmaxwell> sha2 is slower than chacha.. :)

06:42 < gmaxwell> interestingly, I'm not aware of any well known cipher mode which natively has irreversable state.

06:42 < sipa> chacha takes a 256 bits key, and produces blobs of 512 bits of output

06:43 < sipa> why not say encrypt every message with the current encryption key, and then afterwards extract another 256 bits from the stream, which become the new encryption key?

06:43 < sipa> chacha has 0 initialization cost

06:44 < gmaxwell> Because thats not a well studied construct. it would also be 50% of the speed of using it normally.

06:46 < sipa> why would it be slower?

06:46 < sipa> ah, if you assume the messages are small i guess

06:47 < gmaxwell> ah I thought you meant per 512 bits of output rather than per protocol message.

06:48 < gmaxwell> we could also email djb to ask, might well be that someone has published on a mode that does this. Though I think elsewhere where this concern was addressed, it was always just addressed by rekeying from a higher level rather than at the block cipher level.

06:50 < gmaxwell> Though as I was saying, I think it's kind of a fringe concern, if we want to do something complicated, I'd rather it be armoring against ECDH break then N-th level optimizations to how fast we forget keying material.

06:51 < gmaxwell> (or even better, get the indistinguishable authentication protocol finished)

06:55 < sipa> right; i'm mostly wondering why "use a prng-based stream cipher, and after each message, read the next encrpytion key from the stream" isn't a common construction

06:55 < gmaxwell> Because almost everything has key init costs?

06:56 < gmaxwell> also because the whole reason you normally use a stream cipher is random access.

06:57 < gmaxwell> there have been some 'reuse resistant' quasi-stream-cipher proposals perhaps some of those get irreversability as a side effect. dunno.

09:51 < fanquake> wumpus 13938 should be ok to go in

09:57 < fanquake> Also 13808

09:59 < Varunram> is the bot dead?

10:58 < jonasschnelli> gmaxwell, sipa: the new protocol does have encryption optional, therefore the question rises if detecting the key handshake versus a version message is sane

10:58 < jonasschnelli> I guess its acceptable to assume a version message (an not a key) when we detect a message magic and the rest of a legacy header

10:59 < jonasschnelli> I guess it's almost impossible to derive a pubkey with the network magic & version part of the header

13:11 < reald0ff1> hi

13:12 < reald0ff1> Can someone please provide me download stats (or at least share in %) of bitcoin core, regarding the different platform versions (Win, Linux, OSX, etc) ?

13:13 < reald0ff1> that would be very helpful for my master thesis

13:20 < reald0ff1> would very appreciate it, if someone could help me with that question

13:20 < harding> reald0ff1: I don't know if anyone has that information for BitcoinCore.org, sorry. In addition, the binaries can also be downloaded from Bitcoin.org (maintained by a different team) or via a torrent (with optional magnet URI) that contains the binaries for all platforms.

13:34 < reald0ff1> well thanks for the answer. I think I will try to contact bitcoincore.org via email and bitcoin.org also via email to the website maintainer. Maybe some of them could provide me stats

13:35 < reald0ff1> I developing a security tool for cryptocurrency users and I selected windows as target plattform. I have the feeling that the most users use windows (I am not talking about devs, etc.)

13:36 < reald0ff1> however, would be still nice to have some stats to prove that "feeling"

14:03 < devmob> hi, I'd really like to know how bitcoin does gossip, like how the gossip protocol is implemented

14:03 < devmob> can someone point me somewhere ?

15:47 < itaseski> devmob: https://en.wikipedia.org/wiki/Gossip_protocol

15:47 < itaseski> is this helpful?

15:48 < itaseski> it is general but it explains how gossip protocols work

15:48 < itaseski> i wasn't able to find anything bitcoin specific ...

15:59 < sipa> he left. also, https://bitcoin.stackexchange.com is your friend

17:29 < gmaxwell> jonasschnelli: we could just discard keys that match the strictest version handshake pattern we can come up with, no biggie

17:30 < gmaxwell> sipa: on the encryption subject, libsodium has avx2 and ssse3 chacha20 implementations: https://github.com/jedisct1/libsodium/tree/1.0.16/src/libsodium/crypto_stream/chacha20/dolbeau

17:31 < gmaxwell> and sse2 poly1305 https://github.com/jedisct1/libsodium/tree/1.0.16/src/libsodium/crypto_onetimeauth/poly1305/sse2

18:35 < jonasschnelli> gmaxwell: about dolbeau's Chacha20 AVX/SSSE implementation: "Beware: those implementations are purely designed for speed on recent Intel architectures (mostly Haswell and newer), and ARMv8 (64 bits) with the crypto extension. They were not verified to be resistant to side channel attacks."

18:36 < jonasschnelli> The later would probably require further analysis since the timing side channel attackes seems to be one of the big benefits of chacha20 (I may be wrong though)

18:36 < jonasschnelli> *attack resistance

19:20 < MarcoFalke> sipa: Wouldn't the stempool make every mempool action half as fast (since everything would have to be done once for the mempool and then for the stempool)

19:20 < MarcoFalke> Also I am not sure about the memory overhead of having the mempool duplicated

19:20 < MarcoFalke> The transactions are shared ptrs, but still...

19:20 < sipa> MarcoFalke: well, dandelion needs some way of dealing with unconfirmed dependencies

19:22 < sipa> MarcoFalke: the reference code the authors posted included a stempool, though i commented on memory usage concerns

19:23 < MarcoFalke> I know that the BIP mentions a stempool

19:23 < MarcoFalke> Agree that we need to handle dependencies

19:24 < MarcoFalke> I was hoping we could do a primitive cache for now and later replace that with https://github.com/bitcoin/bitcoin/pull/13804 (tx pool layer)

19:24 < sipa> an alternative would be to have a 2-tier mempool, where each transaction has a flag whether it's public or not

19:25 < sipa> and accepting a public tx ignores (and kicks out) any nonpublic conflicts

19:26 < MarcoFalke> That sounds like every single line of txmempool.cpp had to be amended with an if(public) else ...

19:27 < sipa> i doubt that, tbh

19:27 < sipa> most of it is just data structurr maintenance which would be unaffected

19:27 < sipa> but i don't think it'd be a trivial change either

19:28 < sipa> the set of non-public transactions in general should be very small

19:29 < sipa> as you expect every non-public tx to become a public one after some time (and the auto fluff after timeout essentially guarantees that after some timeout)

19:30 < sipa> so perhaps the "stempool excluding mempool" can be small and have lower consistency requirements

19:30 < sipa> like, we run ATMP to accept things into it, but don't require it is at all times consistent with the actual mempool

19:30 < sipa> as things expire quickly from the extra set, it can have a tight memory limit and not much avenue for dos

19:31 < MarcoFalke> sipa: The dos protection should happen per edge (peer) and not on the global mempool, no?

19:32 < MarcoFalke> The stempool limit would only be a fallback limit

19:32 < MarcoFalke> We wouldn't want one peer use up all the stempool capacity

19:32 < sipa> right

19:33 < MarcoFalke> Also, I am certain that we leak information by using the global (shared among all peers) stempool

19:33 < sipa> so perhaps it can even be a per-peer small set of unconfirmed dandelion txn, which you use to do dependency checks for dandelion txn coming from that peer

19:34 < sipa> which has much clearer privacy and dos reasonong

19:34 < MarcoFalke> You'd forward but then later discard dandelion txs

19:34 < sipa> well the combined set of those extra txn is your set of to-fluff things

19:35 < MarcoFalke> So if an attacker send the same dandelion tx twice with a rbf one on another route they can guess part of the route

19:35 < sipa> how so?

19:35 < MarcoFalke> (talking about the shared mempool) Not the per-peer set of txs

19:36 < sipa> i like the per-peer set :)

19:36 < sipa> i think you're right that there is risk in a global stempool

19:36 < sipa> the per-peer set sounds like it wouldn't need much more than a way to pass in an entra map with txn to ATMP

19:37 < MarcoFalke> That would have a compute overhead

19:37 < sipa> hardly, i think

19:38 < MarcoFalke> (re-calculating the set of dependencies for all txs)

19:38 < MarcoFalke> just to check on tx

19:38 < sipa> no no

19:38 < MarcoFalke> why?

19:39 < sipa> just something that feeds into the lookup of utxos being spent logic

19:39 < sipa> "if not found in mempool or chainstate, also look here'

19:40 < sipa> but you don't do complete conflict analysis or replacement or whatever in those extra sets

19:40 < MarcoFalke> So you could send a tx that spends and output and the output that was used to create that output (assuming 1in-1out txs for now)?

19:40 < MarcoFalke> s/and/an

19:41 < sipa> right

19:41 < sipa> perhaps you could even permit double spends inside the extra set

19:41 < MarcoFalke> So a peer could drain your allowance

19:41 < MarcoFalke> for free

19:41 < sipa> what allowance?

19:42 < MarcoFalke> "allowance" = txs your dandelion destinations accept

19:42 < MarcoFalke> num tx/minute or whatever

19:42 < sipa> i'm confused

19:43 < MarcoFalke> I think we just concluded that the "cheap check" (pass in set of previous txs) can lead to thinking an invalid tx is valid

19:43 < MarcoFalke> so we'd forward invalid txs

19:43 < sipa> right

19:44 < sipa> well, not invalid

19:44 < sipa> but conflicting, yes

19:44 < MarcoFalke> They'd never be accepted to a real mempool

19:44 < MarcoFalke> never as is invalid consensus

19:44 < sipa> that depends on the order those extra txn get added to people's mempool

19:44 < sipa> no, you do full consensus validation

19:45 < MarcoFalke> So you need to calculate all mempool dependencies and stuff

19:45 < sipa> how so?

19:46 < sipa> validity is just a) can we find the inputs b) are those inputs not yet spent by another mempool txn c) do scripts validate

19:46 < sipa> i suggest skipping just (b) for dandelion relay

19:47 < MarcoFalke> Though, for a) you use the set of {mempool inputs} OR {prev dandelion txs inputs}

19:47 < sipa> right

19:47 < MarcoFalke> so if a dandelion txs spent mempool tx

19:48 < sipa> but whenever the mempool changes, you don't update the extra sets, so they can grow inconsistent with eachother

19:48 < sipa> but i don't think that's a problem; you'll notice when trying to fluff

19:48 < MarcoFalke> hmm, give me a sec

19:48 < MarcoFalke> How do I draw a picture in irc?

19:49 < sipa> haha

19:50 < sipa> we should discuss this on the ML though

19:54 < MarcoFalke> Assume mempool has one output: A. Assume dandelion tx spends this input A and creates output B. We send this dandelion tx. Assume another dandelion tx spends {A,B} and creates output C, which is valid, since we use the set of outputs in the mempool and previous dandelion txs, but the tx itself is consensus invalid. Send this tx. Repeat with {A,C}->D, {A,D}->E ... for free

19:55 < sipa> i see your point.

19:55 < MarcoFalke> I hope you prove me wrong, because I also like the per peer set

19:58 < sipa> is there some rate limiting on dandelion txn per peer?

19:58 < MarcoFalke> In my implementation, yes

19:59 < sipa> is there in the BIP? (i haven't read the latest draft)

19:59 < MarcoFalke> not explicitly mentioned

20:00 < MarcoFalke> Maybe there is in the appendix (reference implementation), haven't looked too closely at that, though

20:08 < sipa> if you disallow replacement of dandelion txn, it becomes a lot easier

20:08 < MarcoFalke> yeah, but we don't want to kill rbf for dandelion txs

20:08 < sipa> and perhaps that's not crazy; you can replace, but first need to wait until the dandelion relay has settled into the mempool

20:08 < MarcoFalke> I'd rather enforce rbf

20:09 < MarcoFalke> (which is what my cache is effectively doing, I think)

20:09 < sipa> but you don't support dependencies between dandelion txn, or do you?

20:09 < MarcoFalke> nope

20:10 < MarcoFalke> You'd have to use rbf to "eat up" all dependencies

20:12 < sipa> replacement generally seems to be something that happens in the scale of hours, and certainly longer than interblock time

20:12 < sipa> both in use cases and incentives

20:13 < sipa> while dependent transactions can be in the scale of seconds

20:13 < sipa> (blobs of interdependent txn)

20:13 < MarcoFalke> What about the use case of "replacement to avoid a change output-round-trip"

20:14 < MarcoFalke> i.e. avoid long chain of unconfirmed

20:14 < sipa> if you're doing that in a scale of seconds-minutes you should probably just batch better

20:16 < MarcoFalke> hmm, starting to like that idea

20:30 < gmaxwell> jonasschnelli: It's somewhat implausable to me that someone managed to make a sidechannel vulnerable chacha20 which was also fast. I'm happy to review them for it.

20:31 < gmaxwell> sipa: two layer mempool sounds hard to now screw up and accidentally leak data.

20:36 < gmaxwell> A per peer stempool (which of course shares the actual tx data itself across all peers) makes sense to me.

20:37 < gmaxwell> it it requires you augment the protocol to route the dependencies along the same path as the parent.

20:37 < gmaxwell> which might have privacy implications. .. I think none of the research on dandelion so far really considered chains of unconfirmed txn.

20:38 < gmaxwell> (I'd _generally_ expect that routing children along the same path as parents would be privacy improving, but there may be factors like leaking out of the stem at different points that have bad effects like reducing the privacy of the whole chain to that of the weakest one)

20:39 < sipa> gmaxwell: read on

20:39 < sipa> ah, you already saw the per-peer idea

20:42 < gmaxwell> I also agree that we don't need to care about stem transactions getting invalidated by mempool txn. But I think we do want to check them against each other. In particular I shouldn't be able to give you 100 distinct spends of the same coin and have you route them all out to the same peer. To send two of them to two different peers would be ducky.

20:44 < sipa> gmaxwell: yeah, if you don't care about replacing txn while they are not in the mempool that sounds easy

20:44 < sipa> it means you don't need the dependency tracking or replacement or whatever logic

20:45 < sipa> just verify against the combined set of chainstate+mempool+ peer-specific set of unconfirmed dandelion txn

20:46 < gmaxwell> right, but again, if dandelion parents are peer spectific, we must endeavor to route children along the same path as parents, or otherwise they'll propagate poorly.

20:47 < sipa> dandelion already does that; it has a per-peer destination peer

20:47 < sipa> so subsequent transactions will go to the same outgoing peer

20:47 < gmaxwell> sipa: it has _two_.

20:47 < sipa> unless there is a shuffle in between

20:47 < sipa> only one per incoming peer

20:47 < sipa> two globally

20:47 < gmaxwell> oh right, okay.

20:47 < MarcoFalke> What is the use case for tx chains of dandelion txs?

20:48 < gmaxwell> MarcoFalke: uh, being able to spend your funds without waiting for a block.

20:48 < sipa> MarcoFalke: what is the use cade for tx chains in general? :)

20:48 < sipa> same answer

20:48 < MarcoFalke> why block, you can wait for a fluff

20:48 < sipa> that's ~minute or so?

20:49 < gmaxwell> if someone pays you 1 BTC, you spend 0.1 ... now your wallet interface needs to randomly _fail_ and tell you that you can't spend again, until a fluff has happened?

20:49 < sipa> you're right, waiting for a block is not relevant here

20:49 < MarcoFalke> yeah, I mean if we don't allow replacement of dandelion txs, we might as well not allow chains

20:49 < sipa> MarcoFalke: i disagree

20:49 < MarcoFalke> and ask people to batch if the time between spends is ~1minute

20:49 < sipa> there is a different timescale

20:50 < gmaxwell> That would mean that we couldn't use dandelion as the standard way to announce transactions, if that were the decision I'd say we shouldn't bother implementing it at all.

20:50 < sipa> as i said before, i think it's reasonable if replacement only works in a timescale of minutes/hours

20:50 < gmaxwell> Ideally people sould batch, sure, but someone cannot guarentee that they won't need to make another payment 40 seconds after the last.

20:50 < sipa> but dependencies need to work in seconds

20:51 < gmaxwell> Why wouldn't we allow replacements?

20:51 < MarcoFalke> Would be more expensive to check

20:52 < MarcoFalke> potentially scales with the number of txs in this edges cache (stem)

20:52 < gmaxwell> you don't actually 'replace' the transaction, but you can relay a transaction that conflicts with the peer's stemppool if it would otherwise pass the replacement criteria.

20:52 < sipa> gmaxwell: i think that's an order of magnitude more complex to implement

20:52 < gmaxwell> how so? you have a map of tx parents. It's just like the orphan pool.

20:53 < gmaxwell> in any case, I don't see a fundimental reason to not allow replacement... it would probably be fine to skip it for now due to complexity.

20:53 < sipa> gmaxwell: the rules for replacement are a complex piece of policy.. that depends on relay fee, discard fee, mempool size, cyclic dependency checks, ...

20:54 < MarcoFalke> ^

20:54 < sipa> all of those don't really have a direct translation to multiple layers of mempool

20:54 < gmaxwell> so uh, how would we handle a dandelion txn which would be a replacement for something in the mempool?

20:55 < sipa> we shouldn't?

20:55 < MarcoFalke> That works

20:55 < gmaxwell> Then I think it's busted.

20:55 < sipa> heh?

20:55 < MarcoFalke> Of course you can replace mempool txs with dandelion txs

20:55 < sipa> oh, ugh.

20:55 < sipa> of course that needs to work

20:55 < MarcoFalke> I mean, maybe only once, but it works

20:55 < gmaxwell> again: I think we cannot make dandelion the standard way to announce txn we should not deploy it. And if it kills replacement of long ago announced txn, then we can't do that.

20:56 < sipa> right

20:56 < MarcoFalke> agree

20:56 < sipa> i don't think that's an issue though

20:56 < gmaxwell> It's simple in any case, see if ATMP would accept, and if so it's eligable for stem relay if not conflicted in the peers' stem cache.

20:56 < sipa> dandelion tx validation operates on the sum of mempool + extra tzn

20:56 < sipa> but it doesn't need to deal with replacements

20:56 < sipa> just validation against that set

20:57 < gmaxwell> also I think we can also 'support replacement' by fluffing anything that passes ATMP but conflicts with our stem cache.

20:57 < sipa> MarcoFalke gave an example above where that's busted

20:57 < MarcoFalke> sipa: I said it works

20:58 < MarcoFalke> [16:55] <MarcoFalke> That works

20:58 < sipa> oh? what about your a/b, a/c, a/d example?

20:59 < MarcoFalke> Well, that is what I meant with "I mean, maybe only once, but it works"

20:59 < gmaxwell> I'm not following.

20:59 < MarcoFalke> We fell back to the earlier discussion

20:59 < gmaxwell> okay

21:00 < MarcoFalke> [15:54] <MarcoFalke> Assume mempool has one output: A. Assume dandelion tx spends this input A and creates output B. We send this dandelion tx. Assume another dandelion tx spends {A,B} and creates output C, which is valid, since we use the set of outputs in the mempool and previous dandelion txs, but the tx itself is consensus invalid. Send this tx. Repeat with {A,C}->D, {A,D}->E ... for free

21:00 < sipa> if ATMP needs to do complex replacement checks w.r.t things already in the extra set, it becomes hard

21:00 < sipa> replacement checks against the mempool of the form "would this be accepted to the mempool" are easy

21:01 < gmaxwell> the combination of replacement and chaning is cancer. :(

21:01 < MarcoFalke> jup

21:01 < MarcoFalke> So pick one

21:02 < sipa> however, if replacement within the extra set is not allowed, it's easy enough - discard anything that conflicts with the extra set already

21:02 < gmaxwell> Well we can support replacement for non-chained, and also support chaining.

21:02 < sipa> otherwise, validate against the mempool with full policy check, getting utxos from the extra set as needed

21:02 < gmaxwell> and for the kind of replacement we don't support, I think we could still queue the transaction and not propagate it but fluff it when it times out.

21:02 < sipa> if accepted, put in the extra set (which is limited is size, and automatically ezpires through auto fluffing)

21:03 < gmaxwell> so at least chained replacements work, they just might have worse privacy/propagation.

21:03 < sipa> and fluffing is just implemented as adding to the local mempool... which means that stuff that has been invalidated by intermediate mempool action just gets ignored

21:06 < gmaxwell> so the criteria for going into the extra-set is "doesn't need a parent in the extraset and passes ATMP OR it needs a parent in the extraset, doesn't conflict with the extra set and with the parent its consensus valid/standard"

21:06 < gmaxwell> and if you get something that conflicts with the extraset, and doesn't pass ATMP, you throw it in the orphanmap. It'll get connected once the parents get fluffed.

21:07 < gmaxwell> Then: replacement works, chaining works, and chaining+replacement turns into orphans which still work after the parents fluff.

21:10 < gmaxwell> I totally agree that wallets shoudl be batching and whatnot, but consider: we don't even have a friendly way to do that... There is no dohicky in bitcoin core where you can queue a payment, have it draft it, but not send it, waiting for either more payments it can be bached with, timeout, or shutdown trigger.

21:14 < MarcoFalke> So fluffing a chained dandelion tx also fluffs its parents? (even though one of the parents might still be "traveling" on a stem)

21:16 < gmaxwell> thats why I was saying 'weakest in the chain' above. :(

21:16 < MarcoFalke> Yeah, so the suggestion would be to avoid chaining, but support it

21:17 < sipa> don't fluff things which have an unfluffed parent?

21:22 < MarcoFalke> You'd be keeping them much longer in the cache/embargo (on average) and thus use more space for chained txs than unchained ones on avg

21:23 < MarcoFalke> A child times out, but you couldn't fluff it because the parent's timeout is in the future

21:25 < sipa> i feel like there should perhaps be something where a dependency in the extra set results in the two txn being merged into a packaga

21:26 < sipa> and then have the timeout for the package become a weighted average of the inout timeouts or so

21:26 < sipa> *input

21:27 < sipa> but... complicated