#bitcoin-core-dev on 2017-05-18 — searchable irc log

00:14 < bitcoin-git> [bitcoin] sipa pushed 3 new commits to master: https://github.com/bitcoin/bitcoin/compare/bee35299716c...e317c0d19201

00:14 < bitcoin-git> bitcoin/master 9f7341b Gregory Sanders: Add witness data output to TxInError messages

00:14 < bitcoin-git> bitcoin/master 6e9e026 Gregory Sanders: Expand signrawtransaction.py to cover error witness checking

00:14 < bitcoin-git> bitcoin/master e317c0d Pieter Wuille: Merge #8384: Add witness data output to TxInError messages...

00:23 < bitcoin-git> [bitcoin] sipa pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/e317c0d19201...c33652576ce2

00:23 < bitcoin-git> bitcoin/master 1b936f5 practicalswift: Replace boost::function with std::function (C++11)

00:23 < bitcoin-git> bitcoin/master c336525 Pieter Wuille: Merge #10395: Replace boost::function with std::function (C++11)...

00:23 < bitcoin-git> [bitcoin] sipa closed pull request #10395: Replace boost::function with std::function (C++11) (master...replace-boost-function) https://github.com/bitcoin/bitcoin/pull/10395

00:36 < bitcoin-git> [bitcoin] sipa pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/c33652576ce2...ae786098bc58

00:36 < bitcoin-git> bitcoin/master ad415bc Thomas Snider: [net] Added SetSocketNoDelay() utility function

00:36 < bitcoin-git> bitcoin/master ae78609 Pieter Wuille: Merge #10061: [net] Added SetSocketNoDelay() utility function...

00:37 < bitcoin-git> [bitcoin] sipa closed pull request #10061: [net] Added SetSocketNoDelay() utility function (master...tjps_nodelay) https://github.com/bitcoin/bitcoin/pull/10061

00:38 < bitcoin-git> [bitcoin] sipa closed pull request #8952: Add query options to listunspent RPC call (master...enhancement/improve-rpc-listunspent) https://github.com/bitcoin/bitcoin/pull/8952

01:30 < bitcoin-git> [bitcoin] theuni closed pull request #10285: net: refactor the connection process. moving towards async connections. (master...connman-events6) https://github.com/bitcoin/bitcoin/pull/10285

04:50 < bitcoin-git> [bitcoin] laanwj closed pull request #10417: BIP 148 support (master...master-BIP148) https://github.com/bitcoin/bitcoin/pull/10417

05:25 < paveljanik> ... the first night when my IRC log buffer doesn't show the whole night communication...

05:25 < paveljanik> and moreover it even rolled out my TODOs 8)

05:44 < * wumpus> made some people really angry

05:51 < jcorgan> I think we need CAPRs (Contributor Activated Pull Requests) :-)

05:54 < wumpus> heh

07:53 < bitcoin-git> [bitcoin] practicalswift opened pull request #10419: [trivial] Fix two recently introduced typos (master...typos-201705) https://github.com/bitcoin/bitcoin/pull/10419

08:09 < bitcoin-git> [bitcoin] laanwj pushed 3 new commits to master: https://github.com/bitcoin/bitcoin/compare/ae786098bc58...2acface32aba

08:09 < bitcoin-git> bitcoin/master 64aa36e ロハンダル: param variables made const

08:09 < bitcoin-git> bitcoin/master d60d54d ロハンダル: merge with bitcoin core

08:09 < bitcoin-git> bitcoin/master 2acface Wladimir J. van der Laan: Merge #9750: Bloomfilter: parameter variables made constant...

08:10 < bitcoin-git> [bitcoin] laanwj closed pull request #9750: Bloomfilter: parameter variables made constant (master...bloomVar) https://github.com/bitcoin/bitcoin/pull/9750

08:13 < fanquake> What we really need is a bot that picks out every typo, spurious include and incorrect space from every new PR, and embarrassingly notifies the contributor of their transgression

08:13 < fanquake> /s

08:14 < wumpus> in triplicate, of course

08:15 < wumpus> for typos in translation strings it could even be useful

08:15 < wumpus> but in comments, bleh

08:16 < wumpus> especially if the gist of the word is clear anyway

09:18 < bitcoin-git> [bitcoin] jonasschnelli pushed 8 new commits to master: https://github.com/bitcoin/bitcoin/compare/2acface32aba...962cd3f0587e

09:18 < bitcoin-git> bitcoin/master fbf385c Jonas Schnelli: [Qt] simple fee bumper with user verification

09:18 < bitcoin-git> bitcoin/master 2ec911f Jonas Schnelli: Add cs_wallet lock assertion to SignTransaction()

09:18 < bitcoin-git> bitcoin/master 2678d3d Jonas Schnelli: Show old-fee, increase a new-fee in Qt fee bumper confirmation dialog

09:19 < bitcoin-git> [bitcoin] jonasschnelli closed pull request #9697: [Qt] simple fee bumper with user verification (master...2017/02/qt_bumpfee) https://github.com/bitcoin/bitcoin/pull/9697

09:20 < jonasschnelli> Added https://github.com/bitcoin/bitcoin/pull/10240 to the "High-priority for review" project

09:39 < timothy> hi, is there a max size for rev*.dat files?

09:40 < wumpus> timothy: afaik no, the size of the rev depends on what is in the blk*.dat which is size-limited though

09:40 < timothy> yes, max size of blk is 128 MiB

09:41 < wumpus> I suppose the rev will always be smaller than the blk

09:41 < gmaxwell> wumpus: it could be larger if you did something absurd.

09:41 < timothy> gmaxwell: absurd like what?

09:42 < gmaxwell> wumpus: e.g. say early blocks make zillions of 9999 byte outputs, spendable with OP_TRUE. then later blocks spend them and do nothing else.

09:42 < gmaxwell> That will make the rev data larger than the related blocks.

09:43 < timothy> right

09:43 < gmaxwell> you could get them up to a ratio of perhaps 230 times larger.

09:44 < timothy> still less than 4GB (max file size of FAT32)

09:44 < gmaxwell> of course none of these txn would pass standardness tests and what not... not likely to see it in mainnet, but it's possible.

09:44 < timothy> uhm no, more than that

09:44 < timothy> 30 GB

09:46 < wumpus> oh wow that's pretty bad

09:47 < wumpus> so I guess ideally the logic should be: if *either* the rev or dat reaches 128MB, roll to the next file
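
[Editor's sketch, not part of the log] The arithmetic behind the "30 GB" figure and the rollover rule wumpus proposes can be written out as below. This is only an illustration: the 230x ratio is gmaxwell's rough estimate from the discussion, and `should_roll` is a hypothetical helper, not a function in Bitcoin Core.

```python
MIB = 1024 * 1024
MAX_BLOCKFILE_SIZE = 128 * MIB  # blk*.dat cap mentioned in the discussion
WORST_CASE_RATIO = 230          # gmaxwell's rough estimate, not a measured value


def worst_case_rev_bytes(blk_bytes=MAX_BLOCKFILE_SIZE, ratio=WORST_CASE_RATIO):
    """Upper bound on rev*.dat size if undo data is `ratio` times the block data."""
    return blk_bytes * ratio


def should_roll(blk_size, rev_size, add_blk, add_undo, limit=MAX_BLOCKFILE_SIZE):
    """Hypothetical rule: move to the next file pair if *either* file would pass the limit.

    An empty file pair always accepts the block, so a single block whose undo
    data alone exceeds the limit cannot cause endless rolling.
    """
    if blk_size == 0 and rev_size == 0:
        return False
    return blk_size + add_blk > limit or rev_size + add_undo > limit


print(worst_case_rev_bytes() / 2**30, "GiB")  # 128 MiB * 230, expressed in GiB
```

With the current rule only the blk size is checked, which is why the rev file is unbounded in the pathological case described above.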

09:52 < timothy> is there any reason to use 128 MiB instead of other values?

09:52 < gmaxwell> it should be relatively small because it preallocates to reduce fragmentation. (or otherwise windows users cry)

09:52 < timothy> NTFS doesn't support fallocate or similar?

09:52 < gmaxwell> but not so small that it's making a kazillion files and causing poor file system performance.

09:53 < timothy> I mean, preallocation without really writing the bytes

09:53 < gmaxwell> I long since forgot what the tradeoff surface was on windows.

09:53 < gmaxwell> But I didn't think NTFS did sparse files.

09:54 < wumpus> I guess it could harmlessly be changed to 256, but I expect there to be no performance gain

09:54 < gmaxwell> wumpus: 60 GB rev files! :P

09:54 < wumpus> gmaxwell: yeah, after rev files are capped of course

09:59 < wumpus> with a blow-up of 230, at least one block's undo data would fit into that :-)

10:00 < gmaxwell> sipa might have more accurate figures on the worst case, but it's something around that much.

10:02 < wumpus> "Most modern file systems support sparse files, including most Unix variants and NTFS, but notably not Apple's HFS+. Sparse files are commonly used for disk images, database snapshots, log files and in scientific applications."

10:03 < wumpus> so MacOS is the problem here, not windows

10:10 < gmaxwell> The extra question though is if they prevent fragmentation.

10:12 < wumpus> I don't know, is there such a guarantee for UNIX filesystems?

10:14 < wumpus> oh I was confused, this isn't about sparse files at all but posix_fallocate

10:15 < wumpus> in which case the disk space is reserved explicitly

10:17 < gmaxwell> sorry, confusion I caused, late here.

10:19 < wumpus> we apparently do have an implementation of AllocateFileRange for windows, but as I understand from MSDN it might create a sparse file (SetEndOfFile sets the file size, but not the "allocation size"), so this confusion is more general :)

10:22 < wumpus> the documentation is confusing though so I'm not sure
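
[Editor's sketch, not part of the log] The distinction being untangled here, between merely setting a file's length (which may leave it sparse) and explicitly reserving disk space, can be shown with a small example. It assumes a POSIX system where Python exposes `os.posix_fallocate`; the function names are hypothetical and this is not how Bitcoin Core's AllocateFileRange is implemented.

```python
import os
import tempfile


def make_sparse(path, size):
    """Only set the file length; many filesystems allocate no blocks for this."""
    with open(path, "wb") as f:
        f.truncate(size)


def make_preallocated(path, size):
    """Ask the OS to reserve the space up front, where the call is available."""
    with open(path, "wb") as f:
        if hasattr(os, "posix_fallocate"):
            os.posix_fallocate(f.fileno(), 0, size)
        else:
            # Fallback for platforms without posix_fallocate: length only.
            f.truncate(size)


with tempfile.TemporaryDirectory() as d:
    sparse = os.path.join(d, "sparse.dat")
    prealloc = os.path.join(d, "prealloc.dat")
    make_sparse(sparse, 1024 * 1024)
    make_preallocated(prealloc, 1024 * 1024)
    # Both report the same length; they may differ in blocks actually allocated.
    print(os.path.getsize(sparse), os.path.getsize(prealloc))
```

Both files report the same size, so the difference only shows up in allocated blocks, which depends on the filesystem.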

12:59 < bitcoin-git> [bitcoin] fanquake closed pull request #9427: Use compact blocks for blocks that have equal work to our active tip (master...UseCmpctBlockForCompetingBlocks) https://github.com/bitcoin/bitcoin/pull/9427

14:30 < bitcoin-git> [bitcoin] ryanofsky opened pull request #10420: Add Qt tests for wallet spends & bumpfee (master...pr/btest) https://github.com/bitcoin/bitcoin/pull/10420

15:27 < BartokIT> I want a clarification about the BIP32, is this the correct group

15:32 < BartokIT> The BIP32 allow to audit sharing the master public key

15:33 < BartokIT> This is mentioned in the mediawiki

15:35 < BartokIT> But if we use the hardened key this is impossible? Is this wrong?

16:36 < bitcoin-git> [bitcoin] practicalswift opened pull request #10421: [qt] Remove excess logic (master...if-expr-return-true-else-return-false) https://github.com/bitcoin/bitcoin/pull/10421

16:39 < bitcoin-git> [bitcoin] morcos opened pull request #10422: Fix timestamp in fee estimate debug message (master...fixtimeunits) https://github.com/bitcoin/bitcoin/pull/10422

16:51 < sipa> wumpus, gmaxwell: the 128 MiB is a tradeoff between fragmentation overhead and granularity for pruning

16:51 < sipa> the very first versions of the patch that introduced it (ultraprune) just used a single file per block

16:51 < sipa> but that was very slow

16:54 < luke-jr> sipa: I wonder if pruning ought to perhaps consider punching sparse holes?

16:55 < sipa> luke-jr: i think we can also just reduce that 128MiB number significantly

16:55 < morcos> i think 128 works fine for now doesn't it?

16:55 < luke-jr> maybe. but some filesystems perform differently than others..

16:55 < morcos> perhaps if we properly introduce sharding, then we need to rethink the design

16:55 < luke-jr> btrfs is annoyingly slow, I've found.

17:03 < wumpus> sipa: yes, it seems a good compromise

17:03 < wumpus> AFAIK monero stores all blocks in a single lmdb

17:04 < wumpus> why reduce the 128? agree with morcos that it works fine

17:06 < wumpus> for pruning granularity it's also good enough, given how much space the utxo database takes, a variance of 128mb+~16mb (usual rev files) doesn't seem too bad

17:19 < wumpus> although it could be worse than that in some cases depending on how blocks are distributed over the files

17:31 < sipa> wumpus: ok

17:49 < wumpus> and 128mb is at most 128 blocks, less than a day of blocks, even less than that w/ witness data, it's not that much

17:59 < bitcoin-git> [bitcoin] laanwj pushed 7 new commits to master: https://github.com/bitcoin/bitcoin/compare/962cd3f0587e...28c6e8d71b3a

17:59 < bitcoin-git> bitcoin/master d8e03c0 Jack Grigg: torcontrol: Improve comments

17:59 < bitcoin-git> bitcoin/master 29f3c20 Jack Grigg: torcontrol: Add unit tests for Tor reply parsers

17:59 < bitcoin-git> bitcoin/master d63677b Jack Grigg: torcontrol: Fix ParseTorReplyMapping...

18:00 < bitcoin-git> [bitcoin] laanwj closed pull request #10408: Net: Improvements to Tor control port parser (master...torcontrol-parser-patches) https://github.com/bitcoin/bitcoin/pull/10408

18:13 < luke-jr> wumpus: it's much more than 128 blocks early in the chain?

18:13 < sipa> yes

18:13 < wumpus> luke-jr: of course, but it blasts past that anyway

18:14 < wumpus> most pruning nodes will be - more or less - up to date

18:20 < wumpus> but yes it's easy to forget that once, blocks were that far from full

18:51 < bitcoin-git> [bitcoin] instagibbs closed pull request #9102: Really don't validate genesis block (master...dontvalidategenesis) https://github.com/bitcoin/bitcoin/pull/9102

19:01 < luke-jr> meeting?

19:01 < jonasschnelli> jup

19:01 < gmaxwell> #bitcoin-core-dev Meeting: wumpus sipa gmaxwell jonasschnelli morcos luke-jr btcdrak sdaftuar jtimon cfields petertodd kanzure bluematt instagibbs phantomcircuit codeshark michagogo marcofalke paveljanik NicolasDorier

19:01 < sdaftuar> hello

19:01 < instagibbs> here

19:01 < CodeShark> hi

19:01 < wumpus> #startmeeting

19:01 < lightningbot> Meeting started Thu May 18 19:01:43 2017 UTC. The chair is wumpus. Information about MeetBot at http://wiki.debian.org/MeetBot.

19:01 < lightningbot> Useful Commands: #action #agreed #help #info #idea #link #topic.

19:02 < kanzure> hi.

19:02 < sipa> yow

19:02 < cfields> hi

19:02 < CodeShark> I just have one topic for today, but I'll let others suggest theirs

19:02 < bitcoin-git> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/28c6e8d71b3a...ea6fde3f1d26

19:02 < bitcoin-git> bitcoin/master 618d07f Jorge Timón: MOVEONLY: tx functions to consensus/tx_verify.o...

19:02 < bitcoin-git> bitcoin/master ea6fde3 Wladimir J. van der Laan: Merge #8329: Consensus: MOVEONLY: Move functions for tx verification...

19:02 < wumpus> topics?

19:02 < CodeShark> #topic clientside filtering

19:02 < jonasschnelli> ack

19:03 < luke-jr> BIP148

19:03 < luke-jr> (after clientside filtering etc)

19:03 < wumpus> I don't think that works CodeShark, I think only the chair can set the topic

19:03 < wumpus> #topic clientside filtering

19:03 < CodeShark> :)

19:04 < CodeShark> so there are several filtering options with different performance tradeoffs

19:04 < CodeShark> bloom filters have been typically considered - but there are some other ideas that might be worth considering

19:04 < jonasschnelli> Filter for BDF, read gmaxwell's reply: https://lists.linuxfoundation.org/pipermail/bitcoin-dev/2016-May/012637.html

19:05 < CodeShark> roasbeef has worked on an idea based on golomb coded sets

19:05 < jonasschnelli> «The most efficient data structure is similar to a bloom filter, but you use more bits and only one hash function. The result will be mostly zero bits. Then you entropy code it using RLE+Rice coding or an optimal binomial packer (e.g. https://people.xiph.org/~greg/binomial_codec.c).»

19:05 < sipa> yes?

19:05 < CodeShark> gcs sacrifices CPU for space

19:06 < jonasschnelli> I think what we would need is data about the filter size for the last 100000 blocks...

19:06 < CodeShark> filters are smaller, but queries are more computationally expensive

19:06 < gmaxwell> CodeShark: CPU for who when is always the question.

19:06 < roasbeef> jonasschnelli: I have that

19:06 < jonasschnelli> roasbeef: Oo... share?

19:06 < CodeShark> hey, roasbeef! :)

19:07 < gmaxwell> what BIP37 does is very cpu expensive for the serving party, which is why it leads to dos attacks.

19:07 < gmaxwell> with any of the map based proposals that goes away and the cost to construct is not very relevant.

19:07 < CodeShark> constructing a gcs isn't very computationally expensive

19:07 < sipa> more so than bip37

19:07 < gmaxwell> Similarly, cost to lookup is not very relevant, the receiver will decode one per block.

19:07 < CodeShark> the queries are a little more computationally expensive than bloom filters, but that is done on client

19:07 < roasbeef> jonasschnelli: i have a csv file of stats for the entire chain, can easily get the last 100k out of it, the csv file itself is 14MB

19:07 < gmaxwell> sipa: maybe the lots of hash functions make it more expensive than you might guess.

19:08 < jonasschnelli> roasbeef: I take the complete one, thanks. :)

19:08 < CodeShark> but gcs only needs to be computed once per block

19:08 < sipa> CodeShark: do you suggest this as something that blocks commit to?

19:08 < sipa> or something that a full node would precompute and store?

19:08 < roasbeef> with bloom filters, there are several hash functions, with the gcs based approach, there's a single hash function. but the set itself is compressed, so you need to decompress as you query
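
[Editor's sketch, not part of the log] A Golomb-coded set as roasbeef describes it can be illustrated roughly as follows: hash every item once into a range of size N * 2^P, sort, and Golomb-Rice code the gaps between consecutive values; a query decodes the stream sequentially. This is a minimal illustration only. The use of SHA-256, the bit-string representation, and all function names are assumptions made for readability, not the construction from roasbeef's draft.

```python
import hashlib


def hash_to_range(item, f):
    """Single hash function: map an item (bytes) to an integer in [0, f)."""
    digest = hashlib.sha256(item).digest()
    return int.from_bytes(digest[:8], "big") % f


def gcs_build(items, p):
    """Return (bits, n): a '0'/'1' string of Rice-coded gaps and the item count.

    Assumes p >= 1. The false-positive rate is roughly 2**-p.
    """
    items = set(items)
    n = len(items)
    if n == 0:
        return "", 0
    f = n << p
    values = sorted({hash_to_range(it, f) for it in items})
    out = []
    prev = 0
    for v in values:
        delta = v - prev
        prev = v
        q, r = delta >> p, delta & ((1 << p) - 1)
        out.append("1" * q + "0")                 # quotient in unary
        out.append(format(r, "0{}b".format(p)))   # remainder in p bits
    return "".join(out), n


def gcs_contains(bits, n, p, item):
    """Decode the set sequentially until the target is reached or passed."""
    if n == 0:
        return False
    target = hash_to_range(item, n << p)
    pos = 0
    value = 0
    while pos < len(bits):
        q = 0
        while bits[pos] == "1":
            q += 1
            pos += 1
        pos += 1                                  # skip the terminating '0'
        r = int(bits[pos:pos + p], 2)
        pos += p
        value += (q << p) | r
        if value == target:
            return True
        if value > target:
            return False
    return False


scripts = [b"spk-%d" % i for i in range(100)]
bits, n = gcs_build(scripts, 20)
print(len(bits) // 8, "bytes for", n, "items")
print(gcs_contains(bits, n, 20, b"spk-7"))
```

The sequential decode in `gcs_contains` is the "decompress as you query" cost mentioned above, and it is paid by the client rather than the serving node.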

19:08 < CodeShark> the latter for starters

19:08 < sipa> i suppose the last

19:08 < jonasschnelli> precomp and store

19:08 < roasbeef> sipa: something a node would precompute and store, to start

19:08 < sipa> okay

19:09 < sipa> what would be stored in the set?

19:09 < gmaxwell> I'm dubious that we'd get state of the art performance from golomb coding, but interested to see.

19:09 < jonasschnelli> Can be done after the block has been connected

19:10 < gmaxwell> sipa: I believe the discussion is the 'bloom map' proposal.

19:10 < CodeShark> roasbeef was suggesting two filters - one for super lightweight clients, another for clients that require more sophisticated queries

19:10 < jonasschnelli> What are the differences? The tx template types?

19:10 < CodeShark> the former would only encode UTXOs, the latter would also encode witness data

19:10 < gmaxwell> encode witness data?!

19:11 < CodeShark> well, if you want to query for whether a particular execution path has been taken - necessary for things like lightning

19:11 < roasbeef> basic has: outpoints, script data pushes. extended has: witness stack, sig script data pushes, txids

19:11 < sipa> but do you need to _search_ based on witness data?

19:11 < sipa> i understand you may want to see it

19:12 < sipa> but you know what UTXOs to query for, no?

19:12 < CodeShark> I'm guessing revocation enforcement might be outsourced to nodes that cannot know the exact transaction format - only some key

19:12 < CodeShark> roasbeef, wanna comment?

19:12 < gmaxwell> Yes, requesting it is fine, searching on it? Be careful: it has serious long term implications if you expect that data will even be readily available. I am doubtful five years from now most nodes will have any witness data from more than a year back.

19:13 < gmaxwell> (witness data also means non-utxo transaction data in that above comment)

19:13 < gmaxwell> aside, I'm glad to hear this discussion has moved past just replicating the BIP37 mechanism.

19:13 < roasbeef> rationale to include witness data was to allow light clients to efficiently scan for things like reusable addresses (stealth addresses), i think my model of how folks do that on-chain these days is dated though, i guess they stuff a notification in OP_RETURNs?

19:14 < sipa> i'm not sure that is worth the cost

19:14 < sipa> also, individual scriptPubKey pushes?

19:14 < sipa> if anything, my preference would just be outpoints and full scriptPubKeys

19:14 < roasbeef> they do make the extended filters quite a bit bigger (i have testnet data also)

19:14 < gmaxwell> well no one does those things in practice, and everyone who previously has implemented them that I'm aware of performed all scanning via a centralized server, even though they could have matched on the OP_RETURN.

19:15 < CodeShark> we can always start with the simplest minimal filter and then add more if we find use cases

19:15 < roasbeef> gmaxwell: well the intention was to allow the new light client mode to actually make using them practical without delegating to a central server

19:15 < gmaxwell> roasbeef: that was already possible with BIP37 and the prior design.

19:15 < jonasschnelli> Can we start with adding the same elements that bip37 does?

19:15 < roasbeef> sipa: so including the op-codes?

19:16 < gmaxwell> Usability of SPV clients that scan using BIP37 is really poor though, thus the rise of electrum.

19:16 < sipa> roasbeef: bah, and 1) further encourage op_returns and 2) make them even more expensive for full nodes?

19:16 < gmaxwell> jonasschnelli: the things BIP37 added largely turned out to be a mistake that really degraded BIP37 so I hope a new proposal would do less.

19:16 < sipa> well the degradation problem doesn't exist here

19:16 < sipa> as the filter is not cumulative

19:17 < luke-jr> sipa: is there a way to do it without OP_RETURN?

19:17 < gmaxwell> yes, but you still need a bigger filter for same FP ratio. It's just less awful. :)

19:17 < sipa> luke-jr: sure, payment protocol like systems

19:17 < luke-jr> well, true, but then you don't need the crypto stuff for it

19:17 < sipa> i think that's a separate discussion and probably not one for here

19:17 < luke-jr> k

19:17 < CodeShark> for starters we should look at the most basic use cases

19:18 < gmaxwell> Yea, we should have a subcommittee. :P

19:18 < sipa> jonasschnelli, CodeShark, roasbeef: is there a use case for individual pushes in scriptPubKeys?

19:18 < jonasschnelli> the action is probably to define a set of filters and create a spec that leaves room for future filter types

19:18 < CodeShark> jonasschnelli: indeed

19:18 < sipa> especially in a world where everything is P2PKH/P2SH/P2WPKH/P2WSH

19:18 < CodeShark> once we have the framework for adding new filters, it should be easy to do

19:18 < gmaxwell> jonasschnelli: multiple filter types can result in n-fold overhead, which will be a significant pressure against defining many.

19:18 < roasbeef> sipa: sure, the filter is smaller if one doesn't include the op-code as well

19:19 < sipa> roasbeef: eh?

19:19 < sipa> i must be misunderstanding something then

19:19 < roasbeef> oh you mean insert the _entire_ thing

19:19 < sipa> yes, just the whole scriptPubKey

19:19 < sipa> 1 element per output

19:19 < sipa> well, and another one for the outpoint

19:20 < roasbeef> mhmm, only advantage to data pushes in that case is in a world where bare multi-sig is actually used

19:20 < gmaxwell> sipa: wait why?

19:20 < sipa> gmaxwell: why what?

19:20 < gmaxwell> roasbeef: yes, which we don't expect that world to exist.

19:20 < sipa> roasbeef: yes, the reason it's in BIP37 is for bare multisig support... but i don't think that's very interesting now

19:20 < gmaxwell> sipa: I expect one insert per output. The scriptpubkey. Why would you insert anything else (for normal functionality)

19:21 < gmaxwell> s/now/ever/ but hindsight is 20/20

19:21 < gmaxwell> blockchain isn't a message bus. :P

19:21 < sipa> i guess if you want to look for an outpoint, you can always search for its scriptPubKey

19:21 < gmaxwell> sipa: right.

19:21 < gmaxwell> okay.

19:21 < sipa> in BIP37 there was a reason to separate it, as it would be less bandwidth if you wanted a specific outpoint, despite there being many scriptPubKeys with it

19:22 < sipa> but here, that reason doesn't really matter i think?

19:22 < sipa> roasbeef: what do you think? just a filter with scriptPubKeys?

19:22 < gmaxwell> sipa: the privacy leak from correlated data still exists in map proposals, based on what blocks you choose to scan further, though much less severe than BIP37. Keep that in mind.

19:22 < roasbeef> if it's just spk's, then how does one query the filters to see if an outpoint has been spent?

19:23 < sipa> roasbeef: by querying for the scriptPubKey that outpoint created

19:23 < sipa> roasbeef: which you will always know, i think?

19:23 < gmaxwell> roasbeef: by looking for its spk.

19:24 < roasbeef> sipa: which would require adding parts of the witness/sigScript though?

19:24 < sipa> ?

19:24 < sipa> i'm confused

19:24 < roasbeef> me too :)

19:24 < CodeShark> txhash:txindex -> scriptPubKey

19:24 < sipa> maybe we should do this outside of the meeting

19:24 < gmaxwell> roasbeef: has nothing to do with the witness. You validate the transaction, you know the content of the outpoint.

19:24 < sipa> it seems we're doing protocol design here now

19:25 < gmaxwell> 12:17 < gmaxwell> Yea, we should have a subcommittee. :P

19:25 < CodeShark> anyhow, we don't need to decide the specifics of what goes in the filter right now

19:25 < sipa> agree

19:25 < roasbeef> ok, sure, to summarize: we have working code for the construction, have nearly finished integrating it into lnd, have a BIP draft that should be ready by next week-ish (will also integrate feedback from this discussion)

19:25 < CodeShark> I like the idea of creating a framework that allows us to arbitrarily define filters later on

19:25 < sipa> i think it's an interesting thing to research further

19:25 < sipa> not sure what else needs to be discussed here

19:25 < gmaxwell> well we aren't deciding anything right now... :)

19:25 < gmaxwell> CodeShark: I do not.

19:26 < jonasschnelli> BTW: kallewoof has a draft impl. on serving filters over the p2p (though bloom): https://github.com/kallewoof/bitcoin/pull/1/files (in case someone wants to drive this further)

19:26 < gmaxwell> CodeShark: there is an n-fold cost to additional filters. It is unlikely to me that nodes would be willing to carry arbitrarily many in the future.

19:26 < gmaxwell> CodeShark: there might be a reasonable case for more than one, sure.

19:26 < gmaxwell> In any case, I think this is good to open up more discussion and participation.

19:27 < gmaxwell> I'm quite happy to hear that there is activity in this area and I'd like to help.

19:27 < jonasschnelli> gmaxwell: I see this point but I don't think it would hurt if the specs would allow new filter types

19:27 < CodeShark> gmaxwell: point is the code complexity to support adding arbitrary filters isn't that great and it avoids the bikeshed in writing up the initial BIP ;)

19:27 < gmaxwell> jonasschnelli: yea sure, whatever, but that's just a type parameter.

19:27 < jonasschnelli> gmaxwell: right.

19:28 < sipa> end of topic?

19:28 < * roasbeef> now understands what sipa was referring to

19:28 < wumpus> I don't think any other topics have been proposed?

19:28 < gmaxwell> you're gonna regret saying that.. :P

19:29 < gmaxwell> quick: high priority PRs.

19:29 < wumpus> nearly halfway time

19:29 < jonasschnelli> kallewoof also had an approach that peers could serve digests of filters to check the integrity among different peers

19:29 < wumpus> #topic high priority PRs

19:29 < sipa> small topic for later: bytes_serialized

19:29 < gmaxwell> Congrats Morcos on the merge of the new fee estimator stuff.

19:29 < jonasschnelli> \o/

19:29 < sipa> it will need cleanups, but that's fine

19:29 < morcos> thanks, quick PSA.. if you run master now it'll blow away your old fee estimates, you might want to make a copy

19:30 < wumpus> quite a few high priority PRs were merged this week, so there's place for new ones, please speak up if there's any that block further work for you

19:30 < gmaxwell> "micros" notwithstanding.

19:30 < morcos> i'm hoping to get an improvement which makes the transition more seamless before 0.15

19:30 < sdaftuar> sipa: i'm basically done reviewing per-txout (#10195), looks awesome! running some benchmarks now.

19:30 < gribble> https://github.com/bitcoin/bitcoin/issues/10195 | Switch chainstate db and cache to per-txout model by sipa · Pull Request #10195 · bitcoin/bitcoin · GitHub

19:30 < sipa> sdaftuar: thank you so much

19:31 < gmaxwell> I've been testing per-txout. Survived a few crashes so far.

19:31 < wumpus> I've been testing #10195 for a while, haven't run into any problems

19:31 < gribble> https://github.com/bitcoin/bitcoin/issues/10195 | Switch chainstate db and cache to per-txout model by sipa · Pull Request #10195 · bitcoin/bitcoin · GitHub

19:31 < instagibbs> morcos, dont look now but it's being used in anger on multiple large wallet services :)

19:31 < sipa> instagibbs: "in anger" ?

19:31 < instagibbs> "doing it live"

19:32 < gmaxwell> "hold my beer"

19:32 < morcos> heh.. fools, the whole reason to merge it into master was to get it some more testing

19:32 < gmaxwell> luke-jr: have you done the multiwallet rebasing?

19:32 < jtimon> there's not many explicit acks on https://github.com/bitcoin/bitcoin/pull/10339

19:32 < luke-jr> I didn't realise jtimon's PR was merged?

19:32 < instagibbs> morcos, well, other services were doing crazy things.. (ok enough off-topic)

19:33 < jtimon> luke-jr: which one?

19:33 < wumpus> so, ok, any new ones?

19:33 < luke-jr> jtimon: args refactor

19:33 < ryanofsky> i'd like more review on #10295, it is blocking my ipc prs

19:33 < gribble> https://github.com/bitcoin/bitcoin/issues/10295 | [qt] Move some WalletModel functions into CWallet by ryanofsky · Pull Request #10295 · bitcoin/bitcoin · GitHub

19:33 < sipa> ryanofsky: ack, i started reviewing that

19:33 < jonasschnelli> I have added #10240 today

19:33 < gribble> https://github.com/bitcoin/bitcoin/issues/10240 | Add HD wallet auto-restore functionality by jonasschnelli · Pull Request #10240 · bitcoin/bitcoin · GitHub

19:33 < sipa> jonasschnelli: sgtm

19:33 < jtimon> luke-jr: I see #9494

19:33 < gribble> https://github.com/bitcoin/bitcoin/issues/9494 | Introduce an ArgsManager class encapsulating cs_args, mapArgs and mapMultiArgs by jtimon · Pull Request #9494 · bitcoin/bitcoin · GitHub

19:33 < luke-jr> ok, looks like 4 days ago it was; I'll rebase multiwallet then

19:33 < sipa> luke-jr: thank you

19:34 < jonasschnelli> luke-jr: great. I promise to test

19:34 < gmaxwell> luke-jr: thank you!

19:35 < jonasschnelli> ryanofsky: will do the 10295 review. Thanks for the info

19:35 < sipa> short point: wrt the pruned-node-serving, see http://bitcoin.sipa.be/depths.png

19:35 < wumpus> added 10295 and 10339

19:35 < wumpus> #topic pruned-node serving

19:35 < sipa> see that graph, the title is wrong

19:35 < jonasschnelli> Currently overhauling the BIP

19:35 < sipa> it shows the relative depth of each block downloaded from my node _excluding_ compact blocks

19:36 < sipa> gmaxwell did some statistical analysis on it

19:36 < gmaxwell> Sipa's data is interesting. 144 is too small for sure. 1008 is fine. I'm of the view that we don't need more than a dozen or so blocks of headroom. I think the BIP should be written based on what you should keep. How you decide where to fetch depends on exactly what you're doing.

19:37 < stickcuck> hm

19:37 < gmaxwell> I found no real evidence of a real preference for N weeks in sipa's data, but rather, advantages for doing 1-day 2-day 3-day ... etc. But 'day' is a lot more than 144 blocks, because of hashrate increases.

19:38 < gmaxwell> You can process the data to roughly remove IBDing peers and the fall off is pretty stark.

19:38 < gmaxwell> note sipa's graph ignores depth 0.

19:38 < sipa> it'd be a hockeystick if it included 0

19:38 < jonasschnelli> What would you recommend for "day" instead 144, calc in the historical hashrate increase?

19:38 < gmaxwell> also 0 data is inaccurate because it excludes compact blocks

19:39 < sipa> gmaxwell: didn't you suggest 288?

19:39 < gmaxwell> jonasschnelli: I think we should make the first threshold 288. It's more than enough to cover a 'day' in practice.

19:39 < jonasschnelli> 288 and 1008...

19:39 < jonasschnelli> But then the current minimum (prune=550) would not allow to signal the LOW mode?

19:40 < morcos> the current minimum is 288

19:40 < gmaxwell> and then peers should estimate what they need (based on time, or headers if they have them) and choose where to connect. The estimate should be conservative but it doesn't need to be a 100 block headroom, a dozen blocks should be fine. If you get headers and find that you need more, you'll disconnect and go elsewhere.

19:40 < jonasschnelli> Or is 288 including headroom?

19:40 < morcos> the 550 is just so you don't set a prune limit which you have no hope of respecting

19:40 < gmaxwell> the minimum is 288 blocks.

19:40 < morcos> it's out of date with segwit

19:40 < gmaxwell> and we'll blow over the prune setting to preserve 288 blocks.

19:40 < morcos> i think the calculation is presented in the code comments

19:41 < jonasschnelli> Yes. 288 is the minimum. So we should remove the BIP headroom/buffer from the BIP

19:41 < gmaxwell> I think eventually we should be changing the prune setting to be enum-like but that's another matter.

19:41 < gmaxwell> jonasschnelli: I think the BIP shouldn't have any buffer. "You store X from your tip" "You store Y from your tip" it can then make advice to users on how to choose connections. but the requirement is just what you promise to store.

19:42 < jonasschnelli> gmaxwell: ack

19:43 < gmaxwell> The advice can say to use the best info you have available (time or headers if you have them) to figure out what you need, and then give enough headroom maybe 6 or 12 blocks that you can fetch parents. The cost of connecting to someone that doesn't have what you need is not that great. You'll request headers from them, learn you need blocks they don't have and you'll disconnect them and connect

19:43 < gmaxwell> to someone else.

19:44 < jonasschnelli> For the 1008 I guess the BIP can no longer state blocks for 1 week. Now the question is to use 2016 or say it 3.5 days..

19:44 < sipa> ?

19:44 < sipa> i think it should just say 1008 or 2016 blocks or so, and not make any connection with time

19:44 < jonasschnelli> From what I understood is that 144 is too little for a day regarding the increasing hash-rate

19:44 < gmaxwell> jonasschnelli: I'll catch up with you later today, I don't have my processed results in front of me. But I think I found that after eliminating IBDs there were very few fetches in sipa's data past 1000 blocks deep. And indeed, it shouldn't mention time.

19:45 < jonasschnelli> But light client implementations are really looking for "days" rather than blocks.. but, sure, they can do their homework... but would have been nice to mention day values in the BIP.

19:45 < jonasschnelli> But maybe they are too inaccurate

19:45 < gmaxwell> The bit(s) should just be defined as "I claim I will keep at least X blocks deep from my tip, maybe I keep more, maybe not."

19:45 < sipa> jonasschnelli: light clients know how many blocks they are behind after header sync

19:45 < gmaxwell> jonasschnelli: anyone using these bits will fetch headers.

19:46 < jonasschnelli> Indeed.... okay. Got it.

19:46 < gmaxwell> now, before you connect you won't have headers and you'll need to make a time based guess. If you guess wrong you'll need to disconnect and go elsewhere. Not the end of the world.

19:47 < jonasschnelli> Yes. I agree on that. Re-connecting shouldn't be hard.

19:47 < jonasschnelli> Maybe even an additional dns query may be involved (in case you filter)

19:48 < sipa> even if it happens, it'll happen just once

19:48 < jonasschnelli> Yeah,... shouldn't be a problem for clients

19:48 < sipa> because even if you connect to a peer that does not have enough blocks, they'll have the headers to teach you how many blocks you are behind

19:48 < sipa> so i don't think it's such a big issue

19:49 < sipa> done topic?

19:49 < gmaxwell> I think I mentioned it on the list, but it should be clear that these bits should still mean that you can serve headers for the whole chain.

19:49 < wumpus> #topic bytes_serialized (sipa)

19:49 < sipa> thanks

19:49 < gmaxwell> Kill with fire (sorry wumpus)

19:49 < jonasschnelli> gmaxwell: seems obvious.. but I'll mention it

19:49 < gmaxwell> :P

19:49 < sipa> so currently gettxoutsetinfo has a field called bytes_serialized

19:50 < sipa> which is based on some theoretical serialization of the utxo set data

19:50 < wumpus> I think there's something to be said for a neutral way of representing the utxo size, that doesn't depend on estimates of a specific database format

19:50 < sipa> wumpus: agree with that

19:50 < gmaxwell> what I said to sipa the other day was that if we list the total bytes in values and the txout counts, that lets you come up with whatever kind of serialized size estimate you want.

19:50 < sipa> but would you be fine with it just being the size of keys+values in a neutral format, _not_ accounting for the leveldb prefix compression?

19:50 < wumpus> sipa: yes

19:50 < gmaxwell> If you want you could multiply that count by 36 and add the values and that gives you the size for the dumbest serialization that hopefully no one would use.
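
As a worked version of that arithmetic: the 36 is presumably a 32-byte txid plus a 4-byte output index per entry, which is an interpretation rather than something stated in the log. The function name and the example figures below are made up for illustration and are not real chain statistics.

```python
# Assumed key size per txout: 32-byte txid + 4-byte output index.
OUTPOINT_BYTES = 32 + 4


def naive_serialized_size(txout_count, total_value_bytes):
    """Size of the 'dumbest' serialization: a full 36-byte key for
    every txout plus the total bytes of all stored values."""
    return txout_count * OUTPOINT_BYTES + total_value_bytes


# Made-up inputs: 1000 txouts whose values total 40,000 bytes.
example = naive_serialized_size(1000, 40_000)
```

With those made-up inputs the keys contribute 36,000 bytes and the values 40,000, for 76,000 bytes in total. Any other size estimate can be derived from the same two reported numbers by swapping in a different per-key cost.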

19:50 < luke-jr> values counted as 8 bytes, or compressed?

19:51 < wumpus> sipa: that'd be fine really, and the format change provides opportunity to change the definition

19:51 < sipa> wumpus: agree

19:51 < gmaxwell> okay if wumpus and sipa agree I'll shutup.

19:51 < sipa> luke-jr: no strong opinion. do you?

19:51 < luke-jr> sipa: I don't think the compression should be exposed, ideally.

19:51 < sipa> luke-jr: seems fair

19:51 < gmaxwell> wumpus: the only concern I had with a really neutral figure is that it's misleading.

19:51 < luke-jr> not a strong opinion though

19:51 < wumpus> luke-jr: just a fixed size seems ok to me

19:52 < wumpus> luke-jr: that's more future proof likely

19:52 < wumpus> luke-jr: so we can have a statistic to compare over time

19:52 < morcos> can't we output more than one thing?

19:52 < luke-jr> wumpus: indeed

19:52 < gmaxwell> e.g. a naive serialization would have 32 bytes for txid, but the reality is probably under 16 due to sharing. But as long as it doesn't require scanning that data I guess I don't care.

19:52 < sipa> morcos: so #10396 reports the actual disk usage

19:52 < gribble> https://github.com/bitcoin/bitcoin/issues/10396 | Report LevelDB estimate for chainstate size in gettxoutsetinfo by sipa · Pull Request #10396 · bitcoin/bitcoin · GitHub

19:52 < sipa> morcos: and the total number of utxos is also reported

19:53 < wumpus> we should definitely report the actual disk usage too!

19:53 < morcos> yeah i'm sorry if i'm behind, but i think actual disk usage is useful, even if we want this .. ok, that's all i was saying

19:53 < luke-jr> agreed

19:53 < sipa> yes yes, absolutely

19:53 < sipa> the point is that the current bytes_serialized tries to mimic disk usage, but fails

19:53 < gmaxwell> the leveldb usage is a noisy thing that goes up and down based on the mood of the table compacting gods.

19:53 < luke-jr> (although I guess users can just du the directory?)

19:53 < sipa> and will fail even more post per-txout

19:54 < sipa> so if we drop the requirement that bytes_serialized has anything to do with disk usage, all is good

19:54 < wumpus> gmaxwell: yep, it's less useful for reporting as statistics

19:54 < wumpus> sipa: indeed; I never assumed it did really

19:54 < wumpus> to me it was just 'serialization size of utxo in an arbitrary, but constant, format'

19:55 < phantomcircuit> huh what im here

19:55 < wumpus> sipa: would make sense to rename the field too

19:55 < sipa> wumpus: ok, so 10195 removes bytes_serialized - i'll create a separate PR afterwards to add a (new) bytes_serialized again

19:55 < sipa> wumpus: agree

19:55 < gmaxwell> wumpus: it will be odd if the serialized size is larger than the database but not that odd.

19:55 < sipa> gmaxwell: at least it will be obvious that it has nothing to do with it then!

19:55 < wumpus> (after all we don't want people to report weird jumps in statistics, renaming the field is a good hint)

19:56 < luke-jr> sipa: maybe it should be renamed?

19:56 < sipa> luke-jr: yes, it should be

19:56 < wumpus> "bogosize"

19:56 < gmaxwell> bogosize++

19:56 < sipa> hash_serialized is renamed too

19:56 < sipa> hahaha bogosize

19:56 < sipa> ok, deal

19:56 < gmaxwell> should be in nibbles.

19:56 < gmaxwell> :P

19:56 < luke-jr> lol

19:56 < wumpus> :D

19:56 < sipa> in nepers

19:56 < instagibbs> buy one get one size?

19:56 < gmaxwell> ehats the base e entropy unit?

19:56 < sipa> gmaxwell: yes

19:57 < luke-jr> can I add an OP_CHECKBOGOSIZE? *hides*

19:57 < gmaxwell> Good. (that was supposed to be a "Whats?" but seems you were a step ahead of me)

19:57 < sipa> ah, no, nats

19:57 < sipa> nepers are just for ratios, like db

19:57 < sipa> </offtopic>

19:58 < wumpus> time to close the meeting I think

19:58 < instagibbs> 2 minutes

19:58 < instagibbs> review begging?

19:58 < instagibbs> :P

19:58 < wumpus> we already did that one

19:58 < instagibbs> ah k

19:58 < luke-jr> defer BIP148 to next week?

19:58 < wumpus> (though if you have any proposals just say so)

19:58 < instagibbs> https://github.com/bitcoin/bitcoin/pull/10333 <-- my beg

19:58 < wumpus> luke-jr: oh forgot about that one

19:58 < luke-jr> it's okay, a week might be good anyway

19:58 < gmaxwell> I'm sure you can discuss it in one minute.

19:59 < gmaxwell> :P

19:59 < kanzure> we need a meeting extension block

19:59 < * morcos> refrains

19:59 < wumpus> #endmeeting

19:59 < lightningbot> Meeting ended Thu May 18 19:59:09 2017 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)

19:59 < lightningbot> Minutes: http://www.erisian.com.au/meetbot/bitcoin-core-dev/2017/bitcoin-core-dev.2017-05-18-19.01.html

19:59 < lightningbot> Minutes (text): http://www.erisian.com.au/meetbot/bitcoin-core-dev/2017/bitcoin-core-dev.2017-05-18-19.01.txt

19:59 < lightningbot> Log: http://www.erisian.com.au/meetbot/bitcoin-core-dev/2017/bitcoin-core-dev.2017-05-18-19.01.log.html

19:59 < luke-jr> gmaxwell: well, to be fair, we've never had a formal time limit for meetings..

19:59 < luke-jr> :p

19:59 < instagibbs> it's a standardness rule...

19:59 < kanzure> it was to prevent spam

19:59 < gmaxwell> I like that they're limited. even though I always spend another half hour in resulting discussions.

19:59 < gmaxwell> kanzure: that limit was temporary!

20:00 < instagibbs> I think it's good to focus and respect people's time

20:00 < wumpus> agree

20:00 < sipa> we should revert to the original limit of 24 hours

20:00 < luke-jr> >_<

20:00 < gmaxwell> esp considering timezones don't put this meeting at good times of day for many.

20:00 < wumpus> so make sure that you have topics ready at the beginning, that makes it easier to schedule time for topics

20:00 < sipa> it's especially annoying for people in asia

20:00 < luke-jr> sipa: IMO the original limit was 5 hours

20:00 < sipa> i wonder if we should have the meeting alternate between two times

20:00 < luke-jr> sipa: since that's how long until the day changes in UTC

20:01 < gmaxwell> luke-jr: That isn't consistent with Craig Wright^W^WSatoshi's vision!

20:01 < luke-jr> gmaxwell: it's consistent with tonal though

20:01 < cfields> sipa: nah, let's just use an accounting trick and have meetings on a plane zooming through timezones.

20:01 < luke-jr> anyway, my parents showed up, so going to say hi and then get back to multiwallet

20:01 < kanzure> yes if you navigate the plane correctly, you can actually not spend any time at all in the meeting if you hop between timezones just right.

20:01 < cfields> I'm pretty sure we can cram 2 days into 1 that way :p

20:02 < luke-jr> cfields: rofl

20:02 < gmaxwell> too bad they stopped flying the Concorde.

20:02 < sipa> you just need a plane circling the arctic

20:02 < kanzure> sounds like bip148 discussion is slightly blocked by luke-jr parental units

20:03 < wumpus> sipa: if there's interest from people from asia joining we should certainly do that; in practice I never had any concrete complaints about the current meeting time though

20:03 < sipa> wumpus: we did, a long time ago

20:04 < gmaxwell> wumpus: jl2012 has lamented, and I believe kallewoof too.

20:04 < cfields> iirc it's prohibitive for jl2012, at least

20:04 < instagibbs> oh yeah, kalle too

20:04 < wumpus> ok good to know

20:04 < wumpus> maybe fanquake too (australia)

20:05 < instagibbs> he's a kiwi I thought

20:05 < jtimon> luke-jr: aug2017 seems too soon to me, I have no problems with bip149 on the other hand

20:05 < gmaxwell> we could also just look at the log data, determine a time when most of us are already here that the asian people can meet, and maybe just set up an hour to talk to them when they know people will be around.

20:05 < sipa> 7am europe, 10pm westcoast, 1pm hongkong, 2pm japan?

20:05 < sipa> maybe too early in europe

20:05 < instagibbs> 1am East coast US, hmmm

20:05 < sipa> instagibbs: oops

20:05 < wumpus> I'm usually up very early so that'd be ok with me

20:05 < gmaxwell> I think there is no time everyone can meet. But that's okay.

20:06 < gmaxwell> wumpus is up that early.

20:06 < gmaxwell> oh oops.

20:06 < wumpus> better than late at night

20:06 < instagibbs> I'll survive once a week if that works

20:06 < instagibbs> oh right Chaincode...

20:06 < instagibbs> :)

20:07 < sipa> damn timezones

20:07 < achow101> I'd rather not be up at 1 am

20:07 < sipa> achow101: you'll be on the west coast soon :)

20:07 < instagibbs> Maybe figuring a way to reliably rotate or something. I dunno.

20:08 < achow101> sipa: thinking ahead a bit past the summer :)

20:08 < gmaxwell> instagibbs: well Above I just suggested we have a second meeting at another time. It may be the case that the activity level in the meetings with asia are low enough that rotating wouldn't make sense.

20:08 < sipa> otherwise 4pm europe, 7am westcoast, 10am eastcoast, 10pm hongkong, 11pm japan?

20:08 < gmaxwell> instagibbs: if we pick a time when 'enough' people are here anyways, then it's not like setting aside the slot has a huge cost.

20:08 < instagibbs> hm yeah that makes more sense

20:08 < luke-jr> jtimon: well, it's already happening Aug 1 with BIP148..

20:09 < jtimon> luke-jr: right, I mean that seems too soon

20:10 < jtimon> so I don't think I will run bip148 myself

20:10 < gmaxwell> sipa: so there is like 3 hours between japan and auckland, so that might actually fail to get everyone in that part of the globe.

20:10 < luke-jr> jtimon: oh well. :<

20:11 < sipa> gmaxwell: yes, we need a slower earth rotation

20:11 < instagibbs> don't give kanzure any ideas

20:14 < gmaxwell> instagibbs: kanzure wants to destroy the moon I thought, that would reduce the slowing a lot.

20:14 < gmaxwell> sipa: that's already happening, just wait a while.

20:15 < sipa> gmaxwell: 2ms per century isn't very much

20:15 < kanzure> yeah i have some plans but it's sort of off-topic

20:25 < stickcuck> ok

20:41 < bitcoin-git> [bitcoin] jnewbery opened pull request #10423: [tests] skipped tests should clean up after themselves (master...cleanup_skipped) https://github.com/bitcoin/bitcoin/pull/10423

21:04 < bitcoin-git> [bitcoin] morcos opened pull request #10424: Populate services in GetLocalAddress (master...notnodenone) https://github.com/bitcoin/bitcoin/pull/10424

21:56 < jtimon> travis tests seem to be stuck for https://github.com/bitcoin/bitcoin/pull/9176

22:27 < kallewoof> Being able to participate in a meeting occasionally would be spiffy for sure.

22:55 < bitcoin-git> [bitcoin] earonesty opened pull request #10425: 0.14 (0.14...0.14) https://github.com/bitcoin/bitcoin/pull/10425

23:44 < bitcoin-git> [bitcoin] sipa opened pull request #10426: Replace bytes_serialized with bogosize (master...bogosize) https://github.com/bitcoin/bitcoin/pull/10426

23:49 < bitcoin-git> [bitcoin] MarcoFalke closed pull request #10241: Allow tests to pass even when stderr got populated (master...2017/04/test_stderr) https://github.com/bitcoin/bitcoin/pull/10241