#bitcoin-core-dev on 2016-09-29 — searchable irc log

05:58 < GitHub190> [bitcoin] laanwj pushed 5 new commits to master: https://github.com/bitcoin/bitcoin/compare/dc641415e75e...d675984fdfa4

05:58 < GitHub190> bitcoin/master f4dffdd Luke Dashjr: Add MIT license to Makefiles

05:58 < GitHub190> bitcoin/master 3b4b6dc Luke Dashjr: Add MIT license to autogen.sh and share/genbuild.sh

05:58 < GitHub190> bitcoin/master 3f8a5d8 Luke Dashjr: Trivial: build-aux/m4/l_atomic: Fix typo

05:58 < GitHub162> [bitcoin] laanwj closed pull request #8784: Copyright headers for build scripts (master...license_build) https://github.com/bitcoin/bitcoin/pull/8784

06:01 < wumpus> this is strange, https://github.com/bitcoin/bitcoin/pull/8832 "waxmihs2902 approved these changes 10 hours ago" clicking on that user's name gives a 404 page

06:03 < paveljanik> User deleted himself?

06:03 < wumpus> I guess

06:04 < wumpus> or github deleted him

06:06 < wumpus> as it is impossible to remove approvals, random-approval-spamming trolls wouldn't be unthinkable. Note that for normal posts, the user gets changed to 'ghost' if they disappear, so the link doesn't break.

06:08 < paveljanik> I think the approval stuff is half-baked...

06:09 < paveljanik> So maybe someone is trying DoS on us...

06:09 < wumpus> it's only on one issue AFAIK, so I doubt it'd be targeted at us :)

06:10 < paveljanik> ./DoS ... and then while :; do ./DoS; done ;-)

06:10 < wumpus> or they got caught *really* early; more likely such a troll/bot would just randomly approve pulls all over the site

06:11 < paveljanik> wumpus, search for the username on the github...

06:11 < paveljanik> complete github...

06:12 < wumpus> luckily they didn't add a message, the message added with review approval is also un-removable/un-editable IIRC

06:12 < wumpus> yes it's half-baked :)

06:12 < wumpus> I'm sure they'll fix this eventually

06:13 < GitHub131> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/d675984fdfa4...7d563cc16d64

06:13 < GitHub131> bitcoin/master fa05cfd MarcoFalke: [rpc] throw JSONRPCError when utxo set can not be read

06:13 < GitHub131> bitcoin/master 7d563cc Wladimir J. van der Laan: Merge #8832: [rpc] throw JSONRPCError when utxo set can not be read...

06:13 < GitHub93> [bitcoin] laanwj closed pull request #8832: [rpc] throw JSONRPCError when utxo set can not be read (master...Mf1610-rpcUtxoFail) https://github.com/bitcoin/bitcoin/pull/8832

07:19 < GitHub3> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/7d563cc16d64...489a6ab5073c

07:19 < GitHub3> bitcoin/master 64047f8 Wladimir J. van der Laan: depends: Add libevent compatibility patch for windows...

07:19 < GitHub3> bitcoin/master 489a6ab Wladimir J. van der Laan: Merge #8730: depends: Add libevent compatibility patch for windows...

07:19 < GitHub165> [bitcoin] laanwj closed pull request #8730: depends: Add libevent compatibility patch for windows (master...2016_09_libevent_windows_gcc_531) https://github.com/bitcoin/bitcoin/pull/8730

07:21 < GitHub162> [bitcoin] laanwj closed pull request #7522: Bugfix: Only use git for build info if the repository is actually the right one (master...bugfix_gitdir) https://github.com/bitcoin/bitcoin/pull/7522

07:24 < wumpus> cfields_: I've assigned some build-system issues to you, hope you don't mind

08:50 < GitHub87> [bitcoin] MarcoFalke pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/489a6ab5073c...8ca69a2a88a7

08:50 < GitHub87> bitcoin/master 54e5d7c jnewbery: Add bitcoin-tx JSON tests

08:50 < GitHub87> bitcoin/master 8ca69a2 MarcoFalke: Merge #8829: Add bitcoin-tx JSON tests...

08:51 < GitHub21> [bitcoin] MarcoFalke closed pull request #8829: Add bitcoin-tx JSON tests (master...test-bitcoin-tx-json) https://github.com/bitcoin/bitcoin/pull/8829

09:00 < wumpus> weird, I'm on testnet and trying to rebroadcast a transaction using "sendrawtransaction" but it's not working, I don't see anything appear in my wireshark session

09:01 < wumpus> maybe all the current nodes already know of it, otoh the "broadcast through 1 node(s)" in status doesn't increase either

09:03 < sipa> could it be that your peers knew about it, but evicted it?

09:03 < wumpus> but for that they'd have to request it first

09:04 < MarcoFalke> What about resendwallettransactions

09:04 < sipa> not necessarily from you

09:05 < wumpus> MarcoFalke: this should work with sendrawtransaction

09:05 < wumpus> could be that the net refactoring work changed the assumptions, will add some debugging...

09:06 < sipa> i don't think so

09:06 < MarcoFalke> it's a noop when fHaveMempool?

09:07 < sipa> no

09:07 < wumpus> I hope not

09:07 < MarcoFalke> sendrawtransaction will only put it in your mempool

09:07 < sipa> and then loop over all peers

09:07 < sipa> and call PushInventory

09:07 < MarcoFalke> oh

09:08 < sipa> but PushInventory is a noop if filterInventoryKnown for that peer already contains the tx

09:09 < wumpus> but it can't be there unless the peer requested it from *us*, right?

09:09 < wumpus> hm or if they inved it for relay to us, I guess

09:10 < wumpus> I don't have a capture of the whole session so we'll never know for sure

09:10 < sipa> well you can enable -debug=net and see if any messages go out in response to sendrawtransaction

09:10 < sipa> oh

09:11 < sipa> you're already doing that

09:11 < wumpus> no, no messages went out

09:11 < wumpus> and according to the tx metadata apparently one peer ever requested the transaction

09:11 < wumpus> but that was before I started logging

09:13 < wumpus> it could also be that that counter is broken, of course, I don't think that functionality is tested anywhere, it's UI only

09:33 < wumpus> ok, seems the behaviour was correct. After restart the transaction is no longer in the filter, so I do sendrawtransaction again, it sends out an inv to every node.
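The behaviour described above (PushInventory being a no-op for a peer whose known-inventory filter already holds the tx, and the filter being empty again after a restart) can be sketched roughly as follows. This is a minimal Python illustration, not Bitcoin Core's code: the `Peer` class, `flush`, and `send_raw_transaction` are hypothetical names, and a plain set stands in for the real filter.

```python
class Peer:
    """Hypothetical stand-in for a connected peer; not Bitcoin Core's CNode."""

    def __init__(self, name):
        self.name = name
        # Stand-in for filterInventoryKnown; a plain set is a simplifying
        # assumption made for this sketch.
        self.inventory_known = set()
        self.inv_queue = []

    def push_inventory(self, txid):
        """Queue an inv for txid unless the peer is believed to know it."""
        if txid in self.inventory_known:
            return False  # no-op: nothing will appear on the wire
        self.inv_queue.append(txid)
        return True

    def flush(self):
        """Pretend to send the queued invs and remember them as known."""
        sent = list(self.inv_queue)
        self.inventory_known.update(sent)
        self.inv_queue.clear()
        return sent


def send_raw_transaction(peers, txid):
    """Call push_inventory on every peer and return the invs actually sent."""
    for peer in peers:
        peer.push_inventory(txid)
    return [(peer.name, inv) for peer in peers for inv in peer.flush()]


peers = [Peer("a"), Peer("b")]
first = send_raw_transaction(peers, "tx1")      # both peers get an inv
second = send_raw_transaction(peers, "tx1")     # filtered: nothing goes out
restarted = [Peer("a"), Peer("b")]              # restart: filters start empty
third = send_raw_transaction(restarted, "tx1")  # the inv goes out again
```

Under these assumptions the second call sends nothing, which matches the empty wireshark capture, and only fresh peer state makes the inv reappear.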

09:33 < sipa> i wonder if we should add some random memory/cpu intensive task occasionally during block validation (so it doesn't consume more than 1% cpu rate or so)

09:33 < sipa> and if that fails, tell the user their hardware is too unreliable

09:33 < wumpus> yes, some games have that, it's not a bad idea

09:33 < wumpus> it allows distinguishing bugs from hw failures

09:34 < wumpus> we have the same problem, 'support' is overflowed with issues that are probably hw failures but we can't be sure so it wastes a lot of time

09:38 < paveljanik> We could ask users for the result of bitcoind -sanitychecks or something if we suspect HW issue...

09:39 < sipa> paveljanik: the problem is that such a sanitycheck would need to run for several hours or so to be reliable

09:40 < paveljanik> sure, but 90% rule...

09:40 < sipa> if a failure is detectable within minutes, it also means their block validation likely fails within minutes

09:40 < sipa> and many other things

09:41 < sipa> most random failures i hear about happen after several hours of sync

09:42 < wumpus> the sanity check definitely needs to run automatically and periodically for it to be useful

09:42 < wumpus> because otherwise it won't run in the right conditions

09:43 < wumpus> we should start selling hardware quality certificates: has synced the bitcoin block chain successfully

09:43 < wumpus> if it can do that, it won't crash on games either. Well. Maybe after GPU secp256k1 verification is implemented ;)

09:44 < sipa> maybe we should instead just market secp256k1 asics

09:45 < wumpus> would be a very interesting project if there's a market for that

09:46 < wumpus> hm, scrap that. The market for that is key-cracking :(

09:50 < wumpus> searching around a bit, apparently, many people *started* on a verilog/vhdl FPGA implementation of secp256k1, but there's no code to be found anywhere

09:58 < wumpus> I'm pleasantly surprised how well the wireshark dissector for the bitcoin protocol works, can just do "tcp port 18333" then use a display filter of "bitcoin" and get a running list of bitcoin packets, easy to inspect using the tree structure

10:01 < wumpus> compared to trying to mentally parse debug.log output this is a breeze

10:02 < waxwing> wumpus: they've had that dissector for years, i remember being pleasantly surprised in like 2013/14

10:02 < waxwing> not that i need it or anything, but it's cool :)

10:02 < wumpus> yeah yeah it's probably not new

10:03 < wumpus> but just in case someone didn't discover it 20 years ago yet, there you go...

10:03 < waxwing> sorry wasn't trying to be a hipster :)

10:03 < wumpus> :-)

10:03 < sipa> damn. i believe this is a blocker for bip151.

10:04 < wumpus> agree sipa

10:08 < wumpus> though it is possible to have dissectors with parameters, such as a key, but it's so much less user friendly!

10:12 < wumpus> a more practical way to go at this, post-bip151, would be to add functionality to dump packets to disk after decryption on receive and before encryption on send, it's for debugging anyhow, not snooping

10:14 < waxwing> iirc there are per-dissector settings, for ssl you can enter the session key there etc.

10:14 < wumpus> yes

10:14 < waxwing> there's an env variable you can set when running firefox that dumps the keys for you, and you can import it in

10:14 < sipa> implementing bip151 in wireshark will be fu

10:14 < waxwing> and chrome i think

10:14 < wumpus> waxwing: but if you deal with lots of different sessions, or sessions that are created on the fly

10:15 < waxwing> right, which i guess might be of particular import in your case (/me hasn't read bip151 tho)

10:15 < wumpus> it's certainly possible, the only remark I was making was in regard to what was the most practical way :)

10:16 < wumpus> one advantage of doing the decryption in the dissector would be that it could detect crypto problems

10:18 < sipa> i don't know what language wireshark uses... but reimplementing sha256, chacha20 and poly1305 doesn't sound trivial

10:18 < sipa> (unless those are already available as primitives)

10:18 < wumpus> C

10:19 < sipa> oh, the filters too?

10:19 < wumpus> it's also possible to write dissectors in lua, but that's only recommended for one-off projects, as performance is abysmal

10:19 < sipa> i thought those would be in a plugin language like lua or so

10:19 < wumpus> yes, most of them

10:19 < sipa> ah.

10:20 < sipa> the bitcoin dissector is in c currently?

10:20 < wumpus> let me see

10:21 < sipa> i vaguely remember seeing the code for it, years ago

10:22 < wumpus> https://github.com/wireshark/wireshark/blob/master/epan/dissectors/packet-bitcoin.c

10:24 < GitHub156> [bitcoin] MarcoFalke opened pull request #8834: [qa] blockstore: Switch to dumb dbm (master...Mf1610-qaBlockstoreDumb) https://github.com/bitcoin/bitcoin/pull/8834

10:24 < wumpus> apparently it's not yet updated for 0.12.0+, no handler for sendheaders, feefilter etc

10:27 < mmeijeri> The same environment variable (SSLKEYLOGFILE) also works for Chrome.

10:29 < wumpus> oh it's a dynamic file? that's a good idea

10:29 < waxwing> mmeijeri: it's coming back to me, i have a feeling that it handles multiple sessions transparently, by just checking which key material works for which session. not that this matters, sorry for OT :)

10:30 < waxwing> ah yes that was it, i think it stores the premaster secrets for all sessions, then in each handshake it tries them out until it finds what works. something like that.

11:10 < GitHub26> [bitcoin] MarcoFalke pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/8ca69a2a88a7...cc9e8aca5f95

11:10 < GitHub26> bitcoin/master a0f8482 Suhas Daftuar: [qa] Split up slow RPC calls to avoid pruning test timeouts

11:10 < GitHub26> bitcoin/master cc9e8ac MarcoFalke: Merge #8827: [qa] Split up slow RPC calls to avoid pruning test timeouts...

11:10 < GitHub122> [bitcoin] MarcoFalke closed pull request #8827: [qa] Split up slow RPC calls to avoid pruning test timeouts (master...fix-pruning-timeout) https://github.com/bitcoin/bitcoin/pull/8827

11:19 < luke-jr> any reason for me to NOT switch to 64-bit after I sleep?

11:22 < phantomcircuit> luke-jr: something about #8828 results in a null pointer dereference

11:22 < phantomcircuit> afaict i just moved code around basically

11:23 < phantomcircuit> you wrote the original method

11:23 < phantomcircuit> can you take a look?

11:23 < luke-jr> phantomcircuit: too sleepy now, can you ask me after I finish switching into 64-bit (probably tomorrow)? :x

11:24 < phantomcircuit> luke-jr: whoa now

11:24 < phantomcircuit> you're moving to 64bit?

11:24 < luke-jr> phantomcircuit: unless someone stops me before I wake up

11:24 < phantomcircuit> also the line gdb thinks it's segfaulting on makes no sense

11:25 < luke-jr> ?

11:25 < phantomcircuit> nOrderPosNext = 0;

11:25 < luke-jr> so this is NULL? O.o

11:25 < luke-jr> + int64_t& nOrderPosNext = nOrderPosNext;

11:25 < luke-jr> wtf is this

11:26 < luke-jr> bet that's your problem, it makes no sense

11:26 < luke-jr> prob undefined behaviour

11:26 < luke-jr> but I'm half asleep, so who knows

11:55 < luke-jr> oh well, hope that helped, otherwise ping me tomorrow I guess :x

11:55 < luke-jr> night

12:01 < phantomcircuit> luke-jr: that's weird but apparently correct

12:10 < midnightmagic> that's a weird scoping problem

12:10 < midnightmagic> lol

13:00 < GitHub52> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/cc9e8aca5f95...9b94cca41f3e

13:00 < GitHub52> bitcoin/master 64d9507 Pavel Janík: [WIP] Remove unused statement in serialization

13:00 < GitHub52> bitcoin/master 9b94cca Wladimir J. van der Laan: Merge #8658: Remove unused statements in serialization...

13:00 < GitHub41> [bitcoin] laanwj closed pull request #8658: Remove unused statements in serialization (master...20160902_nVersion_serialization_cleanup) https://github.com/bitcoin/bitcoin/pull/8658

13:02 < kanzure> wumpus: tcpdump if you want to generate .pcap files without wireshark's gui

13:05 < GitHub166> [bitcoin] jgarzik closed pull request #6451: BIP 102: Increase block size limit to 2MB (master...2015_2mb_blocksize) https://github.com/bitcoin/bitcoin/pull/6451

13:06 < * midnightmagic> stabs wireshark's gui

13:36 < GitHub180> [bitcoin] MarcoFalke opened pull request #8835: [qa] nulldummy.py: Don't run unused code (master...Mf1610-qaNulldummyUnused) https://github.com/bitcoin/bitcoin/pull/8835

14:22 < GitHub8> [bitcoin] jnewbery opened pull request #8836: bitcoin-util-test.py should fail if the output file is empty (master...bitcoin-tx-no-empty-outputs) https://github.com/bitcoin/bitcoin/pull/8836

14:27 < GitHub100> [bitcoin] jnewbery opened pull request #8837: allow bitcoin-tx to parse partial transactions (master...bitcoin-tx-partial-transactions) https://github.com/bitcoin/bitcoin/pull/8837

14:34 < GitHub16> [bitcoin] jnewbery opened pull request #8838: Only log block size if block size is being accounted (master...dont_log_size) https://github.com/bitcoin/bitcoin/pull/8838

14:46 < GitHub177> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/9b94cca41f3e...2dd57e4f9f58

14:46 < GitHub177> bitcoin/master fa156c6 MarcoFalke: [qa] nulldummy: Don't run unused code

14:46 < GitHub177> bitcoin/master 2dd57e4 Wladimir J. van der Laan: Merge #8835: [qa] nulldummy.py: Don't run unused code...

14:46 < GitHub194> [bitcoin] laanwj closed pull request #8835: [qa] nulldummy.py: Don't run unused code (master...Mf1610-qaNulldummyUnused) https://github.com/bitcoin/bitcoin/pull/8835

15:04 < GitHub44> [bitcoin] laanwj opened pull request #8839: test: Avoid ConnectionResetErrors during RPC tests (master...2016_09_freebsd_rpctest_fix) https://github.com/bitcoin/bitcoin/pull/8839

15:08 < GitHub42> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/2dd57e4f9f58...c84181665f34

15:08 < GitHub42> bitcoin/master 16f8823 fanquake: [depends] Boost 1.61.0

15:08 < GitHub42> bitcoin/master c841816 Wladimir J. van der Laan: Merge #8819: [depends] Boost 1.61.0...

15:09 < GitHub98> [bitcoin] laanwj closed pull request #8819: [depends] Boost 1.61.0 (master...depends-boost-1-61-0) https://github.com/bitcoin/bitcoin/pull/8819

15:23 < GitHub53> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/c84181665f34...c9d7b0de2fc9

15:23 < GitHub53> bitcoin/master fa9cd25 MarcoFalke: [qa] blockstore: Switch to dumb dbm

15:23 < GitHub53> bitcoin/master c9d7b0d Wladimir J. van der Laan: Merge #8834: [qa] blockstore: Switch to dumb dbm...

15:23 < GitHub36> [bitcoin] laanwj closed pull request #8834: [qa] blockstore: Switch to dumb dbm (master...Mf1610-qaBlockstoreDumb) https://github.com/bitcoin/bitcoin/pull/8834

15:27 < GitHub17> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/c9d7b0de2fc9...f560d9564f74

15:27 < GitHub17> bitcoin/master 7e5fd71 Pavel Janík: Do not include env_win.cc on non-Windows systems

15:27 < GitHub17> bitcoin/master f560d95 Wladimir J. van der Laan: Merge #8826: Do not include env_win.cc on non-Windows systems...

15:27 < GitHub75> [bitcoin] laanwj closed pull request #8826: Do not include env_win.cc on non-Windows systems (master...20160928_leveldb_no_win) https://github.com/bitcoin/bitcoin/pull/8826

16:05 < GitHub51> [bitcoin] laanwj opened pull request #8840: test: Explicitly set encoding to utf8 when opening text files (master...2016_09_textfiles_locale) https://github.com/bitcoin/bitcoin/pull/8840

16:23 < GitHub57> [bitcoin] jl2012 opened pull request #8841: [qa] fix nulldummy test (master...nulldummytest) https://github.com/bitcoin/bitcoin/pull/8841

17:16 < cfields_> wumpus: no problem

17:17 < cfields_> jonasschnelli: i had a look at your circular dep issue. No luck here. The (hack) solution is to use grouping libs, but I'd really rather not go down that road. I've started untangling dependencies instead.

17:25 < wumpus> which dependencies are the problem there? the circular dependency between libbitcoin_server and libbitcoin_wallet?

17:26 < wumpus> that would "just" require solving https://github.com/bitcoin/bitcoin/issues/7965

17:26 < arubi> jnewbery, :) I was just about to whine about 'if (!DecodeHexTx(txDecodeTmp, strHexTx)'

17:26 < arubi> thanks!

17:57 < arubi> too bad, I wanted to ask him about it here

18:20 < jonasschnelli> wumpus, cfields_: Yes. I think we need to complete 7965 first

18:37 < jl2012> Travis test failed for #8841 with an unrelated test

18:37 < jl2012> https://www.irccloud.com/pastebin/pWEgvccJ/

18:38 < jl2012> https://travis-ci.org/bitcoin/bitcoin/jobs/163788152

18:40 < luke-jr> phantomcircuit: correct for what?

18:40 < MarcoFalke> jl2012: We should probably create an issue for those failures

18:44 < MarcoFalke> and a minor nit: I think the if(msg) is redundant. (The empty string already evaluates to false in python)

18:47 < jl2012> indeed

19:00 < jonasschnelli> bingbong

19:00 < luke-jr> hi

19:00 < sipa> ovatobat

19:00 < CodeShark> yo

19:01 < wumpus> #startmeeting

19:01 < lightningbot> Meeting started Thu Sep 29 19:01:19 2016 UTC. The chair is wumpus. Information about MeetBot at http://wiki.debian.org/MeetBot.

19:01 < lightningbot> Useful Commands: #action #agreed #help #info #idea #link #topic.

19:01 < BakSAj> hi

19:01 < CodeShark> meeting time!

19:01 < MarcoFalke> meeting!

19:01 < wumpus> #bitcoin-core-dev Meeting: wumpus sipa gmaxwell jonasschnelli morcos luke-jr btcdrak sdaftuar jtimon cfields petertodd kanzure bluematt instagibbs phantomcircuit codeshark michagogo marcofalke paveljanik NicolasDorier

19:02 < MarcoFalke> (oh already started)

19:02 < btcdrak> here

19:02 < cfields_> hi

19:02 < kanzure> hi

19:02 < wumpus> any proposed topics?

19:02 < jonasschnelli> topic proposal: pruning and blockrelay

19:02 < petertodd> hi

19:02 < sipa> policy against uncompressed keys or not

19:03 < wumpus> #topic pruning and blockrelay

19:03 < jonasschnelli> I think we should add a service flag for block relay with a min-height

19:03 < jonasschnelli> NODE_PRUNENETWORK or something

19:03 < sipa> there have been multiple ideas around that

19:04 < petertodd> IMO whatever we do, we should recognise that w/ segwit's larger blocks we can expect a lot of full nodes to run out of disk space quite soon

19:04 < sipa> the easiest is just to add a flag that says you relay valid blocks and transactions, but not historical blocks more than a few deep

19:04 < jonasschnelli> I guess people slowly start to prune the blockchain to a max of 80GB or similar... but I guess not everyone is aware of the fact that you don't relay then

19:04 < wumpus> it would be nice to support more than one range, e.g. also archive nodes that host part of the old blocks

19:04 < sipa> it becomes harder when you want multiple ranges

19:04 < petertodd> do we have a reason for more than one range?

19:05 < jonasschnelli> We could introduce another message type... blockrange or so

19:05 < sipa> it becomes even harder when you want to support sharding in an efficient way

19:05 < wumpus> I'm not sure why it becomes hard, just add a query message that returns what ranges are supported

19:05 < petertodd> sipa: what do you mean by sharding exactly?

19:05 < sipa> petertodd: you'd configure your node to maintain a certain % of blocks

19:05 < jonasschnelli> wumpus: query, yes, why not, or just inform like we do with sendheaders

19:05 < petertodd> sipa: see, given that the bitcoin protocol can't be safely sharded right now, I think we can safely say that we don't need to support sharding in block relay yet

19:05 < petertodd> sipa: doing so might even be dangerous if people start using it

19:06 < sipa> petertodd: not in block relay

19:06 < sipa> petertodd: for block archival

19:06 < petertodd> sipa: but why shard vs. keep ranges?

19:06 < petertodd> sipa: (ranges of full blocks)

19:06 < luke-jr> BitTorrent already does this. Surely we can learn from that?

19:06 < petertodd> luke-jr: I don't think so - bittorrent is a very different problem than bitcoin

19:06 < wumpus> this is for letting other peers know what ranges of blocks are hosted, I don't think this should affect relay

19:06 < sipa> so, i've been running statistics on what block depths are being requested from nodes

19:06 < luke-jr> petertodd: learn from it, not use it directly

19:07 < luke-jr> petertodd: BitTorrent's problem isn't very different from IBD

19:07 < jonasschnelli> sipa: interesting.. do you have the stats publicly available somewhere?

19:07 < jonasschnelli> I've wanted to do this for a long time

19:07 < petertodd> luke-jr: so, the thing is bittorrent has the problem of a diverse set of files, we just don't have that problem and can optimise differently because everyone needs access to the same set of data

19:08 < sipa> there are something like 4 meaningful 'ranges' 1) the top 2 blocks (just relay) 2) up to ~2500 blocks deep... requested very often 3) up to ~10000 deep... requested a few times more than the next range 4) the rest

19:08 < wumpus> otoh bittorrent has a fixed block size :)

19:08 < sipa> wumpus: so do we *ducks*

19:08 < petertodd> sipa: that probably corresponds to how long people leave their nodes offline :)

19:08 < btcdrak> inb4 Bittorrent XT

19:09 < sipa> jonasschnelli: they're not available, and the ranges i gave above are just from me quickly glancing over the result

19:09 < petertodd> btcdrak: I use Bittorrent Unlimited myself

19:09 < jonasschnelli> What about fingerprinting issued in conjunction with available ranges?

19:09 < jonasschnelli> *issues

19:09 < petertodd> jonasschnelli: make them powers of two?

19:09 < sipa> well 4 ranges can be done with 2 service bit flags

19:09 < sipa> gmaxwell: you've worked on these ideas before, comments?

19:09 < jonasschnelli> But would that work with the flexible pruning option based on MB?

19:10 < petertodd> jonasschnelli: sure, just find the biggest range less than the pruning amount

19:10 < sipa> jonasschnelli: you'd change your service bits on the fly
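The idea of signalling four range classes with two service bits, and of advertising the biggest class that still fits within what a pruned node keeps, can be sketched as follows. This is only an illustration in Python: the bit positions, the class depths (taken loosely from the ranges quoted earlier in the meeting) and all function names are assumptions, since no concrete encoding was agreed in the discussion.

```python
# Hypothetical bit positions; no such service bits were assigned here.
RANGE_BIT_LOW = 1 << 10
RANGE_BIT_HIGH = 1 << 11

# Depth (in blocks from the tip) served by each class; None means all history.
# The numbers loosely follow the ranges quoted in the log and are assumptions.
RANGE_DEPTHS = [2, 2500, 10000, None]


def encode_range(class_index):
    """Map a class index 0..3 onto the two hypothetical service bits."""
    flags = 0
    if class_index & 1:
        flags |= RANGE_BIT_LOW
    if class_index & 2:
        flags |= RANGE_BIT_HIGH
    return flags


def decode_range(services):
    """Recover the advertised depth from a peer's service flags."""
    index = (1 if services & RANGE_BIT_LOW else 0) | (
        2 if services & RANGE_BIT_HIGH else 0
    )
    return RANGE_DEPTHS[index]


def largest_class_within(blocks_kept):
    """Pick the biggest class whose depth does not exceed blocks_kept.

    blocks_kept=None means the node keeps everything. The sketch assumes a
    pruned node always keeps at least the 2 blocks of the smallest class.
    """
    if blocks_kept is None:
        return 3
    best = 0
    for index, depth in enumerate(RANGE_DEPTHS[:3]):
        if depth <= blocks_kept:
            best = index
    return best


# A node pruned by disk size would recompute this as its kept depth changes.
services = encode_range(largest_class_within(3000))
advertised_depth = decode_range(services)
```

Because the class is derived from what is currently on disk, a node pruning by MB would change its service bits on the fly as that amount shifts between classes.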

19:10 < wumpus> why would the ranges need to be in the flags?

19:10 < gmaxwell> sorry, I missed that the meeting started.

19:10 < jonasschnelli> Yes. Why? Better add an explicit message for the range

19:10 < sipa> how would you otherwise discover what nodes to connect to?

19:10 < sipa> just randomly try?

19:10 < petertodd> jonasschnelli: oh right, you mean if MB != blocks... sorry.

19:10 < wumpus> I think you'll need a service flag to show support for the protocol, but not what ranges you have

19:10 < jonasschnelli> query or inform the other node if proto-ver > NODE_PRUNENETWORK

19:11 < wumpus> well that can be negotiated later, like bittorrent does I guess

19:11 < sipa> wumpus: well you do want addr messages to contain this information

19:11 < wumpus> I doubt bitcoin has 'service flags' in its tracker what blocks nodes have

19:11 < petertodd> sipa: so the nice thing about bitcoin, is just randomly try will probably work fairly often due to the low number of ranges out there

19:11 < wumpus> as that changes all the time anyhow

19:11 < gmaxwell> I was strongly of the view that we needed to signal at least two ranges. Sipa's latest measurements make me think at least three are needed.

19:11 < wumpus> s/bitcoin/bittorrent/

19:11 < jonasschnelli> I think informing other nodes ranges over addr is another thing...

19:11 < jonasschnelli> A first step would be to inform after connect

19:11 < wumpus> yes, addr is another thing

19:12 < gmaxwell> I think ranges in service bits are no big deal, the harder question is what to do about the history. having nodes with 150GB of history in order to serve the last range is not very viable.

19:12 < wumpus> could be done later if an efficient way is needed to *locate* peers with certain ranges

19:12 < wumpus> but that seems premature optimization

19:12 < gmaxwell> We will need to redo addr sometime relatively soon in any case, as our messages are not compatible with HS-NG.

19:12 < petertodd> gmaxwell: oh, you mean Tor's new hidden services standard right?

19:12 < gmaxwell> petertodd: yes.

19:13 < gmaxwell> (also I2P though that's not new)

19:13 < wumpus> I think the number of ranges should be variable

19:13 < wumpus> redesigning addr is a different topic

19:13 < wumpus> also necessary, but again, doesn't need to be on one heap

19:13 < gmaxwell> wumpus: when I'm saying ranges I am specifically referring to the top-N zones.

19:14 < petertodd> well, so if we add service bits for recent history ranges, that should be possible to implement as a separate feature to archival history ranges, and it'd be a big first step

19:14 < wumpus> I think it should be possible to, say, only host the first 20GB of blocks

19:14 < jonasschnelli> historic only nodes

19:14 < wumpus> I don't see why it should be restricted to only recent history

19:14 < petertodd> I don't think it's likely we'll see the two different features collide, so maybe implement recent history ranges first

19:14 < wumpus> or I mean first 20GB + last 144 blocks

19:15 < gmaxwell> For history storage, I was previously working on a proposal where nodes could signal a small (32 bit) seed and a size and from that everyone would know what parts of the history they would store. I was so far unable to unify two different schemes, one which was computationally efficient to figure out who had what, and one which never required a peer to fetch a block it had previously deleted.

19:15 < sipa> so very quick breakdown: out of 7M requested blocks, 100k were for the tip, range 2-2500 has around 200-2000 requests per block, and from 10000 to genesis deep there are around 20 per block

19:16 < gmaxwell> I think for now we should not worry about the old history part and only worry about Top-n vs everything, as that fits into the pruning we already have and can be accomplished purely with service bits.

19:16 < wumpus> the bittorrent problem is different in that there the goal of each node is to have everything

19:16 < petertodd> so a social consideration here, is we can think in terms of recent history as "if there's a flaw, how much would we ever reorg w/o just saying bitcoin has failed?"

19:16 < gmaxwell> petertodd: that's partly why we have the 288 block maximum amount of pruning.

19:17 < petertodd> gmaxwell: indeed, and that's only two days...

19:17 < jonasschnelli> Using multiple service bits for 4 ranges seems to be a hackish-design IMO

19:17 < gmaxwell> at 100 blocks any reorg will _necessarily_ cause unrecoverable losses. So 288 basically gives a day plus an extra day for overhead.
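The arithmetic behind "a day plus an extra day" can be checked quickly. This small Python sketch assumes the 10-minute target block spacing and reads the 100-block figure as the coinbase maturity; the variable names are chosen for illustration only.

```python
TARGET_SPACING_MINUTES = 10   # target time between blocks
MIN_BLOCKS_TO_KEEP = 288      # the pruning limit mentioned in the log
COINBASE_MATURITY = 100       # assumed meaning of "100 blocks" above

blocks_per_day = 24 * 60 // TARGET_SPACING_MINUTES
days_kept = MIN_BLOCKS_TO_KEEP / blocks_per_day
maturity_hours = COINBASE_MATURITY * TARGET_SPACING_MINUTES / 60

print(blocks_per_day, days_kept, round(maturity_hours, 1))
```

At the target spacing, 288 blocks cover two days, while 100 blocks correspond to well under one day.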

19:17 < petertodd> there's also a natural time criteria from how the difficulty adjustments reduce your resistance to 51% attack - if your node is offline longer, the minimum attacker size to fool you goes down

19:17 < sipa> strangely enough: i see many more requests around 1000 deep than around 100 deep

19:18 < gmaxwell> jonasschnelli: I don't see anything hackish.

19:18 < wumpus> jonasschnelli: I also think it's a strange use of service bits

19:18 < jonasschnelli> I'd prefer using a single service bit to state pruned blockchain and then a new message (or append something to version?)

19:18 < petertodd> sipa: probably because people don't turn their nodes on and off every day

19:18 < gmaxwell> sipa: you probably want to filter out the bitnodes spider, as I believe it requests a block to check the node is working.

19:18 < sipa> gmaxwell: ah.

19:18 < gmaxwell> petertodd: someone who hasn't turned their node on will request all of 0 to -1000. so it will not make 1000 greater.

19:19 < gmaxwell> jonasschnelli: NAK.

19:19 < petertodd> gmaxwell: oh! I didn't know we did that

19:19 < sipa> i'm a bit surprised people think there is no need to have the available block ranges indicated in addr messages

19:19 < sipa> (whether through service bits, or some extension)

19:19 < jonasschnelli> I think there is a need... but it could be a second step

19:19 < wumpus> jonasschnelli: appending to version should be unnecessary, that's also a hack :)

19:19 < sipa> jonasschnelli: if it's a second step, we need to extend addr, and the whole management of addresses

19:19 < jonasschnelli> Okay. Agree. What about a new message type?

19:19 < jonasschnelli> blockrange

19:19 < sipa> jonasschnelli: you don't understand.

19:20 < gmaxwell> jonasschnelli: look at pieter's request figures, if nodes are effectively forced to go to peers that have everything whenever they connect, because they don't know whether they'll be able to fetch any blocks at all otherwise, then it will put lots more load on them.. causing people to stop offering blocks... causing more pressure on what remains.

19:20 < sipa> jonasschnelli: the point of having it in service bits is so nodes can find peers that have the range they need

19:20 < wumpus> but addr information gets old really fast

19:21 < sipa> wumpus: much less so with feeler connections

19:21 < wumpus> nodes may dynamically change what blocks they have, so there will always be cases of nodes connecting and realizing they have nothing to offer each other

19:21 < jonasschnelli> Okay. I see the point.

19:21 < sipa> (presumably, i don't have numbers)

19:21 < wumpus> just like currently nodes will try to connect into black holes that no longer host a node

19:22 < petertodd> so another interesting thing here is that ranges are queried linearly - you download blocks in a roughly linear fashion - so we could take advantage of that by making sure that nodes with one range keep track of nodes with adjacent ranges

19:22 < wumpus> sipa: sure, feeler connections make it somewhat better

19:22 < gmaxwell> wumpus: yes, sometimes the data is wrong. But there is a big difference between having 80% of the nodes on the network giving you no idea if they'll be useful at all until after you connect, vs a suggestion that might sometimes be wrong.

19:22 < wumpus> but I don't think addr is a very up-to-date information source

19:22 < petertodd> thus, as you sync the first time, ask nodes with the range you're syncing at this moment for the next range you need

19:23 < luke-jr> wumpus: if ranges are deterministic, they don't need to be up to date

19:23 < sipa> petertodd: yes, any sharding plan wouldn't randomly distribute the kept blocks, but keep randomly distributed ranges

19:23 < gmaxwell> wumpus: I don't know if you realize that sipa and I are not thinking in terms of absolute ranges here. but nodes saying "I keep the last 288" or "I keep the last 2016" or "I have all of history".

19:23 < wumpus> gmaxwell: but indeed this is a different problem from the bittorrent problem where everyone's goal is to have everything

19:23 < sipa> gmaxwell: well that's sharding... maybe that is something to postpone for later

19:24 < petertodd> sipa: sure, I'm more talking about how the linearity affects the network p2p design - preferentially peering with peers with the adjacent range may even be a reasonable design

19:24 < luke-jr> wumpus: eh, everyone needs to get everything

19:24 < wumpus> there, nodes can just connect randomly and have a high chance the other node has something to offer them

19:24 < gmaxwell> wumpus: and I wouldn't expect that data to go out of date fast.. pretty much only when nodes go up and down.

19:24 < sipa> oh, nvm, i'm misreading

19:24 < wumpus> luke-jr: only initially

19:24 < luke-jr> oh, I see the distinction

19:24 < wumpus> luke-jr: bittorrent nodes don't throw away blocks, generally

19:24 < luke-jr> f(best-height, seed-in-addr) -> ranges

19:25 < gmaxwell> for the spreading the history around, as mentioned I came up with concrete schemes (based on consistent hashes) that have nice properties.

19:25 < sipa> i wonder whether we need to have that in the first go at this

19:26 < jonasschnelli> I think a first simple solution that allows extending it further would be appreciated.

19:26 < sipa> even just having serve-everything and serve-the-last-288-and-relay-at-tip would be a good addition

19:26 < wumpus> making the ranges deterministic makes some sense, on the other hand, it does restrict the flexibility of nodes to choose what ranges they host, it means everything has to be got right in first try

19:26 < gmaxwell> sipa: thats what I am saying.

19:26 < jonasschnelli> sipa: agree

19:26 < gmaxwell> I do not think we can do better immediately anyways.

19:26 < sipa> 21:18:07 < jonasschnelli> I'd prefer using a single service bit to state pruned blockchain and then a new message (or append something to version?)

19:26 < gmaxwell> sipa: though your latest figures suggest that the 2016 depth is important too.

19:26 < sipa> 21:19:07 < gmaxwell> jonasschnelli: NAK.

19:27 < petertodd> if nodes attempt to maintain a few connections to peers that have the next range after they have, maybe it doesn't matter exactly what the ranges actually are? any given node would have a few connections to the next range, and anyone syncing from them could ask for those connections

19:27 < gmaxwell> sipa: my understanding of jonasschnelli comment was there should be a bit that says "I relay blocks but don't have history" I am NAK on that.

19:27 < wumpus> as there is no scope for later optimization, because all nodes have to agree what ranges are implied

19:27 < jonasschnelli> We could add a service bit that says "I relay only the last 288 blocks"

19:27 < wumpus> jannes: yes that would be the initial idea

19:27 < wumpus> jonasschnelli*

19:28 < sipa> gmaxwell: how is that different from what i suggested?

19:28 < sipa> 21:26:10 < sipa> even just having serve-everything and serve-the-last-288-and-relay-at-tip would be a good addition

19:28 < jonasschnelli> I think my initial idea with the general pruning service bit and a new message type is too complex and inflexible

19:28 < gmaxwell> jonasschnelli: yes, that would be better, though pieter's data suggests that there are a LOT of requests at 1000. I think if I had that data I would have been suggesting the maximum pruning should be 2016, and then had the bit at that depth.

19:28 < gmaxwell> sipa: the ability to relay blocks at depth -10.

19:29 < sipa> gmaxwell: less than 2% of blocks requested from my node are at the tip

19:29 < sipa> (but the tip is still 100x more frequent than any other individual depth)

19:29 < sipa> gmaxwell: "a service bit to indicate pruned blockchain" implies you can serve 288 deep :)

19:30 < petertodd> gmaxwell: re: maximum pruning depth, it's reasonable for that to be a similar % of the total data that storing the UTXO set takes - if you have 10GB of UTXO, 2GB of block data isn't a big change

19:30 < wumpus> yes, you could define it as that

19:30 < gmaxwell> I don't think there is any remaining disagreement on using bit(s) to signal I have a top-n. But I have some doubt on N. it needs to capture the largest amount of the block relay bandwidth without being unduly pruning incompatible.

19:30 < wumpus> 288 is the minimum pruning amount in bitcoin core already so it'd be a valid choice

19:30 < morcos> as a first pass, i wonder if you preferentially downloaded from pruned peers whenever you were behind by less than 288 blocks, that would take enough load off peers serving full history?

19:30 < gmaxwell> morcos: absolutely.

19:31 < jonasschnelli> Good idea

19:31 < wumpus> yes, that would make sense
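
A rough sketch of morcos's suggestion above, for illustration only: when a node is fewer than 288 blocks behind, it tries peers that keep only recent blocks first and falls back to full-history peers. NODE_LAST_288 and its bit value, the Peer type, and both function names are assumptions invented for this example; only NODE_NETWORK (bit 0) is an existing service bit.

```python
from dataclasses import dataclass

NODE_NETWORK = 1 << 0    # existing service bit: peer serves the full block history
NODE_LAST_288 = 1 << 10  # ASSUMED flag and value: peer serves only the last 288 blocks

MIN_BLOCKS_KEPT = 288    # minimum pruning depth mentioned in the discussion


@dataclass
class Peer:
    """Hypothetical peer record holding the advertised service bits."""
    name: str
    services: int


def can_serve(peer: Peer, blocks_behind: int) -> bool:
    """True if the peer is expected to have the blocks we still need."""
    if peer.services & NODE_NETWORK:
        return True
    has_recent = bool(peer.services & NODE_LAST_288)
    return has_recent and blocks_behind < MIN_BLOCKS_KEPT


def pick_download_peers(peers: list[Peer], blocks_behind: int) -> list[Peer]:
    """Return usable peers, pruned ones first, to spare full-history peers."""
    usable = [p for p in peers if can_serve(p, blocks_behind)]
    # sorted() is stable; False (pruned) sorts before True (full history).
    return sorted(usable, key=lambda p: bool(p.services & NODE_NETWORK))
```

With a node 10 blocks behind, the pruned peer is returned ahead of the full-history peer; at 1000 blocks behind only the full-history peer remains.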

19:31 < gmaxwell> unfortunately, sipa's data suggests that 288 sheds less traffic than measurements years ago suggested.

19:31 < sipa> maybe i should compute statistics in bytes rather than blocks

19:31 < morcos> gmaxwell: it wasn't clear to me what the integral from 1 to 288 was compared to 288 to inf

19:31 < wumpus> well it is a compromise

19:31 < wumpus> putting the threshold higher makes some peers completely useless

19:31 < sipa> to see what percentage of bandwidth is needed in 1-288

19:31 < wumpus> which reduces morcos 's argument

19:32 < jonasschnelli> Yes. I guess you convinced me to use two service bits then. -288 and -2016

19:32 < gmaxwell> which is why it might be useful to use two bits and be able to signal 1-288, 1-2016... and perhaps start encouraging people to not prune shorter than 2016.

19:32 < sipa> i think we're getting into a design discussion here

19:32 < sipa> my numbers are very premature and not well analysed

19:32 < wumpus> it'd also be possible to add a 288-flag now, and then consider a 2016 flag later

19:32 < gmaxwell> sipa: indeed, thought that was the input you requested from me.

19:32 < morcos> wumpus: yes, thats what i'm saying

19:32 < gmaxwell> wumpus: yes! indeed.

19:33 < jonasschnelli> Agree with wumpus

19:33 < wumpus> if it turns out to be necessary

19:33 < petertodd> wumpus: ACK

19:33 < sipa> yes, i think just a 1-288 one seems useful

19:33 < wumpus> good :)

19:33 < jonasschnelli> Start with a simple tip-288 relay, and get some experience

19:33 < gmaxwell> wumpus: it looks pretty clearly necessary but no need to do everything at once.

19:33 < petertodd> wumpus: basically advice is, turn your node on at least once every two days

19:33 < wumpus> petertodd: yes

19:33 < gmaxwell> petertodd: we really should have cron mode for the daemon where it just syncs up and shuts off. :P

19:34 < gmaxwell> bitcoind -oneshot

19:34 < gmaxwell> :P

19:34 < petertodd> gmaxwell: heh, that's not a crazy idea - I'd use it on my laptop

19:34 < jonasschnelli> didn't we once have a proposal for the pause option?

19:34 < wumpus> right, there's a flag that quits after reindex, but none that exits after sync

19:34 < wumpus> would be easy to add tho

19:34 < morcos> we could just ask for the utxo set, should we discuss ideas how to do that

19:34 < CodeShark> ^ yes :)

19:34 < petertodd> make -oneshot run in the foreground with a progress bar :)

19:34 < wumpus> without utxo commitment that's a no-go

19:35 < morcos> thanks codeshark

19:35 < petertodd> wumpus: +1

19:35 < gmaxwell> morcos: pointless when we were unable to get past the discussion for the security model change to not validate the past history based on proof of work.

19:35 < petertodd> and let's not underestimate how dangerous UTXO commitments can be - I'm very dubious about committing to the (U)TXO set more recently than maybe a month or two

19:35 < CodeShark> would be great to query utxo for quick sync, then go backwards in time fetching blocks to increase security...but yes, this is a design discussion

19:35 < morcos> i was making a joke, sorry

19:36 < CodeShark> alas, quick sync doesn't look feasible in the near term

19:36 < wumpus> ok, next topic?

19:36 < gmaxwell> but since that was brought up... Can we talk about removing checkpoints?

19:36 < wumpus> #topic removing checkpoints

19:36 < sipa> what % of transactions are before the last checkpoint

19:37 < sipa> does anyone know?

19:37 < morcos> someone should write up a design proposal for that to be evaluated

19:37 < gmaxwell> Right now they're used for two things, preventing header flooding with low difficulty headers; and skipping signatures in earlier blocks.

19:37 < petertodd> gmaxwell: just removing checkpoints, or assuming sigs are valid if buried deep enough?

19:37 < sipa> gmaxwell: and 3) estimating progress

19:37 < wumpus> keeping something for estimating progress would make sense

19:37 < sipa> i think 1) remains needed and 3) remains useful

19:37 < wumpus> that doesn't need to be checkpoints

19:38 < gmaxwell> because a very small percentage of the transactions are below the checkpoint .. since libsecp256k1 (and I expect the checkqueue)-- my point two is basically pointless, and I think it could just be removed

19:38 < gmaxwell> I think on a desktop it only adds 15-20 minutes to the sync.

19:38 < petertodd> gmaxwell: I'd ACK simply removing checkpoints entirely; I'm not happy to see them replaced with another scheme to skip sig checking

19:38 < wumpus> a block-height-to-relative-difficulty map would have much less of a stigma

19:38 < wumpus> eh, verification difficulty that is

19:38 < sipa> gmaxwell: really?

19:38 < gmaxwell> petertodd: I think we could remove CP from reason two without implementing the replacement.

19:39 < gmaxwell> petertodd: morcos is right that needs a design proposal outside of the meeting.

19:39 < sdaftuar> i'm a bit confused about how to think about checkpoints for signature skipping

19:39 < gmaxwell> sipa: I benchmarked before but I'm going off of memory, I could be wildly wrong. I will test again if there is interest.

19:39 < jonasschnelli> Removing checkpoints would slow down (maybe insignificant) a scan in a possible SPV hybrid mode?

19:39 < gmaxwell> For reason (1) the only answer I have is that I think we should propose a bit to perpetually increase the minimum difficulty from 1 to something else.

19:40 < sdaftuar> for instance the recent ISM change caused us to do less validation for certain blocks in our history (blocks in a softfork between the 75% and 95% thresholds)

19:40 < sipa> jonasschnelli: SPV mode won't validate *anything* at all

19:40 < gmaxwell> (with a checkpoint like bypass of that new rule, for existing blocks that break it) As little as 100,000 would eliminate the header flooding vulnerability.

19:40 < jonasschnelli> Yes. But assume we would add an SPV hybrid mode in order to receive payments during IBD

19:40 < jonasschnelli> One would need to download 400k headers without a checkpoint at h400k

19:40 < luke-jr> maybe checkpoints should just be disabled by default before complete removal?

19:41 < sipa> jonasschnelli: i think you're confused

19:41 < gmaxwell> for Sipa's (3) reason for 'checkpoints' I don't give a darn, use chicken bones for progress estimation for all I care. :P it's historical accident that checkpoints and progress use the same data structure.

19:41 < morcos> gmaxwell: :) +1

19:41 < wumpus> gmaxwell: yes, my point too

19:41 < sipa> gmaxwell: agree, those could be completely separated

19:41 < petertodd> gmaxwell: ACK chicken bones

19:41 < gmaxwell> Might as well fit a cubic spline to the height vs txn count... and store the parameters.

19:41 < wumpus> right

19:42 < petertodd> gmaxwell: heh, if we do that with floating point math that has the advantage that it _can't_ be used for consensus :)

19:42 < * sipa> now remembers a song our student organization wrote to the melody of staying alive, called 'cubic spline'

19:42 < gmaxwell> so my proposal, if there is interest, is that I'll measure the performance impact of removing the signature skipping entirely (esp post checkqueue). And if it's not awful, we'll remove.

19:42 < wumpus> +1

19:42 < sipa> gmaxwell: i'm unconvinced

19:42 < wumpus> it doesn't hurt to benchmark

19:43 < gmaxwell> and maybe I'll tender a proposal to up the minimum difficulty, but I'd like to know what people think about that.

19:43 < wumpus> measuring is always better than making assumptions

19:43 < sipa> with a replacement for sig skipping that isn't based on checkpoints we could significantly improve things

19:43 < petertodd> sipa: I don't think such a replacement can exist without changing the security assumptions; I'd *rather* have checkpoints than trusting hashing power for that

19:43 < sipa> the last checkpoint currently is very old for the very reason that we've been planning to replace it

19:43 < gmaxwell> sipa: would you like to help work on a proposal for that? it has been controversial in the past. I'd like to do something good, because otherwise imprudent attempts will be adopted instead.

19:44 < sipa> so it's unfair to use the "the last checkpoint is old" as a given; it's something we've affected indirectly

19:44 < petertodd> sipa: though what checkpoints should do is say "Something big has changed; you can disable checkpoints with --no-checkpoints, but you should find out what this means before doing so."

19:44 < gmaxwell> (for example Bitcoin Classic's current behavior simply looks at block header timestamps and ignores signatures when they're more than 24 hours (*par) old by the local clock. It's easily exploited and makes me sad.

19:44 < sipa> petertodd: it's my opinion that on a timescale of months, it doesn't matter

19:44 < sipa> IF you can guarantee it's actually a timescale of months

19:44 < wumpus> yes that makes me sad too

19:44 < petertodd> sipa: on a timescale of months, checkpoints shouldn't matter either...

19:45 < wumpus> anything based on time seems very brittle

19:45 < sipa> petertodd: look at the current hashrate; what's 3 months worth of chain work at that hashrate

19:45 < petertodd> wumpus: and anything based on work isn't much better if you're running an old client, and mining has advanced significantly

19:45 < jonasschnelli> sipa: I (hope) I'm not confused. If we would add a SPV hybrid mode that directly fetches blocks at the tip (in order to receive payments), no available checkpoint would result in downloading all headers *losing* maybe 3-4mins before you can start using SPV... minor issue though, I agree

19:45 < petertodd> sipa: that assumes you know what the current hashrate is

19:45 < gmaxwell> wumpus: the prior proposals were based on work, e.g. skip if the best chain you see dominates the next conflicted chain at that height by N months of work.

19:45 < Chris_Stewart_5> gmaxwell: How have we solved the problem that checkpoints were originally created for? You have an excerpt in here: https://en.bitcoin.it/wiki/Bitcoin_Core_0.11_(ch_5):_Initial_Block_Download#Checkpoints

19:45 < petertodd> sipa: your node might be surrounded by sybils

19:45 < gmaxwell> wumpus: with a 'minimum total work' coded in as part of the release process.

19:46 < sipa> Chris_Stewart_5: headers first sync

19:46 < sipa> Chris_Stewart_5: 0.10

19:46 < gmaxwell> Chris_Stewart_5: headers first sync.

19:46 < wumpus> gmaxwell: right. Well, at the least it should be measured whether such a change is really worth it.

19:46 < sipa> petertodd: yes, i know...

19:46 < sipa> so, let's measure.

19:46 < sipa> and discuss later

19:46 < gmaxwell> Chris_Stewart_5: and the signature skipping behavior in checkpoints was actually a result of a bug fixed years ago.. mlock being used on all allocations making script validation INSANELY slow.

19:46 < wumpus> so much of the verification overhead is looking up UTXOs

19:46 < gmaxwell> sipa: okay.

19:46 < wumpus> something you'll not avoid

19:47 < gmaxwell> Chris_Stewart_5: but then with chain growth we became dependent on it to keep sync times reasonable. but libsecp256k1 made signature validation >5x faster.

19:47 < wumpus> especially for recent blocks

19:47 < wumpus> if you do any benchmarking please look at the recent blocks, not the first N

19:48 < gmaxwell> wumpus: it's still a major speed up on existing blocks.

19:48 < sipa> on a side note: i've already updated my logging to measure bandwidth vs blockdepth instead of just count.

19:48 < Chris_Stewart_5> So header sync solves the attack of flooding disk space, but not having your entire network hijacked, correct?

19:48 < wumpus> Chris_Stewart_5: huh?

19:48 < wumpus> gmaxwell: sure, could be

19:48 < gmaxwell> Chris_Stewart_5: isolation can be resolved by simply knowing what the total work of the best chain was at release.

19:49 < gmaxwell> Chris_Stewart_5: sorry, this was discussed prior times removing checkpoints had come up, I haven't completely described the background.

19:49 < Chris_Stewart_5> gmaxwell: Thanks for the explanation, i'll keep digging.

19:49 < wumpus> Chris_Stewart_5: ah, you mean being isolated and being fed a wrong chain, sorry I was imagining some wacky things at having your network hijacked :)

19:50 < wumpus> ok, next topic?

19:50 < gmaxwell> wumpus: just the "you got a faithful bitcoin core download but the attacker controls your network"... but that doesn't need a checkpoint to fix, a simple partitioning detection that knows the total work of the best chain at release time is sufficient.

19:50 < gmaxwell> Thanks for the discussion.

19:51 < wumpus> #topic segwit against uncompressed keys or not

19:51 < wumpus> (10 minutes to go)

19:51 < wumpus> (9 minutes to go)

19:51 < petertodd> so to be clear, *just* segwit right?

19:51 < CodeShark> does anyone still use uncompressed keys?

19:51 < wumpus> yes, only segwit

19:51 < achow101> CodeShark: armory does

19:51 < luke-jr> seems uncontroversial

19:51 < petertodd> I'm happy to ACK that given just segwit

19:51 < achow101> having segwit enforce uncompressed keys would delay segwit adoption for armory users

19:52 < achow101> *compressed

19:52 < jl2012_> it's in #8499

19:52 < luke-jr> achow101: why? just compress them

19:52 < wumpus> gmaxwell: yes, though we had a lot of trouble with partitioning detection, I remember some code being stripped out and such. But anyhow, yes that's the better approach if it can be gotten to work.

19:52 < sipa> achow101: sigh, does armory still not do that?

19:52 < achow101> luke-jr: we have to change the whole wallet structure (it's still going to happen anyways)

19:52 < wumpus> gmaxwell: without too many false positives

19:52 < luke-jr> achow101: why?

19:52 < sipa> achow101: alan said somewhere in 2013 he was implementing it...

19:52 < achow101> alan's gone now..

19:52 < luke-jr> afaik the only downside to using compressed keys is it changes the address, which segwit is changing anyway

19:52 < CodeShark> it's not a very complicated change

19:52 < wumpus> armory still uses uncompressed keys?!

19:53 < luke-jr> there's no reason you'd need to change the wallet structure I can see

19:53 < wumpus> in any case this only applies to segwit, not to old transactions

19:53 < achow101> the plan is to have a new wallet structure with bip32 that supports segwit and compressed keys

19:53 < gmaxwell> wumpus: "you're partitioned until you see a header chain with at least work X" is a pretty simple criterion. :P
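
The criterion gmaxwell states above can be sketched roughly as follows. This is only an illustration, not Bitcoin Core's implementation: the function names are made up, and minimum_chain_work stands for a hypothetical value fixed at release time. The per-header work formula, 2^256 divided by (target + 1), and the compact nBits encoding are the standard ones; the sign bit of the compact encoding is ignored here for brevity.

```python
def target_from_compact(bits: int) -> int:
    """Expand the 32-bit compact 'nBits' field of a header into the full target."""
    exponent = bits >> 24
    mantissa = bits & 0x007FFFFF  # sign bit (0x00800000) ignored in this sketch
    if exponent <= 3:
        return mantissa >> (8 * (3 - exponent))
    return mantissa << (8 * (exponent - 3))


def header_work(bits: int) -> int:
    """Expected number of hashes needed to produce a header meeting this target."""
    return (1 << 256) // (target_from_compact(bits) + 1)


def is_partitioned(header_bits: list[int], minimum_chain_work: int) -> bool:
    """Treat ourselves as partitioned until the best header chain we have seen
    carries at least minimum_chain_work (hypothetical release-time constant)."""
    total_work = sum(header_work(bits) for bits in header_bits)
    return total_work < minimum_chain_work
```

For example, a header at the minimum difficulty (nBits 0x1d00ffff) contributes 0x100010001 units of work, so a chain of three such headers is still "partitioned" against a threshold of four times that value, and a chain of four is not.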

19:53 < sipa> luke-jr: it had fixed size records in its wallet format for pubkeys

19:54 < sipa> achow101: well if a new wallet format is needed for segwit anyway, it doesn't matter right?

19:54 < gmaxwell> achow101: oh god please do not use uncompressed keys with segwit. why would you do that?

19:54 < luke-jr> sipa: zero-pad it?

19:54 < achow101> sipa: well no, we don't need a new wallet for segwit as it could still work with the old one with a little bit of hacking

19:54 < achow101> that was the original plan

19:54 < luke-jr> achow101: no less than compressed could

19:55 < luke-jr> sipa: or store the uncompressed key, and compress it at address-generation/signing

19:55 < gmaxwell> achow101: why cant the same hack that indicates segwit is in use indicate compressed.. you just chop off some bytes of the key pretty much.

19:55 < sipa> btw, uncompressed keys account for 0.7% of used keys in successful sigs on the network (in the past 2 hours)

19:55 < gmaxwell> it could be done entirely inside the process that serializes the segwit scriptpubkey.
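
What "chop off some bytes of the key" amounts to, as a minimal sketch: an uncompressed secp256k1 public key is 0x04 followed by the 32-byte X and 32-byte Y coordinates, and the compressed form keeps only X, prefixed with 0x02 if Y is even or 0x03 if Y is odd. The function name is made up for this example, and it does not check that the point is actually on the curve.

```python
def compress_pubkey(pubkey: bytes) -> bytes:
    """Convert a 65-byte uncompressed public key to its 33-byte compressed form.

    Already-compressed keys are returned unchanged. No curve-membership check
    is performed; this only re-encodes the bytes.
    """
    if len(pubkey) == 33 and pubkey[0] in (0x02, 0x03):
        return pubkey
    if len(pubkey) != 65 or pubkey[0] != 0x04:
        raise ValueError("expected a 65-byte uncompressed public key")
    x = pubkey[1:33]
    y = pubkey[33:65]
    # Parity of Y is the parity of its last (least significant) byte.
    prefix = 0x03 if y[-1] & 1 else 0x02
    return bytes([prefix]) + x
```

A wallet that stores fixed-size uncompressed records could, under this approach, keep its storage format and apply the conversion only when deriving an address or serializing the script, as luke-jr and gmaxwell suggest above.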

19:55 < achow101> gmaxwell: idk. ask goatpig

19:56 < gmaxwell> achow101: okay

19:56 < * michagogo> pokes his head in belatedly

19:56 < CodeShark> I think we should encourage all wallets to use compressed keys - achow101, if you need help with this I'd be willing to help

19:56 < sipa> agree - we should help

19:56 < gmaxwell> yes, lots of people would be glad to help.

19:56 < sipa> instead of just yell

19:56 < gmaxwell> well I offered to help armory move off uncompressed keys to alan several times, including offering to pay to do it.

19:56 < gmaxwell> so please don't say anyone just yelled.

19:58 < CodeShark> I initially designed my account structures to only use compressed keys - but later added a compressed bit to support legacy stuff

19:59 < petertodd> CodeShark: what legacy stuff specifically? legacy armory users?

19:59 < wumpus> CodeShark: bah, it's kind of sad to hear that some things seem to be going back instead of forward :)

19:59 < CodeShark> yes, to support other wallets

19:59 < wumpus> it's time

19:59 < CodeShark> but I think we really do need to prod all wallets to move to compressed keys

20:00 < CodeShark> there's really no reason to continue to support uncompressed keys - other than perhaps some migration tools

20:00 < wumpus> #endmeeting

20:00 < gmaxwell> CodeShark: as pieter notes, virtually nothing is already.

20:00 < lightningbot> Meeting ended Thu Sep 29 20:00:15 2016 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)

20:00 < lightningbot> Minutes: http://www.erisian.com.au/meetbot/bitcoin-core-dev/2016/bitcoin-core-dev.2016-09-29-19.01.html

20:00 < lightningbot> Minutes (text): http://www.erisian.com.au/meetbot/bitcoin-core-dev/2016/bitcoin-core-dev.2016-09-29-19.01.txt

20:00 < lightningbot> Log: http://www.erisian.com.au/meetbot/bitcoin-core-dev/2016/bitcoin-core-dev.2016-09-29-19.01.log.html

20:00 < gmaxwell> 0.7% percent

20:00 < anchow101> Yes, help would be appreciated

20:01 < wumpus> well supporting it in consensus for the normal network keeps making sense, but segwit is just such a great opportunity to get rid of it

20:01 < petertodd> wumpus: yeah, I don't see how we can remove backwards compatibility for it w/o confiscating funds, but no reason to not remove support in new addresses

20:01 < wumpus> petertodd: indeed

20:01 < gmaxwell> yes, thats why its important to get rid of now. otherwise I wouldn't care if action were taken n months later.

20:01 < luke-jr> if anything, we should be discussing whether to make it a consensus rule rather than a policy ;)

20:02 < gmaxwell> luke-jr: I'd like to but many people feel that adding an additional consensus rule for segwit now wouldn't be prudent.

20:02 < gmaxwell> making it non-standard is sufficient in my view, such that we'd be able to make it a consensus rule later.

20:02 < btcdrak> achow101 seems to be having connection problems

20:02 < luke-jr> sure

20:02 < achow101_> btcdrak: just a little bit. switching computers

20:02 < CodeShark> gmaxwell: having to deal with the additional case complicates implementations

20:02 < luke-jr> just saying the rule shouldn't be controversial itself really

20:03 < CodeShark> not very much, but still

20:03 < petertodd> gmaxwell: well, so long as we loudly warn that this is intended to become unspendable later if you bypass the standardness

20:03 < morcos> gmaxwell: if so, we should be as clear about it being not allowed now as if we were to make it a consensus rule now.

20:03 < gmaxwell> for those thinking that we have to verify all the old stuff for all time, that might be true for bitcoin core, but in the future I could imagine some implementations just not bothering to verify old stuff.

20:04 < gmaxwell> morcos: petertodd: agreed for sure.

20:04 < sdaftuar> petertodd: note that it's standard to create an output with an uncompressed pubkey hash, as we can't detect the issue until a spend attempt (right?)

20:04 < jtimon> I agree with luke-jr on not seeing the controversy in it being consensus rule with the rest of segwit

20:04 < petertodd> gmaxwell: I don't mean verifying old stuff, I mean verifying new txs spending old coins

20:04 < petertodd> sdaftuar: yes, nothing we can do about that though

20:05 < gmaxwell> jtimon: well in the case of low-S which we also wanted to make a consensus rule, jl2012 discovered that there were corner conditions we wanted to think about more carefully before making it a consensus rule.

20:05 < sdaftuar> petertodd: right; just want to make sure we're all on the same page as i think communicating this widely/loudly is important

20:05 < petertodd> for example, we've made hybrid pubkeys non-standard, and given basically no-one's ever used them in production for anything I'd have no issues with making them unspendable in a soft-fork

20:05 < jtimon> gmaxwell: thanks

20:06 < gmaxwell> jtimon: I don't think the same applies to uncompressed keys, because the criteria there is even simpler. but the lowS reason is part of why we punted this collection of improvements to policy for now.

20:06 < jtimon> mhmm

20:06 < michagogo> I saw a movie that depicts a form of distributed/decentralized system, to avoid it getting shut down. Or in the words of the character that explains it, "everyone that logs on is a server". It's said to be "open source", but then that's explained as "anyone can edit the code, like Wikipedia".

20:07 < achow101_> so if anyone wants to help armory with segwit support, bip32, compressed keys, we accept PRs. All our work happens in the dev branch, not master

20:07 < michagogo> And the code is deployed when a majority of users approve it

20:08 < wumpus> michagogo: heh, open source in some weird twisted mirror world

20:08 < gmaxwell> achow101: is there a IRC channel where things are discussed? E.g. where should I ask goatpig about compressed pubkeys in segwit.

20:08 < sipa> michagogo: i believe they're mistakenly not describing a computer network, but politics.

20:08 < achow101_> #bitcoin-armory

20:08 < jtimon> michagogo: that is, when sybil decides so...

20:08 < luke-jr> achow101_: meh, should just collapse that into #bitcoin-dev :p

20:09 < michagogo> And it's completely vulnerable to Sybil attacks…

20:09 < michagogo> Gah, lagging

20:09 < michagogo> Yeah

20:10 < michagogo> And of course, when the last user logs off, it doesn't just stop working

20:10 < michagogo> The sybil attackers are able to watch it dramatically implode with special effects, "graphical corruption" type stuff

20:11 < michagogo> And there's the obligatory "they're blocking all our foreign IPs" and that kind of stuff, with no explanation of who "they" are

20:12 < jtimon> so was there a conclusion for the range service bits? nothing/top-288/everything?

20:12 < jtimon> what about the getrange message and "sharding"

20:12 < GitHub159> [bitcoin] laanwj opened pull request #8843: rpc: Handle `getinfo` client-side in bitcoin-cli w/ `-getinfo` (master...2016_09_getinfo_clientside) https://github.com/bitcoin/bitcoin/pull/8843

20:12 < wumpus> conclusion was to add one service bit: last-288-served

20:13 < wumpus> and maybe later one for last-1000-served

20:13 < jtimon> wumpus: I see, and leave the rest for later, thanks

20:13 < luke-jr> 1024 would be rounder. ☺

20:13 < wumpus> and a jackpot for whoever enabled both at once

20:13 < luke-jr> if you set both, does it mean last 288000? :P

20:14 < jtimon> would it be crazy to just have last-1024 without last-288 and just change pruning's default?

20:14 < wumpus> 288 is not just the default, it's the minimum

20:15 < wumpus> I'd be okay with changing the default not the minimum, but that'd keep some nodes completely useless

20:15 < wumpus> whereas by far most requests are in the last 288

20:15 < luke-jr> wumpus: useless for syncing*

20:15 < luke-jr> frankly, there are enough full-archive nodes out there that we really don't *need* to do anything right now, so meh :p

20:16 < sipa> wumpus: actually, not true.

20:16 < jtimon> well, the users know what to do to stop being useless...

20:16 < wumpus> which, as morcos remarked, preferentially downloading the last blocks from would take a lot of load off nodes that do keep more blocks

20:16 < sipa> there are more requests in 101-1000 deep then 2-100 deep

20:16 < wumpus> ok...

20:16 < sipa> *than

20:16 < wumpus> I misremembered apparently, never mind

20:16 < luke-jr> an unsyncable-from node is still more useful than a syncable node that isn't used for a wallet

20:17 < luke-jr> syncable-from*

20:17 < jtimon> maybe we can change the pruning minimum if that simplifies things?

20:17 < sipa> wumpus: well, sample of 1 long-term-running node over the course of a few weeks of data

20:17 < sipa> wumpus: more samples welcome

20:17 < wumpus> sipa: do you have a special patch for statistics collection?

20:18 < gmaxwell> sipa: need to filter out bitnodes.

20:18 < sipa> gmaxwell: right; how do you suggest to do that?

20:18 < wumpus> sipa: or a script for parsing logs?

20:18 < sipa> wumpus: both; i'll publish them after a little cleanup

20:18 < wumpus> I could put it up on a few nodes, no problem

20:19 < sipa> it just logs an extra line with depth and block size for each requested block

20:19 < wumpus> nice

20:19 < jtimon> I guess it's not completely crazy, but nobody seems to especially like it

20:19 < sipa> en then

20:19 < sipa> to inspect :)

20:20 < wumpus> jtimon: no, it's not completely crazy, using only one service bit is kind of elegant

20:21 < jtimon> :)

20:22 < wumpus> jtimon: if 1000 is really one-size-fits-all, and <1000-keeping nodes may as well be ignored. It's just hard to say without better statistics.

20:22 < wumpus> also statistics about what pruning sizes people prefer

20:22 < wumpus> I mean if everyone prefers the minimum and no one sets 1000 in practice

20:23 < gmaxwell> sipa: just do Satoshi:(recent) useragents.

20:23 < jtimon> well, independently of the statistics we will eventually need a more generic solution for flexible sharding, right?

20:23 < sipa> jtimon: maybe

20:24 < sipa> "need" is a big word imho

20:24 < sipa> but i agree it would be nice

20:24 < gmaxwell> jtimon: I think we do, would you like to finish the solution for that I started on?

20:24 < wumpus> jtimon: well there needs to be a different solution for historical block hosting IMO, but that's a different thing

20:25 < gmaxwell> sipa: I think expecting participants to keep around hundreds of gigs of blockchain is not conducive to the survival of the network, the alternative I see is a hardfork that drops off the history past some point. (e.g. just restarts the chain from a utxo commitment made a year before)

20:26 < sipa> gmaxwell: well, or just stop supporting historical block fetching more than 1 year or whatever number back on the p2p protocol, and use http

20:26 < wumpus> or bittorrent *ducks*

20:26 < jtimon> wumpus: yeah, historical hosting is what I mean

20:27 < jtimon> gmaxwell: maybe, but it sounded deterministic like luke-jr proposed instead of flexible like wumpus wanted

20:27 < wumpus> it could be anything that supports downloading ranges of data...

20:27 < brainwave> Under overview, balances, on the right side of available, pending, total, add ~ exchange rate for dollars, pounds, euro

20:28 < sipa> brainwave: bitcoin core does not and cannot know exchange rates

20:28 < sipa> (because it would require contacting a centralized service, which we don't do by design)

20:29 < wumpus> yes or someone would need to commit them to the chain, but that'd still be trusting a central issuer/signer of the information

20:29 < wumpus> it's just a no-go

20:35 < gmaxwell> well if the users of bitcoin accepted that kind of security model change, what I would suggest is something like every 26280 blocks the block is required to have a commitment to the utxo set (could be a linear hash) as of 2016 blocks prior. and then six months of work after that, that commitment becomes usable for initial sync. and so then no one need process more than a year of blocks at sync.

20:35 < gmaxwell> .. though you would have to store three copies of the utxo set (though perhaps deduplicated)
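
The arithmetic behind the periodic commitment idea above can be sketched as follows. This is only an illustration of the numbers gmaxwell mentions (a commitment every 26280 blocks, covering the UTXO set as of 2016 blocks earlier). Treating "six months of work" as another 26280 blocks is an assumption, and the function names are hypothetical, not Bitcoin Core code.

```python
COMMIT_INTERVAL = 26280  # a commitment is required every 26280 blocks
SNAPSHOT_LAG = 2016      # the commitment covers the UTXO set as of 2016 blocks prior
MATURITY = 26280         # assumption: "six months of work" read as 26280 more blocks


def latest_usable_commitment(tip_height):
    """Return (commit_height, snapshot_height) of the newest commitment that
    already has MATURITY blocks on top of it, or None if there is none yet."""
    k = (tip_height - MATURITY) // COMMIT_INTERVAL
    if k < 1:
        return None
    commit_height = k * COMMIT_INTERVAL
    return commit_height, commit_height - SNAPSHOT_LAG


def blocks_to_process(tip_height):
    """Blocks a syncing node replays on top of the newest usable snapshot."""
    usable = latest_usable_commitment(tip_height)
    if usable is None:
        return tip_height + 1  # no usable commitment yet: genesis through tip
    _, snapshot_height = usable
    return tip_height - snapshot_height
```

Under these assumptions the replayed range stays between 28296 and 54575 blocks once the first commitment has matured, which is roughly the "no more than a year" bound at about 144 blocks per day.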

20:36 < gmaxwell> jtimon: I don't know why anyone would find deterministic less desirable.

20:37 < sipa> gmaxwell: well i expect the controversy to not be about the change in security model, but about the perpetual requirement of having a utxo set

20:40 < wumpus> gmaxwell: I explained that: if you make it deterministic you have to be sure of the parameters in advance, there is no room for tweaking or optimizing later on

20:42 < gmaxwell> wumpus: well you simply extend the protocol to have a new signaling mechanism for the tweaked thing.

20:42 < wumpus> sipa: yes the bigger problem is the ever-growing UTXO set

20:42 < wumpus> gmaxwell: but then it loses backwards compatibility every time

20:43 < gmaxwell> something that just signals absolute heights has the problem that the communicated information will always be out of date. .. or if nodes don't change the ranges they host, we will end up with highly irregular distributions of information.

20:43 < sipa> the type of tweaking needed, and the potentially aging problem depend on the specific proposal

20:44 < sipa> i'm sure we can come up with something that seems reasonable to all

20:44 < wumpus> agree, there may be a compromise that is somewhat flexible and still deterministic

20:44 < gmaxwell> well what I suggested might not be viable after all too. I'm not sure, I wasn't successful in achieving all my goals at once.

20:45 < wumpus> I just don't think setting it all in stone in advance is a good idea, for the whole reason that it's so hard to achieve all your goals all at once

20:45 < wumpus> especially if you don't know some of those goals yet

20:45 < gmaxwell> I wanted a scheme that would result in a uniform distribution of blocks, that didn't depend on peers to look to see what other peers had (because that could be spoofed), required minimal communication (not a long list of blocks in an addr message).. and retained uniformity as the chain grew, without causing peers to redownload blocks they already forgot.

20:47 < gmaxwell> So I had found two schemes, one where peers had an ID and the amount of blocks they would store, and from that they could determine which they would store, and as new blocks came in they might store them and drop some group of old ones. The problem with it was that to figure out if a particular peer had block X you had to do computation linear in the number of blocks in the chain.

20:47 < wumpus> darn, also fingerprinting will be hard to avoid

20:47 < gmaxwell> Then I had another scheme that was sublinear work, BUT a peer might drop a block but later have to go fetch it again.

20:47 < gmaxwell> wumpus: thats unavoidable with any split up scheme.

20:48 < wumpus> yes

20:48 < sipa> make the IP address part of the seed

20:48 < sipa> if your dhcp changes, you have to resync, sorry.

20:48 < wumpus> unless a substantial part of nodes are the same

20:48 < gmaxwell> sipa: then when you change IP, you have to go download a different set of blocks.. :P hah

20:48 < wumpus> e.g. there are only 8 IDs, pick one

20:48 < gmaxwell> wumpus: well I was thinking 32 bits, but perhaps a smaller collection would be fine.

20:49 < gmaxwell> but that gives you at best only 1/8th splitting storage. :( maybe fine now, but not in the long term.

20:49 < wumpus> maybe the number of groups can grow over time, a doubling every so many blocks :)

20:50 < sipa> hah: if you get a request through an IP that doesn't correspond to your local storage, just proxy all requests through to another node which does, and use that to gradually resync for the new seed.

20:50 < gmaxwell> Part of why I haven't given this that much more thought is because I think bitcoin will need to move to the commit state and forget history model; the ever growing sync time is too big a tide to stand against.

20:50 < gmaxwell> sipa: lol!

20:50 < gmaxwell> sipa: I think thats actually how the freenet location swapping works, funny enough.

20:50 < wumpus> hehe

20:50 < sipa> downside: if you want this to be fingerprint resistant, you have no way to determine how many proxies your blocks actually went through

20:50 < sipa> => instant mixnet

20:51 < gmaxwell> sipa: freenet nodes change position over time, and they do it by swapping their location with a direct neighbor, when that location swap makes them both closer to where they want to be, ... when requests come in for the new location, they don't have the data, but it's only one hop away..

20:52 < wumpus> gmaxwell: I've always thought that, it's hard to imagine this continuing for 10's of years, but where to put the anchor...

20:52 < gmaxwell> in any case. if there were only 8 flavors of nodes, then it all becomes simple, block_height//1000 % 8 = flavor.
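
The formula above can be written out as a minimal sketch. The constants 1000 and 8 come straight from the line above; the function names are hypothetical and only illustrate how a peer could decide locally whether a node of a given flavor keeps a block.

```python
GROUP_SIZE = 1000  # consecutive blocks that share one flavor
NUM_FLAVORS = 8    # number of node flavors


def flavor_of(block_height):
    """Flavor responsible for a block: block_height // 1000 % 8."""
    return (block_height // GROUP_SIZE) % NUM_FLAVORS


def node_stores(node_flavor, block_height):
    """True if a node advertising node_flavor keeps the block at this height."""
    return flavor_of(block_height) == node_flavor
```

Because the mapping depends only on the height, checking whether a peer should have a block is constant work and needs no per-peer block list.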

20:53 < * gmaxwell> lunch

20:54 < wumpus> that seems kind of elegant and straightforward, there must be a catch

20:59 < jtimon> gmaxwell: sorry, well the deterministic seems to come at the cost of less flexibility

21:00 < sipa> wumpus: i'm trying to think about why 8 isn't enough

21:00 < wumpus> if you want to automatically scale the number of flavors up with height you could divide height 0..N into X flavors, then N..3N into 2*X flavors, and so on, where each flavor gets flavor (x<<1)+randbit()

21:01 < jtimon> only 8 flavors requires you to store 1/8 of the blockchain

21:01 < sipa> and we could have names for the first 8 top-level flavours or so... so your wallet could report "Looking for a bittersweet node..."

21:01 < wumpus> (well those numbers are arbitrary but the idea is that if a doubling of the # is needed, the new flavor, a member of a twice as big set, would contain the previous one)

21:02 < wumpus> hehe, yes assigning names would be nice
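
One possible reading of wumpus's doubling idea, as a rough sketch: heights 0..N use X flavors, N..3N use twice as many, 3N..7N four times as many, and a node derives its flavor for each later range by appending one random bit, (x<<1)+randbit(). The era boundaries, the interpretation of the second range as having 2*X flavors, and all names below are assumptions made for illustration.

```python
import random


def era_of(height, n):
    """Era k covers heights [(2**k - 1) * n, (2**(k + 1) - 1) * n)."""
    return (height // n + 1).bit_length() - 1


def flavors_in_era(era, x):
    """Number of flavors doubles with each era, starting from x."""
    return x << era


def refine(flavor, rng):
    """Flavor in the next era: the old flavor with one random bit appended."""
    return (flavor << 1) + rng.getrandbits(1)


def flavor_chain(base_flavor, eras, rng):
    """A node's flavor in each of the first `eras` eras."""
    chain = [base_flavor]
    for _ in range(eras - 1):
        chain.append(refine(chain[-1], rng))
    return chain
```

Each refined flavor maps back to its parent with a right shift, which is the sense in which the new flavor "contains" the previous one. Since each era is twice as long and has twice as many flavors, a node's share per era stays at about N/X blocks under this reading.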

21:46 < GitHub131> [bitcoin] jnewbery opened pull request #8844: change sigops cost to sigops weight (master...sigops_weight) https://github.com/bitcoin/bitcoin/pull/8844

21:49 < GitHub72> [bitcoin] jnewbery opened pull request #8845: Don't return the address of a P2SH of a P2SH (master...trivial-P2SH-P2SH) https://github.com/bitcoin/bitcoin/pull/8845

21:51 < gmaxwell> jtimon: "less flexible" -- everything is less flexible short of sending someone arbitrary x86 bytecode that they run.

21:53 < jtimon> less flexible in the amount of data you store, but maybe 8 flavors can be subdivided in 16 flavors half the size as wumpus was suggesting, then 16 to 32, etc. That may be flexible enough

21:54 < gmaxwell> jtimon: I was recommending 2^32 'flavors' but wumpus was concerned about reducing fingerprinting.

21:55 < gmaxwell> the whole reason to reduce the amount was to make it more difficult to follow a node around as it changes network identity.

21:55 < gmaxwell> sipa: 8 isn't enough if the chain is perpetually growing.

21:56 < jtimon> I see

21:56 < sipa> yeah, increasing number, the further back you go, may make sense

21:56 < gmaxwell> a year from now the chain will be 200 gb, a year after 300 gb-- at that size it is larger than the most common ssd size currently. a year after that 400gb.... and at that point an 8 way split is again running common hosts out of disk even if the common ssd size has moved up to 500gb by then.

21:56 < jtimon> well, maybe archive nodes that don't want to store everything have to get a privacy hit

21:57 < gmaxwell> who will bother running one if it takes special effort above and beyond running a node, and draws more resources?

21:57 < sipa> well if only we'd have a separate network for archival

21:57 < sipa> there are no privacy issues at all then

22:06 < gmaxwell> and no one run them.

22:06 < gmaxwell> s/run/running/

22:07 < sipa> i was about to say that separate network doesn't need to imply separate nodes

22:07 < sipa> but of course, that doesn't work because you'd get a privacy leak from correlating

22:08 < sipa> however, you can reconcile those by only having nodes with a long-term IP provide archival further back than some threshold

22:10 < gmaxwell> sipa: not just that, but if it's a special very resource intensive mode.. few will do it, piling more resources onto it... causing fewer to do it...

22:10 < sipa> it's true that it's resource intensive, but it's a different kind of resources than most of the rest of running a node

22:10 < sipa> it needs disk space and bandwidth

22:10 < gmaxwell> I might think it's not over the threshold of that, except already people don't run regular nodes due to costs.

22:11 < sipa> rather than memory and cpu

22:11 < gmaxwell> which are what people usually complain about.

22:11 < sipa> then why aren't we seeing more pruned nodes?

22:11 < sipa> one reason may be that pruned nodes don't advertize, so we just don't know about them

22:12 < gmaxwell> because you have to edit a config file or change an obscure setting, we don't advertise it, and it breaks rescan and reindex. (which is part of why we don't really advertise it)

22:14 < sipa> well people mostly complain about the sync time for a node

22:15 < gmaxwell> yes, though I think that's mostly because so many stop there and give up before they get a chance to complain about the rest.

22:15 < sipa> perhaps

22:18 < TD-Linux> a first-run dialog box with a slider for disk usage and an estimated sync time would be very nice

22:18 < sipa> except the sync time does not depend on the value of the slider

22:19 < TD-Linux> yes, I meant it there so it'd appear at start. I guess having it in the status bar is sufficient

22:20 < sipa> ah

22:20 < sipa> well there will be an overlay with sync time indication in 0.14

22:21 < gmaxwell> doesn't it still incorrectly say you can't transact while syncing?

22:21 < sipa> we still have a lot of time until 0.14

22:23 < gmaxwell> :)

22:23 < wumpus> well you can, but most people probably shouldn't do so

22:23 < gmaxwell> yes they should

22:23 < wumpus> during the initial sync they won't have any coins to send anyway, and receiving them is a bad idea as they'll only see them when the entire thing is done

22:23 < wumpus> oh?

22:24 < wumpus> why?

22:24 < gmaxwell> initial sync isn't my concern there:

22:24 < gmaxwell> probably one of the most common usage patterns for a wallet user is that you start your wallet up in order to pay someone, and it's three weeks behind. You can go ahead and pay, no problems.. why wouldn't you?

22:24 < gmaxwell> during initial sync you just won't have any coins, indeed. :)

22:24 < wumpus> the biggest problem is people giving out addresses during initial sync

22:24 < wumpus> then realizing how long it takes

22:25 < wumpus> this is what the overlay is designed to prevent

22:25 < wumpus> sure, you can send coins if you're three weeks behind, no problem, although fee computation could be off

22:25 < gmaxwell> yes, that is a large source of complaints, but we shouldn't tell people that they can't send funds already in their wallet when they start up and they're a bit behind, it's already a common mistaken belief that they can't (and then they complain about how long it takes to catch up a month of blocks)

22:25 < TD-Linux> the warning could be conditional on having zero funds

22:26 < gmaxwell> TD-Linux: the earlier warning text was fine-- saying that you won't see payments to you yet, but for some reason it was changed to say that you cannot send funds.

22:26 < wumpus> yeah fix one thing and they'll start complaining about another, it's a never ending source of fun...

22:26 < sipa> i don't think anyone will read the text anyway

22:26 < gmaxwell> I also complained that the text is now too long and won't get read.

22:26 < wumpus> of course people will read it

22:26 < sipa> the important thing is that it's in the way, and gives accurate (by then, hopefully) information

22:26 < wumpus> heck, users aren't stupid

22:26 < gmaxwell> The first text was better.

22:27 < sipa> gmaxwell: PR welcome

22:27 < wumpus> maybe some are, but not all of them, some will actually read and understand

22:27 < gmaxwell> Well I'm stupid, and looked at the notice in its updated state and didn't read the list line.

22:27 < gmaxwell> first*

22:27 < gmaxwell> because when there is too much text many people go a bit banner blind and skim past headings and such.

22:27 < wumpus> if we don't believe people actually pay attention then why do anything at all

22:28 < gmaxwell> saying that a wall of text is too much is not saying that people don't pay attention.

22:28 < wumpus> I think it's an improvement to what was there, indeed, if you want to improve further then pulls are welcome

22:28 < sipa> right, that's what i'm saying - having there being an overlay at all is more important than what the text says

22:28 < gmaxwell> and re: being able to send, people already complain that they have to wait a long time after starting to send because they already frequently mistakenly believe they can't.

22:28 < sipa> and we have time to improve the latter

22:29 < wumpus> but I'm a bit tired of people always saying "users won't read anyway" to everything that adds documentation , help or warnings

22:29 < wumpus> a lot of users are definitely looking for more help and guidance when they first open the program, and a bit of text helps there

22:29 < gmaxwell> wumpus: why should I waste my time when I point out that THE TEXT IS OUTRIGHT UNTRUE and your response is to accuse me of thinking users are stupid? my comment was that the earlier version of the text which was simple and NOT UNTRUE was better.

22:29 < sipa> please guys

22:30 < sipa> gmaxwell: go propose something

22:30 < gmaxwell> I did!

22:30 < wumpus> gmaxwell: well if the text is wrong then it should be fixed obviously, change it to a better text

22:30 < wumpus> I don't know what the previous version of the text was

22:31 < sipa> it's been changed a dozen times in the lifetime of the pull

22:33 < sipa> also, it says "Spending bitcoins may not be possible until synchronization has finished."

22:33 < sipa> which is not untrue.

22:34 < gmaxwell> okay, it was changed after I last saw it.

22:35 < wumpus> ok that was useless :)

22:35 < gmaxwell> by saying 'may' which is still misleading, but worse, that text is the bold.

22:35 < gmaxwell> er is the only bold part.

22:35 < sipa> well, improvements welcome

22:35 < gmaxwell> So now it says "mumble mumble mumble Spending bitcoins may not be possible during that phase!" :-/

22:36 < gmaxwell> it's a waste of my time, I already raised these issues and it was then merged.

22:36 < wumpus> it had to be merged at some point, with the idea it could be improved later

22:36 < gmaxwell> well to be fair the last change did improve it, its true.

22:37 < gmaxwell> but it created the problem that if you skim it, all you extract is that you can't spend, .. which misses the really critical thing: which is that your wallet may look empty when it isn't.

22:37 < wumpus> that doesn't mean it's final, most will only see the message when it is merged, and can improve it then, there are already some pulls open to improve that overlay

22:38 < gmaxwell> but okay, I can open a PR.

22:38 < wumpus> (but I don't think they change that message)

22:39 < sipa> gmaxwell: i think people didn't really understand the point of your concern (i didn't): if you're looking at it from a point of view that this would be mostly seen (and intended to convey information) during IBD, it's perfectly reasonable to warn users they won't be able to spend the money they're still to receive... and a simplification to reduce the length of the text may be warranted

22:39 < sipa> it's a good point that it's also seen during non-IBD

22:39 < gmaxwell> It will mostly not be seen during IBD.

22:39 < gmaxwell> during IBD sure someone will see it then, say of course I knew that (even if they didn't) minimize and go on with life. :P

22:40 < gmaxwell> but then users will see it every single time they start.

22:40 < sipa> i'm aware, you don't need to argue about this

22:40 < sipa> i'm just explaining why maybe you felt misunderstood

22:40 < gmaxwell> sorry, not arguing-- clarifying.

22:40 < gmaxwell> Yes, I see that and I didn't before.

22:41 < gmaxwell> when I first saw this PR I even took the time to go through the code carefully to check to see if there was anything that made it IBD only.

22:42 < gmaxwell> because I couldn't understand why people wanted the text that it had.

22:42 < gmaxwell> it did not occur to me that other people might be only thinking about IBD.

22:42 < gmaxwell> sorry for being thoughtless there.

22:42 < wumpus> #8805 fixed a few minor grammar nits, #8821 fixes a blocking problem with the overlay, there are no pulls yet that improve the message

22:43 < sipa> sorry, we (including me) aren't being careful with terminology here... IBD is also used for syncup when you were previously synced to a month ago

22:43 < wumpus> it's very easy to forget about catching up nodes

22:44 < wumpus> but yes we shouldn't

22:44 < sipa> well it's mostly designed to help with that first sync

22:45 < gmaxwell> more obvious to me just by chance of hearing more people complain about it, also I've stopped running a node 24/7 on my laptop because I've been watching the battlestar galactica series in evenings and bitcoin interrupts video playback. :)

22:45 < wumpus> during catch-up it's reasonably useful too, people may not know they won't see transactions newer than their sync point and worry, but yes it's mostly important for the initial IBD (lol)

22:45 < gmaxwell> so every time I go to use bitcoin I'm stuck waiting for it to catch up.

22:45 < gmaxwell> yes, we should have this message during catch up. But it's important to not make people think they can't spend funds that they can see.

22:46 < gmaxwell> The important message is that you may not see all payments to you yet (and you can't spend what you can't see).

22:46 < wumpus> bitcoin interrupts video playback? even in steady state mode?

22:46 < TD-Linux> one option would be to put it on the payment request generation page instead. but even in its current state it's far better than what was there before (nothing)

22:47 < gmaxwell> wumpus: yes. on my laptop... playback from a local file. The issue is IO or cpu related, probably the former but I haven't tested extensively to know for sure.

22:48 < wumpus> that's very strange. I'd expect that during initial sync when it maxes out CPU and I/O usage, but not when it's up to date

22:48 < gmaxwell> TD-Linux: the big problem we should be solving here is that people see a balance of zero then delete the wallet. I think thats the priority because any other issue doesn't cause irrecoverable loss.

22:48 < gmaxwell> wumpus: I notice it during ordinary computer use.. causes IO hangs, but its not irritating except when watching video.

22:49 < wumpus> do you have a lot of mlocked memory? is it swapping?

22:50 < gmaxwell> no, not swapping 8gb ram. I think that when a bunch of random writes happen it causes long delays for garbage collection in the SSD.

22:50 < wumpus> swapping seems to be the foremost cause of I/O related hangs here, as essentially the memory subsystem has to wait for I/O to complete

22:50 < wumpus> heh as if 8gb ram means no swapping these days :)

22:50 < gmaxwell> well on my laptop its enough most of the time.

22:51 < gmaxwell> The stalls seemed to get better for a while after I freed up a bunch of space and trimmed the drive, but got worse after which is why I think SSD GC plays a role.

22:51 < gmaxwell> but in any case, while watching the show every block arrival causes a second-long pause in playback.

22:51 < wumpus> maybe someone is requesting a lot of blocks from you with a bloom filter? :-) it would be interesting to find out what your node is actually doing at those times

22:52 < gmaxwell> nah, outbound only.

22:52 < gmaxwell> I know its at the same time as blocks showing up.

22:52 < TD-Linux> gmaxwell, easy way to verify that would be to increase your video player's lookahead cache

22:52 < wumpus> ok so it's block verification, leveldb seeks

22:52 < gmaxwell> TD-Linux: think mpv uses non-blocking reads of the disk?

22:53 < TD-Linux> gmaxwell, yup it does. I've increased the setting to 10s when using sshfs and it works fine

22:54 < gmaxwell> in any case, performance distraction aside, when this happens I shut down bitcoind then it may stay off for a week before I need to do something with it, then waiting for it to catch up is irritating.

22:55 < gmaxwell> (and of course my system's performance is seriously impacted while it catches up)

22:55 < wumpus> yes, nothing to do about that, I guess if hybrid SPV mode is implemented it could also work during catch-up

22:55 < wumpus> indeed, it's either slow down the catch up or tolerate it hogging the whole system

22:58 < gmaxwell> I have wondered if it might be useful to split the chainstate into two parts, one with txouts created in the most recent N blocks, and one with the rest. Then on start we could just load the whole first one into the cache.

22:58 < wumpus> the default setting of hogging all cores during IBD/catch-up is a bit rude, certainly if it is a background process

22:59 < gmaxwell> if we did that much of the cost would then be signature validation instead of random IO, and signature validation could run in the background, following behind the blocks.. and at lower priority.

22:59 < wumpus> so that would be like a 'prefer keeping recent UTXOs' cache policy?
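
A toy model of the split being discussed, under stated assumptions: outputs created within the last N blocks live in an in-memory map, and everything older moves to a second store (a plain dict here, standing in for the on-disk database). The class and its methods are hypothetical and are not how Bitcoin Core's chainstate or cache is implemented.

```python
class SplitChainstate:
    """Keeps txouts from the most recent `recent_window` blocks separate."""

    def __init__(self, recent_window):
        self.recent_window = recent_window
        self.recent = {}  # outpoint -> (creation_height, txout), kept in memory
        self.old = {}     # outpoint -> txout, stand-in for the on-disk part

    def connect_block(self, height, created, spent):
        # Add this block's new outputs first, so that outputs created and
        # spent within the same block are found in the recent part.
        for outpoint, txout in created.items():
            self.recent[outpoint] = (height, txout)
        for outpoint in spent:
            if outpoint in self.recent:
                del self.recent[outpoint]
            else:
                del self.old[outpoint]  # KeyError means an unknown output
        # Move outputs that fell out of the recent window to the old part.
        cutoff = height - self.recent_window
        aged = [o for o, (h, _) in self.recent.items() if h <= cutoff]
        for outpoint in aged:
            self.old[outpoint] = self.recent.pop(outpoint)[1]
```

With a layout like this, only `recent` would need to be loaded at startup; how much it helps depends on how often spends actually hit recent outputs.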

23:00 < gmaxwell> I guess thats a stat that I still haven't collected. "what is the average age-in-blocks of inputs that are consumed" (/what is the distribution of that age)

23:00 < gmaxwell> we know recent ones are spent more often but I don't have good numbers on it.
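
The statistic gmaxwell describes could be collected along these lines. This is a small sketch that assumes the (spend height, creation height) pairs for consumed inputs have already been extracted by some other means; the function names are hypothetical.

```python
from collections import Counter


def input_age_histogram(spends):
    """Map age-in-blocks to the number of consumed inputs with that age.

    `spends` is an iterable of (spend_height, creation_height) pairs."""
    return Counter(spend - created for spend, created in spends)


def mean_input_age(spends):
    """Average age-in-blocks of consumed inputs."""
    ages = [spend - created for spend, created in spends]
    return sum(ages) / len(ages)
```

The histogram gives the distribution and the mean gives the single average figure; neither says anything about real chain data until it is run on actual spends.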

23:00 < gmaxwell> wumpus: yes.

23:02 < gmaxwell> probably not the highest priority improvement in any case.

23:02 < wumpus> well, currently the whole cache is emptied at a write, I think there are many eviction policies that would do better

23:03 < gmaxwell> right, also the in memory representation of the cache entries is quite inefficient.

23:04 < gmaxwell> so its effective size could potentially be doubled if its entries were flat allocated.

23:04 < wumpus> though that helps it actually being an efficient cache, a more efficient representation shouldn't come at a higher access cost

23:06 < wumpus> I did an experiment once with storing the UTXOs in serialized form in memory: https://github.com/laanwj/bitcoin/tree/2016_04_dummy_db

23:09 < gmaxwell> interesting!

23:09 < wumpus> although that's for the entire UTXO set, not just a limited cache

23:10 < gmaxwell> yea, the on disk serialization is most efficient, but not fast.

23:11 < gmaxwell> I wouldn't be surprised though if, for the current in memory representation, more bytes were spent in malloc/container overhead than on actual transaction data.

23:11 < wumpus> but given that most time is wasted on disk seeks anyway, it may not make too much difference in practice

23:11 < wumpus> depends on the system...

23:12 < wumpus> yes, the malloc overhead is somewhat bad

23:14 < wumpus> but not more than the actual data size, from what I remember

23:15 < wumpus> in any case improvements are certainly possible there, without any rocket science, it's just that it's such risky code to change

23:15 < wumpus> if it was any other project people would have optimized the shit out of it by now

23:21 < wumpus> unfortunately the damages of a bug there are unfathomable, not just skipping a few video frames

23:22 < gmaxwell> well the hope I had there was that with making the cache more efficient, it could be increased in size and avoid more disk IO. :)

23:24 < gmaxwell> on the earlier subject of 'alt' implementations doing inadvisable things, some of them have a genius performance optimization which they're crowing about, -- where they only validate transactions that weren't already in the mempool; something we explicitly decided not to do because of the long history of subtle mempool corruption issues.

23:25 < wumpus> exactly, there are tons of ways to optimize and get things just slightly wrong

23:25 < gmaxwell> I worry that there will be a race to the bottom, where by making risky / security reducing optimizations implementations will gain significant performance advantages, and suffer no cost until the inevitable spectacular failure that results.

23:26 < wumpus> which doesn't matter if no one runs your code anyway, but we have to be really careful

23:26 < gmaxwell> and being safe doesn't matter if peopel don't run it in favor of things that are faster.

23:26 < gmaxwell> people*

23:26 < wumpus> we also shouldn't overestimate how important the performance is to most users, many just run it on a server or otherwise unused computer

23:27 < wumpus> well you'd say safeness is really important, if the inevitable spectacular failure happens you don't want to be at the center of it

23:28 < gmaxwell> well sure. indeed.

23:28 < wumpus> better slow than dead :)

23:29 < gmaxwell> but for things like the security model change to use a 6 month old utxo commitment instead of syncing the history... the potential for a spectacular failure there which a more conservative approach could have stopped is negligible.

23:29 < TD-Linux> gmaxwell, still think you should verify that it's actually IO that's the problem before going too deep :)

23:29 < gmaxwell> And if we don't investigate things like that, someone will do something dumber.

23:30 < wumpus> TD-Linux: yes, measuring is better than assumptions :)

23:30 < gmaxwell> TD-Linux: Oh IO is an issue for sure regardless of whats causing my mpv stalls. I'll find out tonight (I'm not going to sit here and watch video for an hour just to check)

23:31 < gmaxwell> TD-Linux: SSD vs a fast spinning disk with small dbcache here is a <4 hour sync (from a local peer) vs a >9hour sync.

23:32 < wumpus> and sure, it's good to investigate things like that

23:33 < wumpus> I/O is absolutely a problem sometimes, leveldb generates *tons* of seeks and small reads, better caching would help avoid some of that

23:34 < wumpus> I had no luck with other databases, I remember trying with lmdb at some point, which was faster in reading, but instead... does tons of seeks and small writes at write, so it just moves the problem

23:35 < TD-Linux> yeah, on a SSD at least, the former is much less detrimental to system latency

23:35 < wumpus> indeed!

23:35 < gmaxwell> for operation at the tip, using the mempool instead of validating would be a big aid... but the safety of that remains dubious. :)

23:36 < sipa> gmaxwell: i believe for the mempool it's approximately a factor 2 overhead

23:36 < sipa> gmaxwell: but that is also indexes, orderings, accounting, ...

23:38 < wumpus> yes that remains dubious, especially with regard to isFinal and such

23:38 < wumpus> I mean there is a part of transaction validation that can be obviously cached, and a part that may change in time

23:39 < gmaxwell> fortunately the MTP change made that much safer.

23:41 < gmaxwell> e.g. before a block could come in with a time before your local time, and contain txn which are isfinal invalid according to the block but okay with respect to the local time, and you'd accept it. I wonder if the alt implementations had that bug.

23:48 < wumpus> I remember that one, tricky... and there may be more problems of that kind not yet found

23:49 < gmaxwell> looks like their change dodged it, because the finality test is in the ContextualCheckBlock, and the bypass patch only bypasses checkinputs...

23:49 < gmaxwell> (which also means that it doesn't manage to avoid accessing the utxo cache entries)

23:51 < wumpus> I just realized, if the problem is that the block validation hiccups other things happening on your PC, the solution may be actually to slow it down :)

23:52 < wumpus> put a small sleep between each UTXO lookup, limit the validation to one thread

23:52 < wumpus> not something you'd want to do during initial sync if you're waiting for it, but if you don't care and it runs in the background...

23:54 < wumpus> after all you run it to keep up, you don't need to outrace it

23:56 < gmaxwell> I've thought before that if we have bandwidth limiting enabled we should delay announcement of new blocks to reduce the number of peers that request them from us... but slowing down the validation would work as well.

23:56 < gmaxwell> small sleeps perhaps aren't so good because it may busy spin. :P

23:57 < wumpus> heh, not that small

23:58 < wumpus> or use some OS-dependent way to reduce the I/O priority

23:59 < wumpus> as long as it's done by the time the next block comes in, so taking 10 minutes would take it too far :)
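
The throttling idea from the last few lines can be sketched as a loop that pauses between lookups. The pause length, the function name and the injectable `sleep` parameter are all assumptions for illustration; per the discussion, the pause should be long enough that the thread really yields rather than spinning, and short enough that validation still finishes well before the next block.

```python
import time


def throttled_lookups(outpoints, lookup, pause_s=0.005, sleep=time.sleep):
    """Run `lookup` on each outpoint in one thread, pausing between calls.

    `lookup` stands in for a UTXO fetch; `sleep` is injectable for testing."""
    results = []
    for i, outpoint in enumerate(outpoints):
        if i:
            sleep(pause_s)  # yield to other I/O between consecutive lookups
        results.append(lookup(outpoint))
    return results
```

For n lookups the added delay is (n - 1) * pause_s, so a caller could pick pause_s from a time budget for the whole block.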