#bitcoin-core-dev on 2015-10-26 — searchable irc log

07:06 < GitHub133> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/46f74379b86b...450893769f7a

07:06 < GitHub133> bitcoin/master ceb2a9c Wladimir J. van der Laan: doc: mention BIP65 softfork in bips.md

07:06 < GitHub133> bitcoin/master 4508937 Wladimir J. van der Laan: Merge pull request #6879...

07:06 < GitHub137> [bitcoin] laanwj closed pull request #6879: doc: mention BIP65 softfork in bips.md (master...2015_10_bip65) https://github.com/bitcoin/bitcoin/pull/6879

07:21 < GitHub163> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/450893769f7a...867d6c90b850

07:21 < GitHub163> bitcoin/master dca7bd3 Wladimir J. van der Laan: doc: Add developer notes about gitignore...

07:21 < GitHub163> bitcoin/master 867d6c9 Wladimir J. van der Laan: Merge pull request #6878...

07:21 < GitHub17> [bitcoin] laanwj closed pull request #6878: doc: Add developer notes about gitignore (master...2015_10_ignore_files) https://github.com/bitcoin/bitcoin/pull/6878

07:35 < jonasschnelli> gmaxwell: re: -maxuploadtarget. Will try to complete it today. Need to adapt sdaftuar tests and rebase/squash.

08:10 < GitHub137> [bitcoin] laanwj pushed 7 new commits to master: https://github.com/bitcoin/bitcoin/compare/867d6c90b850...5242bb32c723

08:10 < GitHub137> bitcoin/master 4d2a926 dexX7: Ignore coverage data related and temporary test files

08:10 < GitHub137> bitcoin/master d425877 dexX7: Remove coverage and test related files, when cleaning up...

08:10 < GitHub137> bitcoin/master 8e3a27b dexX7: Require Python for RPC tests, when using lcov...

08:10 < GitHub173> [bitcoin] laanwj closed pull request #6813: Support gathering code coverage data for RPC tests with lcov (master...btc-test-lcov-rpc) https://github.com/bitcoin/bitcoin/pull/6813

08:34 < jonasschnelli> anyone getting the same build errors (osx) for current master: boost: mutex lock failed in pthread_mutex_lock: Invalid argument?

08:35 < * jonasschnelli> is doing a fresh checkout / build

08:43 < jonasschnelli> hmm.. same issue when doing a fresh checkout / build...

08:44 < wumpus> is that a *build* error?

08:44 < jonasschnelli> libc++abi.dylib: terminating with uncaught exception of type boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::lock_error> >: boost: mutex lock failed in pthread_mutex_lock: Invalid argument

08:44 < wumpus> looks like a runtime errror to me, something abusing boost?

08:45 < jonasschnelli> i try now to upgarde from 1.57 to 1.58...

08:45 < wumpus> let's first try to debug it

08:45 < wumpus> when does it happen? can you get a traceback?

08:45 < jonasschnelli> stacktrace stops when program tries to call the first LOCK

08:46 < wumpus> ok

08:46 < jonasschnelli> EnterCritical() / push_lock()

08:46 < wumpus> when was the last time it worked? maybe try a git bisect

08:46 < jonasschnelli> that is strange: *pthread_mutex_lock: Invalid argument*

08:47 < jonasschnelli> wumpus: okay... i'll try to find out if it's related to a source change or something on my local machine.

08:47 < wumpus> can be a use-after-free of a lock

08:48 < wumpus> if it happens at startup it may be a unexpected error path that doesn't wind down properly

09:07 < GitHub136> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/5242bb32c723...26f5b34e8838

09:07 < GitHub136> bitcoin/master 10e2eae Wladimir J. van der Laan: rpc: Add maxmempool and effective min fee to getmempoolinfo

09:07 < GitHub136> bitcoin/master 26f5b34 Wladimir J. van der Laan: Merge pull request #6877...

09:07 < GitHub170> [bitcoin] laanwj closed pull request #6877: rpc: Add maxmempool and effective min fee to getmempoolinfo (master...2015_10_mempool_effective_fee) https://github.com/bitcoin/bitcoin/pull/6877

09:33 < dcousens> gmaxwell: with the address index comment, you mention it might drive away users, but, this is an opt-in patch

09:34 < dcousens> I'd much rather use this than my own self-maintained version of the patch, hell, several block explorers I know have written their own implementations eating up much more than 50GB of space etc

09:38 < btcdrak> dcousens: FYI, my .bitcoin folder is 67GB with addrindex turned on. What is it without?

09:39 < dcousens> btcdrak: moment, just checking

09:40 < dcousens> 56G

09:40 < dcousens> with -txindex

09:40 < btcdrak> dcousens: but as a maintainer of an addrindex fork, I think this particular implementation is absolutely not the right way to go about it. It's ugly. I'd much prefer to see something like #5048 which maintains a separate index entirely.

09:40 < dcousens> Oh no doubt

09:40 < dcousens> I meant in concept only

09:40 < dcousens> Hence, only my concept ACK

09:41 < dcousens> There might be a way to do this as gmaxwell pointed out, via fast re-scan or something

09:42 < dcousens> It would just have to be super fast in the case of being feasible for block explorers etc, but, I guess you could just throw some caches/LBs in front of it

09:43 < btcdrak> dcousens: yeah, I'm not sure how addrindex would scale in the long term.

09:45 < dcousens> yeah, but I mean, we have that same concern for the blockchain itself haha

09:45 < dcousens> I just figured, in terms of worrying about users losing resources, in the end, their opting-in, so they'd be aware no?

09:46 < btcdrak> dcousens: I'm no longer as passionate because I provide a fork for those that want it. It's easy enough to maintain.

09:46 < wumpus> indexing the whole block chain is generally the wrong thing to do

09:47 < wumpus> and I understand there are forensic or research reasons to do so, but if you end up at that it's generally an indication you're doing something wrong in your design

09:48 < dcousens> wumpus: that really depends on your trust model though?

09:48 < wumpus> no, it depends on your scaling model

09:49 < wumpus> it's best to regard the block chain as transitory data used to update the utxo state

09:50 < wumpus> (as well as other things, such as wallets)

09:51 < dcousens> agreed, but the UTXO may not be all you care about. But yes your right, in that sense, its your scaling model

09:52 < wumpus> that's why I added (as well as other things) - it's data you want to use to apply changes to your running state, then throw away

09:52 < dcousens> wumpus: but that lacks the information in the case of a user wanting to see the 'history' of a certain address

09:52 < wumpus> bitcoind's wallet, as one exaple, does this correctly. It only requires the whole beast for rescans. It would be nice to have an external example though that isn't embedded into bitcoind.

09:52 < dcousens> the only way to generate that history would be to view all hte transitory data

09:53 < wumpus> the way to generate that history would be to store it as it comes along, as metadata

09:53 < dcousens> that assumes you know the address from the start

09:53 < wumpus> the same way bitcoind's wallet does - store the transactions, or whatever you need to list it later

09:54 < dcousens> for example for a wallet I maintain, we very often have users that come along with BIP39 mnemonics with 100's of used addresses, forcing a rescan for each time they introduce a new mnemonic/wallet would be a huge rescan burden for us

09:54 < wumpus> history (or the part of history that you want to store for accounting purposes) is part of your running state

09:54 < dcousens> Which again, relates to our scaling model, but, that too relates directly to this patch (as a concept)

09:57 < wumpus> yes, if you essentially offer rescanning-as-a-service it makes sense to keep an index. I still think it should be something external though, not part of bitcoind. There are tons of differents kinds of indexes one could want, for specific purposes

09:57 < wumpus> ideally some external indexing tool would source block data from bitcoind then keep an index, as running state

09:59 < wumpus> so my points are: do not encourage people to keep that index, and do not clutter the codebase with all kinds of extra indexes not necessary for node functionaity. It is not "never create an index". sure there may be reasons to do so but it's very custom

10:00 < dcousens> wumpus: hmmm, but, maintaing this indexing is not a simple task

10:00 < btcdrak> wumpus: I think I finally see your point

10:00 < wumpus> I haven't used the word 'simple'

10:00 < dcousens> It requires a lot of complex logic in regards to re-orgs etc

10:01 < dcousens> wumpus: I know, just putting it out there as to why this is probably a persistent feature request

10:01 < wumpus> yes, it requires logic with regard to re-orgs. To be precise, you need to be able to roll back updates from a block.

10:01 < wumpus> bitcoind does this by keeping undo files, but your database may have features to do so as well

10:02 < wumpus> and once this code is written people can just use it, it's no more complex than using your patch

10:02 < dcousens> wumpus: in my mind, it seems more like its just a way to offload the maintenance burden haha

10:03 < btcdrak> why dont we use zmq to notify external apps?

10:03 < wumpus> there's --enable-zmq?

10:04 < dcousens> wumpus: is there a re-org notification?

10:04 < dcousens> That would make the roll backs super simple

10:04 < btcdrak> it could store whatever it wants, bitcoind would notify about reorgs etc. that would allow to build a sort of indexd. The indexd could always query bitcoind for things like blocks as a passthrough

10:05 < wumpus> not sure - there is a notification when there is a new block, you can use that to determine if there has been a reorg. But a more direct notification of what blocks have to be un-done/applies might be useful too. Though be careful, zmq notifications can be lossy, so you shouldn't rely on their content.

10:06 < wumpus> btcdrak: right

10:06 < wumpus> indexd would pull the data it requires from bitcoind

10:06 < btcdrak> wumpus: if we're missing notifications I'm sure they are easy to add.

10:06 < dcousens> wumpus: really? Are they consistently lossy?

10:07 < wumpus> dcousens: yes, if the queue is full new messages are discarded

10:07 < dcousens> wumpus: right, so its just a matter of consumption

10:08 < wumpus> not sure if are any guarantees for delivery - that's why I have always been kind of careful there, it's easy to use it in a wrong way. Notifications are "hey, wake up, there is new information". If those get lost there's no big issue, it will pick them up next time.

10:10 < dcousens> wumpus: mmm, I guess, in the end, its as btcdrak as suggested. The underyling problem is that the maintenance of the address index becomes difficult due to re-org detection. If we can have bitcoind provide us with that information in a clear way, most of this becomes trivial

10:11 < wumpus> I'm not sure reorg detection is so difficult? keep the list of hashes up to the current block, then if a new best block comes in, look at its parents, where it intersects with the list of your blocks is the fork/reorg point

10:12 < dcousens> wumpus: requires maintaing another list, another source of data

10:12 < wumpus> yes it does

10:12 < wumpus> that's life, you need to maintain the data structures that you need to do your work. only be keeping this state yourself you can be sure your state is consistent.

10:13 < btcdrak> indexd could even use bitcoind for SPV lookups

10:14 < wumpus> would be great to have some library or existing software to handle this for you, I'm just talking concepts here not implementation

10:14 < dcousens> btcdrak: I was thinking about that

10:14 < dcousens> you don't even need to verify since you'd already trust your own node

10:15 < wumpus> right - you outsource trust to your node, verification, that's what it's for

10:15 < dcousens> just as wumpus said, probably still need to maintain the hash-chain if theres no notification for re-orgs

10:15 < wumpus> well you need to know what hash-chain your state is based on

10:16 < dcousens> well, not really, all you actually care about is what blocks got reverted, such that you can roll back the relevant transactions in the index

10:17 < wumpus> otherwise the couling is dangerously tight, say, your indexd was disconnected from bitcoind for a while and you reconnect and have to catch up

10:17 < wumpus> coupling*

10:17 < wumpus> yes, but you need to detect that yourself. You can't rely on bitcoind to keep state for you

10:17 < dcousens> wumpus: it would know what blocks get reverted in a re-org, no?

10:18 < wumpus> but only in realtime - ideally you don't want that kind of tight coupling. What if indexd wasn't connected at the time the reorg happens. Do you want to restart all the way from the beginning?

10:18 < dcousens> true

10:19 < wumpus> you have a) your own hash chain b) bitcoind's hash chain , you always want to go from a to b

10:19 < wumpus> (given that you trust bitcoind to give you the best chain)

10:20 < jonasschnelli> wumpus: i can successfully build up to https://github.com/bitcoin/bitcoin/commit/579b863cd7586b98974484ad55e19be2a54d241d

10:20 < jonasschnelli> wumpus: https://github.com/bitcoin/bitcoin/commit/a09297010e171af28a7a3fcad65a4e0aefea53ba seams to break things.. how can this even be possible...

10:20 < jonasschnelli> no code changes at all

10:21 < dcousens> btcdrak: happy to work with you on indexd concept

10:21 < wumpus> would be awesome if someone implemented that

10:21 < wumpus> jonasschnelli: huh that just changes my silly dev tools :)

10:22 < jonasschnelli> yeah. I know... let me investigate more

10:28 < wumpus> so 0fbfc51 works?

10:30 < wumpus> (that's the last commit before merging #6854)

10:33 < GitHub168> [bitcoin] laanwj pushed 3 new commits to master: https://github.com/bitcoin/bitcoin/compare/26f5b34e8838...ff057f41aa14

10:33 < GitHub168> bitcoin/master 9d55050 Mark Friedenbach: Add rules--presently disabled--for using GetMedianTimePast as endpoint for lock-time calculations...

10:33 < GitHub168> bitcoin/master dea8d21 Mark Friedenbach: Enable policy enforcing GetMedianTimePast as the end point of lock-time constraints...

10:33 < GitHub168> bitcoin/master ff057f4 Wladimir J. van der Laan: Merge pull request #6566...

10:33 < GitHub161> [bitcoin] laanwj closed pull request #6566: BIP-113: Mempool-only median time-past as endpoint for lock-time calculations (master...medianpasttimelock) https://github.com/bitcoin/bitcoin/pull/6566

10:36 < btcdrak> dcousens: sure.

10:38 < dcousens> wumpus: would it better to maintain indexd in repo or as a separate application?

10:39 < wumpus> well - better is always to have it completely detached, but I suppose developing it inside the repository is easier for now if you need serialization and certain data structures

10:40 < wumpus> we don't have a way to export those as a library yet

10:40 < jonasschnelli> wumpus : found the problem: https://github.com/bitcoin/bitcoin/pull/6722/files#diff-ca74c4b28865382863b8fe7633a85cd6R312

10:40 < dcousens> wumpus: aye

10:40 < jonasschnelli> clear() calls LOCK() in cxx_global_var_init()

10:41 < jonasschnelli> i think this should be changed.

10:44 < wumpus> common, straightforward to do this is to separate it out into a private function, say clear_(), and a public function clear() that gets the lock and calls clear_()

10:44 < wumpus> this way, functions that already have the lock, or don't need it otherwise, can call clear_ instead

10:45 < wumpus> in this case it is the constructor so obviously it doesn't need the lock

10:45 < jonasschnelli> okay... can try a patch.

10:49 < wumpus> not that it should normally hurt to lock a lock that is part of your object inthe constructor... but this looks like some global obscure initialization order thing because the mempool object is global :(

10:51 < wumpus> (probably it would be better to use an explicit lifetime for objects such as these, but fixing that is more than you bargained for)

11:32 < Luke-Jr> huh, the wallet rescan % shown in the splash screen is much higher than the debug.log Progress - block count vs tx count? :/

11:36 < wumpus> GUIstd::max(1, std::min(99, (int)((Checkpoints::GuessVerificationProgress(chainParams.Checkpoints(), pindex, false) - dProgressStart) / (dProgressTip - dProgressStart) * 100)))

11:36 < wumpus> log: Checkpoints::GuessVerificationProgress(chainParams.Checkpoints(), pindex)

11:37 < wumpus> the GUI takes into account the starting point, which you'd expect would result in it showing a *lower* progress % when it makes a difference

11:38 < wumpus> but the GUI also has fSigchecks=false for GuessVerificationprogress, maybe tha'ts it

12:42 < wumpus> this is curious, anyone else having problems with address parsing on windows 10? https://github.com/bitcoin/bitcoin/issues/6886

12:44 < wumpus> looks like the IP parsing (using getaddrinfo) always returns 0:0:0:0:0:0:0:0/128, IPv4 parsing as well as IPv6 parsing fails

13:57 < GitHub175> [bitcoin] jonasschnelli opened pull request #6889: fix locking issue with new mempool limiting (master...2015/10/fix_mempool_lock) https://github.com/bitcoin/bitcoin/pull/6889

14:46 < morcos> Does anybody have any thoughts on storing the number of sigOps in a tx in the CTxMempoolEntry?

14:47 < morcos> We calculate this on ATMP, and we need it in CreateNewBlock, but it is on the expensive'ish side to calculate.

14:48 < sipa> sounds good

14:48 < morcos> Wow, I was prepared to have to fight for that... :)

14:50 < morcos> I was naively hoping you could just assume every scriptSig was satisfying a P2SH and you wouldn't over count by too much, but turns out that is NOT the case..

14:55 < dgenr8> morcos: any plans to cache ancestor pkg sums the way descendants are?

14:56 < morcos> dgenr8: sdaftuar has written something which he might publish soon. but i don't think there is any point in merging it until we have mining code that can take advantage of it.

14:57 < morcos> i'm starting by just trying to make the existing mining code much faster

14:59 < gmaxwell> 02:58 < gmaxwell> Is there a reason that for fee purposes we are not using max(size, sigops*BLOCK_MAX_BYTES/BLOCK_MAX_SIGOPS) as the size of a transaction?

14:59 < gmaxwell> as in "the size" that we use for mempool/etc. purposes.

15:02 < gmaxwell> dcousens: Opt-in isn't sufficient to prevent it from driving users out, I do explain this...

15:02 < morcos> gmaxwell: i saw that. on my list somewhere is to research algorithms for satisfying multiple constraints. presumably if we know that actual size is usually the limiting factor, then it doesn't seem right to sort something lower b/c it has a lot of sigops even though you're not going to hit that limit

15:02 < morcos> i was thnking about it in the terms of some future consensus requirement on utxo deltas as well

15:05 < morcos> the discussion we were having the other day about a separate thread for template generation seems very difficult to me. its very hard to extract code from needing to lock at least the mempool.

15:05 < morcos> i think to do that, we might need 2 mempools, or to just allow the mempool to queue tx's while the mining code is running

15:06 < morcos> but its an easy win to make existing createnewblock significantly faster just by adding an index on individual tx feerate, so i figured i'd start with that.

15:19 < gmaxwell> morcos: I've generally assumed the multivariable optimization would be much harder and not really worth it-- the sigops limit is high enough that it doesn't normally have an effect. What I suggested at least prevents the case where some silly dos attack transaction with a high per byte fee kills a feerate greedy selection.

15:20 < gmaxwell> morcos: hm. So I didn't expect the seperate template thread would avoid locking the mempool. I assumed it would. But that having it would allow the createnewblock to always run without taking any long-held locks.

15:21 < gmaxwell> E.g. block creation thread would lock the mempool and build a block. Then take a blocktemplate lock for just an instant to replace the last template.

15:21 < morcos> gmaxwell: ah, interesting, so trick the mining code into producing small blocks unecessarily.. maybe that is a good idea.

15:23 < morcos> gmaxwell: i see. i guess i need to understand how getblocktemplate is used in practice. i was assuming the threading was so we could be optimizing all the time. i didn't realize it was because you wanted to return a previously generated template to someone calling gbt. i assumed they had already gotten the previous template

15:24 < gmaxwell> and if the cached template is out of date with the chain when gbt runs, you just generate an empty template, which you can do without touching he mempool.

15:25 < morcos> yeah, so i did write a threaded version that does what you suggested, but it was bad, because it was basically just busy grabbing cs_main and mempool, and i thought well ok we'll have to make it run only every so often, but then i didn't see why that was better than the existing usage

15:26 < morcos> but you want to optimize for the case where the caller currently has nothing valid to work on?

15:26 < gmaxwell> When a new block is accepted by the node the miner process learns this (either via p2p inspection or via longpolling) and calls GBT. You want the lowest latency response possible; because until it returns the mining gear is working on a fork.

15:26 < gmaxwell> Then the mining clients will periodically poll to update the transaction list. In this polling you don't care if the transaction data returned is seconds stale.

15:27 < morcos> yes, understood that the new block case needs to be optimized, that was my next step, and that seems pretty easy

15:27 < gmaxwell> since all that does is deny you the little incremental fee rate from new transactions that came in.

15:28 < gmaxwell> We already do createnewblock caching. but it only helps with the second and later requests.

15:28 < morcos> so what i was assuming is that when you periodically poll, which is the common use case, you already have valid work, so the existing longpoll functionality is fine, and there is no need for background process

15:28 < morcos> yes right

15:30 < morcos> so i think what you're saying is that for instance a new block could come in while you happen to not be longpolling, so then it won't be until you call getblocktemplate that it even tries to do anything, and it woudl be better if it started immediately?

15:30 < gmaxwell> Ultimately it would be good to avoid the case where if you poll just before a new block is accepted, you don't delay getting work on the new block. Though, indeed, then I think you run into locking issues with the mempool.

15:31 < gmaxwell> morcos: yes, when a new block comes in and you have been mining it should immediately compute a new block template, and until thats done, return empty ones so you'll at least be working on extending the longest valid chain.

15:32 < gmaxwell> (a failure to make this optimization is part of what contributes to miners mining on other miners work without validating-- they go and 'optimize' at a layer higher than Bitcoin, and as a resut complex details like validation go out the window. :) )

15:32 < morcos> it seems like you'd want to be notified of 2 things, when there is a new block template (empty) and when there is the first template with txs based on the new block. how do you get that notification if you're not longpolling constantly?

15:36 < morcos> i was just going to change it so the longpool returns immediately with an empty block if a new block comes in, via a flag to create new block or something. and then your next long polling call to getblocktemplate also calls CNB immediately which will return with txs.

15:36 < gmaxwell> We could trigger the GBT LP's every N seconds or when the fees change by X, I don't actually know what (if anything) triggers the GBT LP now except a new block. I also don't know how widely GBT LP is used at the moment: RPC concurrency has been kinda broken for a long time (fixed in git master), and GBT LP was a late addition, so I think most things decide when to poll GBT based on monitoring

15:36 < gmaxwell> P2P and then otherwise run on a timer.

15:37 < morcos> ok... well doing it the threaded way shouldn't be hard either...

15:38 < gmaxwell> morcos: we do want to get people mining transactions quickly too-- as otherwise you're missing out on fees (and the public tends to get really angry when they're waiting for transactions and an empty block shows up. :) )

15:38 < morcos> gmaxwell: yes, will be much much faster, so as long as you poll again immediately, shouldn't be more than 100ms

15:39 < morcos> i'll probably have a lot more questions when our mining hardware shows up.. :)

15:43 < gmaxwell> There is some ... potential for "can't win" here, so there is some hardware that really dislikes being longpolled often. E.g. for some dumb reason (usually a complete lack of understanding about all the reasons you might LP) and they flush the pipeline completely. Now, GBT LPing doesn't necessarily mean the hardware gets disrupted, as some higher layer thing can ignore an early long poll and pi

15:43 < gmaxwell> ck up the non-empty template on a later timed run.

15:44 < gmaxwell> So it might be the case that we expose breakage if we start LPing 100ms after the last time we LPed, just because we now have transactions. But if so, we can probably add a flag to delay that second LP event.

15:44 < gmaxwell> (that same hardware is the stuff that cannot be used with P2Pool)

16:25 < GitHub52> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/ff057f41aa14...c8322ff7f754

16:25 < GitHub52> bitcoin/master 143d173 Eric Lombrozo: Use BOOST_CHECK_MESSAGE() rather than BOOST_CHECK() in alerts_tests.cpp and initialize strMiscWarning before calling PartitionCheck()."

16:25 < GitHub52> bitcoin/master c8322ff Wladimir J. van der Laan: Merge pull request #6888...

16:25 < GitHub96> [bitcoin] laanwj closed pull request #6888: Clear strMiscWarning before running PartitionAlert (master...alert_tests) https://github.com/bitcoin/bitcoin/pull/6888

16:42 < GitHub16> [bitcoin] laanwj pushed 3 new commits to master: https://github.com/bitcoin/bitcoin/compare/c8322ff7f754...dbc5ee821ecd

16:42 < GitHub16> bitcoin/master d4aa54c Kevin Cooper: added org.bitcoin.bitcoind.plist for launchd (OS X)

16:42 < GitHub16> bitcoin/master e04b0b6 Kevin Cooper: added OS X documentation to doc/init.md

16:42 < GitHub16> bitcoin/master dbc5ee8 Wladimir J. van der Laan: Merge pull request #6621...

16:42 < GitHub118> [bitcoin] laanwj closed pull request #6621: added org.bitcoin.bitcoind.plist for launchd (OS X) (master...master) https://github.com/bitcoin/bitcoin/pull/6621

16:54 < GitHub11> [bitcoin] laanwj pushed 3 new commits to master: https://github.com/bitcoin/bitcoin/compare/dbc5ee821ecd...7939164d8985

16:54 < GitHub69> [bitcoin] laanwj closed pull request #6622: Introduce -maxuploadtarget (master...2015/09/maxuploadtarget) https://github.com/bitcoin/bitcoin/pull/6622

16:54 < GitHub11> bitcoin/master 872fee3 Jonas Schnelli: Introduce -maxuploadtarget...

16:54 < GitHub11> bitcoin/master 17a073a Suhas Daftuar: Add RPC test for -maxuploadtarget

16:54 < GitHub11> bitcoin/master 7939164 Wladimir J. van der Laan: Merge pull request #6622...

18:33 < sipa> morcos, sdaftuar, BlueMatt: perhaps we need to talk a bit about what the expectation of the coincache + mempool size limiting is

18:34 < sipa> do we want a "all of the mempool's dependencies are guaranteed to always fit in memory" ?

18:36 < morcos> sipa: hmm, i don't think i was trying to argue for that. in fact i don't think it would be at all reasonable to do that.

18:37 < morcos> i think we should set the default sizes so that is true in general though

18:37 < dgenr8> Was reversing the feerate sort just an expressiveness change, or did it fix something? 78b82f4

18:37 < sdaftuar> dgenr8: expressiveness mostly, avoids an issue with using project

18:38 < sipa> morcos: one way to accomplish that is to just have a combined coincache+mempool memory limit size, and depending on whether the dependencies are large or small, effectively have a smaller resp. larger mempool limit as a result

18:38 < sipa> morcos: but if we don't have that, we _must_ enforce the coincache limit

18:38 < dgenr8> sdaftuar: thx!

18:38 < sdaftuar> i think that might be a little dangerous though because that has an effect on relay policy

18:39 < sipa> morcos: if the coincache exceeds its size, it will be wiped anyway when the next block comes

18:39 < sipa> calling flush in the tx code just makes it happen a bit earlier to avoid an OOM

18:39 < sdaftuar> sipa: i think we should revisit that behavior. ideally the coinscache would have in it the set of things likely to be needed when the next block arrives

18:39 < morcos> sipa: you realize that we can't always enforce the coincache size? when we're connecting a block it could temporarily exceed for example

18:40 < sipa> morcos: yeah...

18:40 < morcos> i agree that we should _eventually_ put in something that enforces limitation of the coincache size between blocks

18:40 < morcos> but that might as well wait for something smarter than just flushing

18:40 < sipa> i think the only real solution is to switching to a per-txout cache instead of a per-tx cache

18:40 < sipa> so the ratio is more or less fixed

18:40 < sdaftuar> ah i was wondering how feasible that would be

18:40 < morcos> and in the meantime we set the ratio of the defaults to be reasonable so that we don't usually blow through the cache limit

18:41 < morcos> given that you have to pay for relay to increase the cache size there is some cost to the attack after all

18:41 < morcos> and how big are we worried the cache size could get?

18:41 < sipa> define usually

18:42 < morcos> i guess i'm thinking that if we set mempool to 300M then it'd be rare than the coinsview cache was over 600M or something

18:43 < morcos> thats just a lot bigger than the current default

18:45 < morcos> i just think flushing after txs should be rare, if we want to set some higher limit to protect against OOM and flush after txs there, then thats fine, lets just set all these defaults so that what happens usually is the cache is big enough to contain a good chunk of the mempool. it doesn't really have to contain the whole thing, because

18:45 < morcos> you wont' get to 100% mempool size in between blocks

19:33 < phantomcircuit> indeed why are we keeping the already accepted mempool dependencies in the cache? they're already validated isn't that going to reduce the cache hit % ?

19:34 < sipa> phantomcircuit: they're validated again when a block is received

19:34 < sipa> which we want to make fast

19:35 < phantomcircuit> sipa, ah that's right

19:35 < sipa> an approach to avoid that is caching the success, but if you store that cache value in the mempool, you're making the mempool code consensus critical

19:35 < phantomcircuit> yeah lets not do that :)

19:35 < sipa> eh, and that doesn't even help

19:35 < sipa> signature validation is already cached anyway

19:35 < phantomcircuit> oh that reminds me

19:35 < sipa> and you need to be able to update the spends in the chainstate anyway after the block is validated

19:36 < phantomcircuit> the size of the sig cache isn't in the help menu

19:36 < sipa> good point

19:36 < phantomcircuit> it significantly reduces worst case AcceptNewBlock() latency to increase that

19:36 < phantomcircuit> miners should basically all have that be 10-100x the default

19:36 < gmaxwell> what?

19:37 < gmaxwell> the entries are _tiny_, we should be able to have a "big enough for ~100% hitrate" cache for everyone.

19:37 < sipa> i actually have no idea what size the sigcache is

19:39 < phantomcircuit> huh it is in the help

19:39 < gmaxwell> phantomcircuit: for block acceptance speed it may make sense to seperately cache success and failure.

19:39 < phantomcircuit> 50000 entries

19:40 < phantomcircuit> gmaxwell, yes map<prevblockhash, set<txid> >

19:41 < phantomcircuit> should work nicely under non adversarial conditions

19:41 < sipa> that means that an entry "3000 ago" (which is around 1000 tx) only has a 96% hitrate

19:41 < sipa> eh, 94%

19:41 < gmaxwell> phantomcircuit: what, no.

19:41 < sipa> it uses random replacement

19:41 < phantomcircuit> sipa, yup 10-100x makes a big difference

19:41 < gmaxwell> phantomcircuit: transactions can spend unconfirmed outputs my friend.

19:42 < phantomcircuit> gmaxwell, yeah you still have to check that the dependencies

19:43 < phantomcircuit> that *does* make the mempool view consensus critical though

19:43 < gmaxwell> phantomcircuit: if you only mean to check that the ECDSA passes, than <txid> is enough.

19:43 < sipa> the sigcache is per signature, not per transaction

19:43 < sipa> per transaction is pretty complicated, as you need to take active softforks into account etc

19:44 < phantomcircuit> sipa, hadn't considered that

19:44 < sipa> there have been patching before that cached full tx validation

19:44 < sipa> but actually just script validation caching is cheaper

19:45 < sipa> s/cheaper/easier

19:45 < phantomcircuit> hmm

19:46 < sipa> i can implement that easily

19:46 < phantomcircuit> have we considered that upload limiting disconnecting peers will have an impact on the network topology?

19:47 < gmaxwell> phantomcircuit: it only disconnects peers that fetch historic blocks currently; largely avoiding the concern.

19:47 < phantomcircuit> that seems safe

19:48 < gmaxwell> well safer at least. it doesn't completely escape topology impact.

19:48 < gmaxwell> Because a new node will, of course, not remain connected to anything over its limit.

19:49 < gmaxwell> so even once its synced up it will end up with a different position in the topology than it would otherwise.

19:49 < phantomcircuit> export MALLOC_ARENA_MAX=1

19:49 < phantomcircuit> that completely fixes the problem

19:49 < phantomcircuit> neat

19:50 < sipa> IBD's stall detection already causes significant movement in the topology

19:50 < sipa> phantomcircuit: ??

19:50 < sipa> to reduce fragmentation?

19:50 < phantomcircuit> gmaxwell, right which means new nodes will tend to end up connected to nodes that have no upload limit

19:50 < phantomcircuit> sipa, yeah, calling getblocktemplate in a loop and memory usage <1GB

19:50 < gmaxwell> phantomcircuit: at least during their first runtime, after restart their topo will be more uniform.

19:51 < gmaxwell> phantomcircuit: we really need outbound rotation to flatten topo hotspots.

19:51 < phantomcircuit> gmaxwell, for that we need "outbound slots" basically

19:52 < phantomcircuit> the easiest way to do that is to just assign the second half of the outbound connections (based on vNodes position) as rotating

19:52 < phantomcircuit> definitely dont want to rotate all the outbounds :)

19:52 < gmaxwell> right, it's a good way to wander into a sybil attack.

19:53 < CodeShark> are you guys submitting proposals for hong kong?

19:53 < CodeShark> sorry if off-topic

19:54 < phantomcircuit> hmm this is actually kind of annoying to test

19:55 < phantomcircuit> have to get the mempool full again

19:55 < phantomcircuit> sending the "mempool" command to peers when they connect results in comical performance issues

19:55 < sipa> phantomcircuit: i run with that for benchmarking mempool behaviour

19:56 < phantomcircuit> sipa, have you noticed that you just sit there spinning at 100% cpu processing something (i haven't bothered to figur eout what yet)

19:56 < sipa> phantomcircuit: turn on debug=mempool and debug=mempoolrej

19:57 < phantomcircuit> i run this node with debug=

20:04 < GitHub138> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/7939164d8985...2b625510d374

20:04 < GitHub138> bitcoin/master 7bbc7c3 Suhas Daftuar: Add option for microsecond precision in debug.log

20:04 < GitHub138> bitcoin/master 2b62551 Wladimir J. van der Laan: Merge pull request #6881...

20:05 < GitHub108> [bitcoin] laanwj closed pull request #6881: Debug: Add option for microsecond precision in debug.log (master...add-microsecond-timestamps) https://github.com/bitcoin/bitcoin/pull/6881

20:08 < phantomcircuit> BlueMatt, plz2rebase seed

20:14 < phantomcircuit> sipa, non-scientific gdb interrupt+ bt says it's spinning on ECDSA_verify

20:14 < phantomcircuit> which i guess is to be expected :)

20:16 < wumpus> is it catching up or in initial sync? if so, that's expected

20:16 < wumpus> if in steady state it's not normal

20:17 < sipa> wumpus: this is with sending out a BIP35 "mempool" command, so at startup it receives zillions of transactions

20:18 < wumpus> ok...

20:18 < wumpus> yes, then it makes sense too

20:18 < sipa> so i do expect you'd be accepting mempool txn at the speed you can handle

20:26 < phantomcircuit> wumpus, http://0bin.net/paste/8349G1adfXBgLthd#l6rCtj99ayGFnNdx6-C+bbP1fYFF0EJIvg4qvV4JnD9

20:32 < wumpus> you do have commit 5ce43da03dff3ee655949cd53d952dace555e268?

20:33 < phantomcircuit> wumpus, yes

20:33 < wumpus> if so, this shouldn't happen - unless gdb happens to reenable the signal

20:34 < phantomcircuit> it shouldn't

20:34 < wumpus> yeah it does, it reports all signals by default. Try: "handle SIGPIPE nostop noprint pass"

20:34 < phantomcircuit> but who knows... gmaxwell do you know :)

20:35 < phantomcircuit> wumpus, sigh

20:35 < phantomcircuit> ok yes that's good

20:36 < phantomcircuit> bleh why does it do that?

20:57 < phantomcircuit> wumpus, can verify that MALLOC_ARENA_MAX=1 works btw

20:57 < sipa> phantomcircuit: what about 2 instead of 1?

20:58 < phantomcircuit> sipa, not sure it takes ages to load enough transactions into the mempool for the result to be valid

20:58 < wumpus> phantomcircuit: awesome

20:59 < sipa> too bad we can't set MALLOC_ARENA_MAX from the program itself, i suppose?

20:59 < phantomcircuit> sipa, well we can but...

21:00 < sipa> how so?

21:00 < wumpus> I think you can

21:00 < sipa> we can change the env variable, but i expect that the arenas are created at library load time

21:00 < sipa> before we can change it

21:00 < phantomcircuit> sipa, the easiest would be to set the env and execve

21:01 < phantomcircuit> ie h4x

21:01 < wumpus> let me see glibc source...

21:03 < wumpus> it indeed makes no sense to set the env var, as it parses it at startup as expected, however you should be able to do mallopt(M_ARENA_MAX, 1)

21:05 < wumpus> /* mallopt options that actually do something */ in /usr/include/malloc.h :-)

21:06 < sipa> man mallopt does not list M_ARENA_MAX here

21:07 < wumpus> it's an undocumented option

21:08 < wumpus> probably should call it before any call to malloc to be effective

21:09 < wumpus> or at least before calling it in a thread

21:09 < sipa> the execve approach works...

21:09 < sipa> it's ugly because it needs to know the name of the binary you just started

21:09 < wumpus> that's really ugly, and makes debugging harder

21:10 < sipa> agree

21:10 < phantomcircuit> ha it's not even in the libc manual

21:10 < wumpus> and that (though you should always have that in argv[0] on unix)

21:10 < phantomcircuit> it looks like we could set M_MMAP_THRESHOLD which would be slower but seems to guarantee memory is actually free'd

21:12 < wumpus> but gdb will show something like "process 28686 is executing new program: /bin/dash" when a program execve's, and it loses all its state

21:13 < phantomcircuit> wumpus, yeah i know thus "h4x"

21:13 < phantomcircuit> it would probably also break anybody using start-stop-daemon

21:13 < phantomcircuit> gtg

21:14 < wumpus> but the mallopt should work too (can't check it right now though)

21:14 < wumpus> ok later

23:06 < phantomcircuit> 41k transaction in the mempool and counting

23:07 < sipa> yeah, 50k sigcache entries won't be good in that case...

23:07 < phantomcircuit> yeah i changed mine to maxsigcachesize=1000000

23:13 < gmaxwell> sipa: sigcache is currently uint256,vector<char>,cpubkey.... bloat bloat bloat.

23:14 < gmaxwell> Could be compressed with H(the above).

23:14 < gmaxwell> H() need not even be very collission resistant if there is a per node secret nonce.

23:21 < phantomcircuit> gmaxwell, yeah that's what i was thinking

23:21 < sipa> it's around 220 bytes per entry now

23:23 < phantomcircuit> sipa, depends on the size of the pubkey script right?

23:23 < phantomcircuit> it would be nice if it didn't

23:23 < phantomcircuit> upto 44.5k now

23:24 < sipa> phantomcircuit: it's a pubkey; not a script

23:24 < sipa> pubkeys are 65 bytes

23:24 < sipa> in memory

23:24 < sipa> + alignment

23:24 < phantomcircuit> oh

23:26 < gmaxwell> sipa: plus whatever overhead there is from the std::set.

23:26 < sipa> gmaxwell: around 40 bytes, i included that

23:39 < gmaxwell> sipa: would it be really ugly if accepting the block carried a flag to the checker that told it to remove the entry? with that alone the hitrate would no longer be low absent attacks because the cache would not be full all the time.

23:52 < morcos> sipa: phantomcircuit: i don't think we need to worry about M_ARENA_MAX. we can drastically reduce the size of the problem. first we rewrite CreateNewBlock to only pull into the cache 1 blocks worth of txs. second we make it run in only 1 thread.

23:53 < morcos> but yeah that maxsigcachesize will bite you. i couldn't figure out why my new code was slower and it was that its sigcache was getting blown away