< Luke-Jr> re sigop-limit flooding, can we just merge acceptnonstdtxn (mainnet option, off by default) finally? this would not be a problem if people were using code with it merged..
< gmaxwell> Luke-Jr: explain your logic?
< Luke-Jr> gmaxwell: the top commit limits (in policy) tx sigop count based on its size
< Luke-Jr> so the spam would need to be larger for it to use 15 sigops
< Luke-Jr> thus hitting the block size limit approximately at the same time as the sigop limit
< gmaxwell> I ~think~ the suggestion I made of max(size, sigops*1e6/20000) is more general than the limit thing.
< Luke-Jr> (and triggering higher fees, of course)
< Luke-Jr> gmaxwell: yes, my point is that the bug wouldn't exist in the first place had this already been merged
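gmaxwell's `max(size, sigops*1e6/20000)` suggestion above can be sketched as follows. This is a hedged illustration only: `EffectiveTxSize` and the constants are illustrative names mirroring the chat, not actual Bitcoin Core code.

```cpp
#include <algorithm>
#include <cstdint>

// Price a transaction's sigops in units of block space, so a sigop-heavy
// tx is treated as if it were larger (and must pay fees accordingly).
static const int64_t MAX_BLOCK_SIZE = 1000000;  // 1e6 bytes
static const int64_t MAX_BLOCK_SIGOPS = 20000;  // legacy per-block limit

// Effective size = max(real size, sigops scaled by bytes-per-sigop).
// At these constants each sigop costs 50 "bytes" of block space.
int64_t EffectiveTxSize(int64_t nTxSize, int64_t nSigOps)
{
    return std::max(nTxSize, nSigOps * MAX_BLOCK_SIZE / MAX_BLOCK_SIGOPS);
}
```

Under this rule a 300-byte tx with 15 sigops is charged 750 bytes, so sigop spam hits the size limit at roughly the same time as the sigop limit, as Luke-Jr describes.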
< BlueMatt> morcos: you asked about tx cache? https://github.com/TheBlueMatt/bitcoin/tree/limitucache
< BlueMatt> actually, I'll just pr
< GitHub110> [bitcoin] TheBlueMatt opened pull request #6868: Limit txn cache growth (master...limitucache) https://github.com/bitcoin/bitcoin/pull/6868
< gmaxwell> BlueMatt: for your don't fork from here-- I suggest https://github.com/gmaxwell/secp256k1
< BlueMatt> gmaxwell: yes, but that takes effort....
< gmaxwell> well if someone does it to the repo first you can check out theirs and force push it into yours. :P
< gmaxwell> I guess I should do bitcoin core too.
< GitHub26> [bitcoin] TheBlueMatt closed pull request #6868: Limit txn cache growth (master...limitucache) https://github.com/bitcoin/bitcoin/pull/6868
< morcos> BlueMatt: interesting idea.. i'm not sure what i think of it yet... you don't think it's also potentially inefficient?
< morcos> did you see the conversation wumpus and I were having earlier
< morcos> one thing we could do in ATMP, is not pull coins into the pcoinsTip cache for txs which aren't actually accepted into the memory pool
< sipa> yeah, we should do that
< sipa> i see sometimes huge increases in utxo cache size after a transaction
< sipa> however, with both limited utxo cache and mempool, memory usage actually remains *very* low
< sipa> have a node with 100 MB utxo and 200 MB mempool, and total memory is staying below 570 M
< morcos> just don't run GetBlockTemplate! :)
< morcos> sipa: why does CreateNewBlock need to hold cs_main the whole time
< sipa> no idea
< sipa> also no idea why it actually needs to verify the result
< morcos> it seems like after the first round of calling HaveCoins on all the txins of all the txs.. you now have a view with all of your coins in it
< morcos> and you could release cs_main
< sipa> the mempool could change then, though?
< morcos> i took a first stab at a background miner, but realized its useless if its holding cs_main
< morcos> well... what do you mean change? i think we don't want to lose txs that we might be trying to mine
< morcos> but maybe you can handle that by not removing things
< morcos> a smaller lock that just guards actually removing txs
< morcos> why would it not need to verify the result, you mean because it shouldn't have been able to create something invalid
< sipa> yes
< morcos> i do think thats a nice double check
< sipa> i don't think it's worth the cost
< sipa> given that it hasn't failed in years
< morcos> well it will be if we're changing the mining algorithm!
< sipa> maybe a spot check would be useful
< sipa> run it once every 1000 calls
< jgarzik> since it's in the background the cost is mitigated
< morcos> holds cs_main
< sipa> block validation will not be background
< sipa> it needs the utxo set
< sipa> though i guess you are right that if it actually does make a full copy of all inputs, it can release the lock afterwards
< jgarzik> yes it needs utxo set but doesn't need cs_main long term
< sipa> well it needs the utxo lock, whatever that is
< morcos> yeah i think we could be smart about it
< sipa> i think it shouldn't need the utxo set or copies at all
< sipa> the mempool should be consistent
< sipa> do a spot check occasionally to make sure the code isn't broken
< morcos> right now it already pulls the potentially used utxos into caches TWICE. so you could use one for creation and the other for validation without needing to grab cs_main again
< sipa> yup
< sipa> it should be 0, imho
< morcos> oh
< morcos> thats what you mean by mempool should be consistent
< sipa> the mempool is known to not double spend
< morcos> yikes!
< sipa> not contain invalid transactions
< morcos> but thats also been broken
< sipa> so we should fix it
< morcos> but what i'm saying is that breaks often enough that you wouldn't want it to cause you to mine an invalid block?
< morcos> at least one check should still exist
< sipa> that's like saying "we're not sure about the utxo cache, let's run checkblockchain on every update"
< sipa> sure the check should exist
< sipa> but it should not be slowing down GBT by seconds!
< morcos> ok ok
< sipa> every single call
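sipa's "run it once every 1000 calls" suggestion amounts to a simple counter-gated spot check: keep the expensive TestBlockValidity-style verification so broken template code is still caught, without paying its cost on every GBT call. A minimal sketch; `SpotChecker` is a hypothetical name, not Core code.

```cpp
#include <cstdint>

// Gate an expensive sanity check so it runs only once per `interval` calls.
class SpotChecker {
public:
    explicit SpotChecker(uint64_t interval) : m_interval(interval) {}
    // Returns true on the 1st call, then again after every `interval` calls.
    bool ShouldCheck() { return (m_calls++ % m_interval) == 0; }
private:
    uint64_t m_interval;
    uint64_t m_calls = 0;
};
```

Luke-Jr's objection also shows up here: callers would come to rely on the cheap path's latency, so the occasional slow call becomes a surprise.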
< morcos> its just scary when you have a giant block of code protected by cs_main.. trying to reason about what in there really cares about it
< sipa> well you don't want the best block or the mempool to change underneath it
< sipa> or you will be pulling in transactions that may conflict with each other
< Luke-Jr> sipa: GBT verifies the result because it's worth it. No miner wants to find out after the fact they mined an invalid block.
< morcos> if the best block changes i've got other problems.. i don't want to be calculating this block anyway
< sipa> Luke-Jr: the code should not produce invalid blocks, period
< Luke-Jr> also, spot checking just leads to people /expecting/ a lower time than they can rely on
< sipa> Luke-Jr: if you run the check once every 1000 calls you will equally well detect broken code
< Luke-Jr> not equally well, no.
< sipa> if the code is broken, it will fail badly
< Luke-Jr> and it will mislead people into thinking it's faster than it necessarily is
< Luke-Jr> an edge case will not necessarily fail badly.
< sipa> so run with -checkmempool on then
< Luke-Jr> without the verification, there is no consensus code involved in producing blocks here..
< Luke-Jr> valid transactions != valid block
< sipa> then you will also not get an invalid mempool state
< sipa> just get that check out of the mining path
< sipa> i wonder how much gbt delay has encouraged validationless mining...
< Luke-Jr> removing the check *is* validationless.
< morcos> I'm with Luke-Jr that I think consensus code should happen at least once in the path
< sipa> it is not if the code is not broken
< morcos> but it doesn't have to happen twice as it does now
< Luke-Jr> morcos: wait, twice?
< sipa> once for building, once for checking
< morcos> CheckInputs on the transactions and TestBlockValidity
< Luke-Jr> CheckInputs doesn't check the entire block, but maybe it can be removed
< sipa> it does check the whole block eventually, as it's called for all transactions that end up in it
< Luke-Jr> sipa: no, because all the transactions can be valid without the block itself being valid
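Luke-Jr's point that valid transactions do not imply a valid block can be modeled with a toy aggregate check: each transaction passes its own checks, yet the block still violates per-block limits. Illustrative names only, not Core's actual CheckBlock.

```cpp
#include <cstdint>
#include <vector>

// Minimal per-tx summary; assume each tx already passed CheckInputs-style
// validation on its own.
struct TxSummary { int64_t size; int64_t sigops; };

// Block-level limits that only exist in aggregate.
bool CheckBlockLimits(const std::vector<TxSummary>& txs,
                      int64_t max_size, int64_t max_sigops)
{
    int64_t total_size = 0, total_sigops = 0;
    for (const TxSummary& tx : txs) {
        total_size += tx.size;       // each tx may be individually fine...
        total_sigops += tx.sigops;   // ...but the per-block totals are capped
    }
    return total_size <= max_size && total_sigops <= max_sigops;
}
```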
< morcos> but i think we're talking about different things here
< sipa> Luke-Jr: i know that
< morcos> i'm concerned with having the block-template thread hold cs_main too much...
< morcos> CheckInputs doesn't need cs_main once you have a view
< sipa> but checkinputs is called for the equivalent amount of work of validating the whole block
< morcos> sipa your concern seems to be latency in GBT
< morcos> but thats solvable in other ways
< sipa> yes
< sipa> ok
< morcos> as long as you think its ok to have brief periods of mining on an empty block or something
< Luke-Jr> GBT isn't supposed to be time-critical anyway (but I already plan to improve it)
< sipa> well it can't take seconds...
< Luke-Jr> maybe ideally it wouldn't, but it shouldn't hurt much
< morcos> forget about how long it takes, what you care about is how long after a new block comes in that you have a header you can mine that builds off it right?
< Luke-Jr> also afaik right now it only takes seconds when the miner is neglecting to keep a clean mempool
< morcos> it doesn't matter if it then takes seconds to generate a new one with txs right?
< Luke-Jr> since it needs to go over every tx in the mempool
< Luke-Jr> not because of the test afterward
< sipa> morcos: as long as fees are negligible, yes
< morcos> although seconds is really absurdly slow and we should be able to get it under that even for that case
< morcos> Luke-Jr: yeah that's one of the problems: right now it loops over all the txs twice
< morcos> once putting them all in vecPriority
< morcos> and then trying to go through vecPriority and put them all in a block
< Luke-Jr> yeah
< morcos> with a sorted mempool
< morcos> both become unnecessary (the 2nd was already unnecessary)
< Luke-Jr> sorted mempool is essentially what I'm working on
< Luke-Jr> although I need to rebase on top of the packages stuff I expect
< morcos> sdaftuar is very close to finishing the ancestor package tracking
< morcos> if you want to build a more general sorting priority score, thats probably what you should build off of
< Luke-Jr> not priority score
< Luke-Jr> sorting based on the user-defined policy ;)
< morcos> yeah thats what i meant
< morcos> don't you have to somehow turn their policy into a score?
< Luke-Jr> no
< morcos> how can you sort without a score?
< Luke-Jr> the policy is implemented mostly by a method that compares two transactions
< morcos> and if A < B and B < C then A < C right?
< Luke-Jr> right
< Luke-Jr> or at least, that's assumed by the mempool
< Luke-Jr> it might be fun to some day make blocks with a policy that just returns rand() :P
< morcos> ok well the point i was going to make was that if you could look at it as a score
< morcos> and A has child A2 and B has child B2
< sipa> you're defining a total ordering, that is equivalent to a score function
< Luke-Jr> yes, you could look at it that way; my point was that actually using a score is overcomplex
< morcos> then if you use suhas's ancestor package tracking with your score instead of fee you'll get CPFP in terms of your policy
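sipa's observation that a total-ordering comparator is equivalent to a score function can be sketched as follows. The "policy" here compares fee rate via cross-multiplication (avoiding division), which yields the same ordering a feerate score would; `TxEntry` and `PolicyOrder` are illustrative, not Core's mempool types.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

struct TxEntry { int64_t fee; int64_t size; };

// A user-defined policy as a strict weak ordering: "a mines before b".
struct ByFeeRate {
    bool operator()(const TxEntry& a, const TxEntry& b) const {
        // a.fee/a.size > b.fee/b.size  <=>  a.fee*b.size > b.fee*a.size
        return a.fee * b.size > b.fee * a.size;
    }
};

// Sort mempool entries into mining order under the policy comparator.
// Transitivity (A < B and B < C => A < C) is exactly what std::sort
// and a sorted mempool both assume, as discussed above.
std::vector<TxEntry> PolicyOrder(std::vector<TxEntry> txs)
{
    std::sort(txs.begin(), txs.end(), ByFeeRate{});
    return txs;
}
```

A comparator that returned rand(), as Luke-Jr jokes, would violate that transitivity assumption and produce undefined behavior in std::sort.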
< morcos> sipa: what are your thoughts on maintaining ancestor package tracking on nodes that aren't mining
< morcos> its not totally useless and it hopefully shouldn't be too expensive. but certainly adds more memory/cpu for something they are basically not using
< morcos> the one thing i think it could be used for is fee estimation? but thats only a maybe, i haven't really figured out how or if that would be worth it. but estimating what miners are going to do seems like potentially a valuable tool for fee estimation.
< gmaxwell> cfields_: good catch, apparently it's not inherited reliably and there is a bunch of broken software out there.
< gmaxwell> (by reliably I mean it's system dependent)
< phantomcircuit> can confirm bitcoin core requires reindex on power failure under windows
< phantomcircuit> probably leveldb env driver is stupid
< gmaxwell> phantomcircuit: fixy fixy? unfortunately I think it's unmaintained.
< gmaxwell> as chrome uses some chrome specific abstractions for IO. :(
< phantomcircuit> gmaxwell, i can take a look at it, probably it should just be replaced with the current posix version which is just fopen/fread/fwrite basically
< GitHub16> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/0fbfc5106cd9...a09297010e17
< GitHub16> bitcoin/master 579b863 Wladimir J. van der Laan: devtools: Add security-check.py...
< GitHub16> bitcoin/master a092970 Wladimir J. van der Laan: Merge pull request #6854...
< GitHub129> [bitcoin] laanwj closed pull request #6854: devtools: Add security-check.py (master...2015_10_security_checks) https://github.com/bitcoin/bitcoin/pull/6854
< wumpus> phantomcircuit: yeah https://github.com/bitcoin/bitcoin/issues/5610
< wumpus> no clue on solving it though
< wumpus> for some reason most people with a clue about windows internals seem to be blackhats, and they like things broken :p
< phantomcircuit> wumpus, it's using windows memory mapping
< gmaxwell> phantomcircuit: reports went up astronomically after 0.10 and some users reported it worked fine for them before then and fails every time since.
< phantomcircuit> im going to assume it's that
< gmaxwell> phantomcircuit: oh it's writing via mmap too?
< phantomcircuit> i just checked and there's a Win32Map something or other
< phantomcircuit> i assume it's using that
< wumpus> it has been a while since we updated leveldb, maybe some things got fixed since?
< wumpus> heh
< phantomcircuit> gmaxwell, that's about right
< phantomcircuit> it's windows... how do you even begin to debug this
< wumpus> that's a dark art
< wumpus> wouldn't be surprised if it requires at least some, possibly expensive, proprietary software applications
< wumpus> debugging-by-mutating-the-code may be the most promising approach, extract a bare bones program that uses leveldb and crashes, then change it (adding flushes everywhere) until it manages to not corrupt anymore
< gmaxwell> it's 100% reproducible it seems, which helps.
< wumpus> this is the approach that works for fixing problems with underdocumented hardware, it should work for windows too...
< gmaxwell> the fact that it went way up when we turned up checking suggests to me that it's something like a truncated record at the end that was otherwise silently recoverable.
< gmaxwell> e.g. something like the osx bug.
< wumpus> yes good point - can you send me a database that is corrupted by this problem?
< gmaxwell> phantomcircuit: you would be you, as I don't have a repro.
< phantomcircuit> heh
< phantomcircuit> gmaxwell, i suspect you're right that it's similar to the OSX bug
< phantomcircuit> i also suspect it would be easier to replace leveldb with lmdb
< wumpus> phantomcircuit: can you send me a database that is corrupted by this problem?
< gmaxwell> mmap only, goodbye 32 bit hosts.
< gmaxwell> Among other limitations. :(
< Arnavion> It's funny because if you ask Windows devs they'll tell you the dev tooling on Windows is far better than on Linux
< phantomcircuit> wumpus, i mindlessly clicked the reindex button
< gmaxwell> Arnavion: there is some aspect of "tools you know are always better than ones you don't"
< phantomcircuit> i can certainly try to do so though
< wumpus> Arnavion: it may be more user friendly on windows, but it's harder to get insight about what is happening on a system
< Arnavion> There's also the aspect of GUI tools that don't suck
< Arnavion> but I'm biased
< gmaxwell> Arnavion: you mean GUI tools inherently sucking? :P
< Arnavion> Oh, but that's what WinDbg is for
< Arnavion> You can do everything with a keyboard if you wish
< wumpus> anyhow this is not constructive
< wumpus> if you feel helpful, please demonstrate your skills by solving the leveldb corruption issue Arnavion :)
< Arnavion> I have no skills
< Arnavion> but I can take a look
< wumpus> demonstrate the great capability of windows debugging tools, then
< wumpus> phantomcircuit: but you can reproduce it! just start a new datadir, crash the thing, and send it to me
< wumpus> lmdb is not the panacea some people think - but if you feel like patching bitcoind to use it, that'd be pretty neat as we could run some benchmarks and comparisons
< phantomcircuit> wumpus, the main thing that's missing from leveldb is... sequence numbers in the journal
< phantomcircuit> if you miss a full record of writes it doesn't know
< wumpus> still, we need to debug this issue, as it is clearly windows specific, so not fundamental to leveldb
< wumpus> if this was broken on all platforms I'd agree that looking for a replacement would be advisable...
< phantomcircuit> wumpus, it *is* broken on all platforms
< phantomcircuit> the failure rate is simply much lower
< gmaxwell> one step at a time.
< gmaxwell> lets making the bleeding stop on windows first.
< wumpus> that's a nihilistic way of looking at it phantomcircuit, I like that, yes, it's all broken, some things with a very low failure rate... but it doesn't help solving the issue :)
< wumpus> I'm sure lmdb is also broken in some subtle ways
< wumpus> anyhow, if you're not going to send me a corrupted database I'm going to start working on something else
< gmaxwell> I assume he has to wait for the reindex to complete to reproduce. :P
< wumpus> wha?
< wumpus> I assumed it would also happen when crashing during reindex?
< Luke-Jr> wumpus: what is Windows specific?
< Luke-Jr> [08:13:52] <phantomcircuit> can confirm bitcoin core requires reindex on power failure under windows <-- I can confirm it requires reindex on power failure under Linux..
< wumpus> in that case, sorry, no hurry implied
< Luke-Jr> wumpus: if you in fact want such a corrupt bitcoin dir, I can probably get one for you
< * Luke-Jr> ponders where he put his USB Armory
< Luke-Jr> oh, it's plugged in
< wumpus> Luke-Jr: great!
< phantomcircuit> Luke-Jr, ive never had that issue on linux
< gmaxwell> it's interesting that the usb armory corrupts, perhaps not the same issue as windows though (though perhaps also worth fixing)
< wumpus> indeed, may be a different issue
< Luke-Jr> FWIW, removing power immediately after IBD started did NOT reproduce it
< Luke-Jr> anything in debug.log to look for to know when it does the first flush to disk?
< wumpus> set a low dbcache to force lots of flushes?
< wumpus> AFAIK it doesn't flush unless necessary until the initial block download is complete
< Luke-Jr> 'often' is too slow
< Luke-Jr> 2015-10-22 09:27:59 UpdateTip: new best=000000003f7e074587fa1684ac863519fea3c64040b05ddd04948a13f7b19b42 height=415 log2_work=40.700462 tx=423 date=2009-01-14 05:56:05 progress=0.000002 cache=419
< Arnavion> From just lazy-mode browsing the code, Win32MapFile::Sync() flushes but Win32MapFile::Flush() is a no-op, whereas the equivalent functions for posix both do things
< Arnavion> I did not see how they are called from common code to see if that matters
< wumpus> it all depends on whether this map file is used for writing
< wumpus> AFAIK on other OSes, writing uses normal file system commands, whereas reading can use mmap
< wumpus> if t his is different on windows that would explain something
< Arnavion> Windows has a dedicated FlushViewOfFile function to flush a mapped region
< Arnavion> That is called by Sync(), but not by Flush()
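Arnavion's observation suggests a concrete failure mode: if `Flush()` is a no-op, appended data lives only in a volatile buffer until `Sync()` (the FlushViewOfFile path) runs, so a power cut loses everything not yet synced. A toy model of that hypothesis, not leveldb's actual Win32 env code:

```cpp
#include <string>

// Toy model: buffer ~ a dirty mapped view, on_disk ~ what survives a crash.
struct ToyMappedFile {
    std::string buffer;
    std::string on_disk;
    void Append(const std::string& data) { buffer += data; }
    void Flush() { /* no-op, mirroring the Win32MapFile::Flush() observation */ }
    void Sync() { on_disk += buffer; buffer.clear(); }  // FlushViewOfFile analog
    // What a post-power-failure reader would see.
    const std::string& AfterPowerFailure() const { return on_disk; }
};
```

If common leveldb code ever relies on Flush() alone to persist a log record, this would match gmaxwell's "truncated record at the end" theory.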
< Luke-Jr> hmm
< Luke-Jr> it seems I can't reproduce during IBD or something
< Luke-Jr> and a fully synced state is huge :/
< midnightmagic> Luke-Jr: you're reproducing on a Windows OS?
< Luke-Jr> midnightmagic: no
< * Luke-Jr> ponders
< Luke-Jr> wumpus: well, the last chainstate I had before today is 922 MB and demanded a reindex.. not sure if that's good enough, because I can't upload 44 GB :/
< wumpus> that's kind of huge - do you perhaps know what ldb file the corruption is in?
< Luke-Jr> no, I have no idea how leveldb works :/
< Luke-Jr> I could probably just give you access to the device?
< Luke-Jr> wumpus: see PM for login info
< wumpus> ok
< Luke-Jr> setup auth for your github ssh key; let me know if there's a better one
< Luke-Jr> wumpus: the leveldb in question is under ~/.bitcoin.bak
< Luke-Jr> perhaps relevant: tar: /home/usbarmory/.bitcoin.bak/chainstate/1463292.ldb: File shrank by 2020259 bytes; padding with zeros
< wumpus> uh...
< wumpus> yes, that doesn't sound good. Were you tarring while bitcoin running?
< Luke-Jr> not sure if bitcoind is running, but if it is, it's not running on *this* database
< * Luke-Jr> wonders what it's doing so much to hang SSH
< wumpus> ok, it would not be strange if it is happening while running tar on the datadir that bitcoin was running in, those files are deleted and recreated all the time
< Luke-Jr> right
< sipa> wumpus: leveldb seems unmaintained
< wumpus> bleh
< wumpus> I'm happy we at least never decided to use it for the wallet
< sipa> maybe we should switch to sqlite... at least that's very well tested and maintained
< wumpus> it is, but it's not very fast for key/value storage, and I don't think it handles such large databases very well
< btcdrak> sipa: sqlite is very slow
< wumpus> if you need more advanced query features it is very nice though
< btcdrak> it would be so much easier and flexible if we were using SQL of some kind...
< wumpus> what would be easier?
< sipa> btcdrak: leveldb's own benchmark says that sqlite is faster for random reads
< btcdrak> sipa: interesting
< wumpus> for wallet metadata sql would be reasonably useful
< sipa> writes are much slower, but we do very few writes
< btcdrak> wumpus: well you get a lot of stuff for free with a SQL database stack.
< sipa> such as?
< Luke-Jr> # dmesg
< Luke-Jr> Segmentation fault
< Luke-Jr> -.-
< wumpus> during initial sync we do quite a lot of writes (every coin that is touched is written back), after that, most are reads
< btcdrak> sipa: indexes for a start. the ability to query the dataset externally from bitcoind etc
< wumpus> using database indexes would mean using a very verbose format
< sipa> btcdrak: we're not storing any data in a way that's useful to index
< sipa> we'd lose massive performance by expanding the data
< wumpus> that kind of defeats the purpose - bitcoind's database should optimized for its own purposes as running a node, not external ones
< btcdrak> I had wondered about leveldb's ongoing support. it really looks like an orphaned project at this time.
< wumpus> sipa: right
< jgarzik> btcdrak, the expanded dataset in SQL is something like 40GB at least
< jgarzik> not including the block data itself
< jgarzik> sipa, worst comes to worst, we can do our own key/value store...
< wumpus> also bitcoin's databases are not an external interface, don't query them directly unless it's to write troubleshooting/recovery tools
< jgarzik> there are actually a few optimizations you can make if you assume your keys are hashes already
< jgarzik> not that seeks matter these days, but some seeks can be eliminated
< wumpus> reinventing the wheel should be the last recourse
< Luke-Jr> wumpus: whatever just happened appears to have fried the device, so I guess it'll be a while :/
< wumpus> Luke-Jr: I hope it wasn't me logging in! :p
< Luke-Jr> heh, I don't see how that could do it
< sipa> leveldb does have a benchmark about batch write performance
< sipa> where sqlite is much slower
< wumpus> I don't see either, I didn't even get a prompt
< wumpus> I've disconnected now
< sipa> though sqlite does win in synchronous write speed, which we also do frequently
< Luke-Jr> wumpus: makes me wonder if it's just a bad microSD card, which could make the failures invalid
< wumpus> would be interesting to patch sqlite into bitcoind and compare and do benchmarks
< sipa> the only reason i'm suggesting sqlite is because it is *very* well tested afaik
< * wumpus> says that for the second time today about a different database
< wumpus> sipa: also on windows?
< wumpus> leveldb is also very well tested, it is used by many companies in production... just mostly on linux/unix servers
< sipa> yes
< wumpus> do you know, how does sqlite do at caching?
< btcdrak> sipa: wumpus: maintenance is probably more of a concern than anything else. I do remember wondering about leveldb's longevity. sqlite is way too popular to disappear.
< jgarzik> sqlite leans a lot on OS caching, temporary files etc.
< wumpus> jgarzik: it can't do a much worse job than leveldb at that at least
< btcdrak> if we're just doing key/value what about all those database that do that sort of thing, like those nosql stacks?
< jgarzik> I'm a big fan of sqlite, and used it in my DNS server project. I don't know that sqlite has batch update - does begin/commit suffice given our workflow? They are -mostly- equivalent but not 100% equiv
< btcdrak> still. sqlite would be a better choice. But here's the thing, if we use sqlite then you're opening up the door for just about any database backend. specially if we made an abstraction layer
< wumpus> "The maximum size of a database file is 2147483646 pages. At the maximum page size of 65536 bytes, this translates into a maximum database size of approximately 1.4e+14 bytes " ok, apparently I misremembered
< wumpus> btcdrak: there aren't that much embeddable key/value stores
< btcdrak> wumpus: true.
< btcdrak> and sqlite wins in terms of support/maintenance.
< jgarzik> I certainly wouldn't suggest expanding the bitcoind db in SQL, but using it as a dumb blob datastore shouldn't be a big issue
< wumpus> and most of them are likely one-off projects, even worse maintained than leveldb
< wumpus> right
< wumpus> jgarzik: +1
< btcdrak> +1
< Luke-Jr> reminder: UTXO db is consensus-critical.
< sipa> very aware
< wumpus> sqlite is extremely easy to embed in a project, it's just one C file
< jgarzik> kv datastores off the top of my head: leveldb, gdbm, [n]dbm, tokyo cabinet, kyoto cabinet, berkeley db
< jgarzik> wumpus, er huh?
< Luke-Jr> wumpus: surely one *big* C file. all of SQL can't be trivial.
< jgarzik> wumpus, sqlite is quite bigger than one C file
< wumpus> hm it used to be at least
< jgarzik> berkeley db is well maintained :) :)
< Luke-Jr> sqlite is like 140kLOC
< jgarzik> for sqlite, if you pre-compile the SQL queries, it goes pretty fast
< sipa> of course we'd precompile them...
< sipa> i don't think git subtreeing sqlite is reasonable
< wumpus> oh this is why I thought it was one C file, yes it's a very large one with everything pasted together: https://www.sqlite.org/amalgamation.html
< wumpus> sipa: license-wise?
< sipa> size wise
< jgarzik> sqlite is public domain, one of the few
< sipa> oh, that amalgamation is pretty awesome
< sipa> we can compile it with various flags for extensions disabled
< wumpus> apparently leveldb is 26kLOC in total
< wumpus> sipa: yep
< jgarzik> imo LOC and size are secondary factors
< jgarzik> primary is on going maintenance, reliability, ...
< wumpus> LOC would be important if there is the chance we end up having to maintain/troubleshoot it ourself
< wumpus> (like now for leveldb)
< jgarzik> indeed
< jgarzik> I think pgdb will likely be 10k loc when finished
< wumpus> "SQlite as a key-value database" https://www.sqlite.org/cvstrac/wiki?p=KeyValueDatabase
< wumpus> looks very easy to do, the only question is indeed jgarzik's: whether TRANSACTION/COMMIT is equivalent to batching as we use it now
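The property a BEGIN...COMMIT flush would need to match leveldb's WriteBatch can be modeled directly: staged writes become visible atomically on commit and vanish on rollback, which is exactly what the utxo flush requires after a crash. A toy in-memory sketch of that semantics, not sqlite itself:

```cpp
#include <map>
#include <string>

// Toy transactional store: Put() stages, Commit() applies atomically,
// Rollback() models a crash before COMMIT (nothing applied).
class ToyTxnStore {
public:
    void Put(const std::string& k, const std::string& v) { staged_[k] = v; }
    void Commit() {
        for (const auto& kv : staged_) committed_[kv.first] = kv.second;
        staged_.clear();
    }
    void Rollback() { staged_.clear(); }
    bool Committed(const std::string& k) const { return committed_.count(k) != 0; }
private:
    std::map<std::string, std::string> staged_;
    std::map<std::string, std::string> committed_;
};
```

Since sipa notes sqlite is fully transactional, the remaining questions from the chat are performance and any per-transaction row limits, not atomicity.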
< jgarzik> another sql caveat - based on experience - you have to be careful that your 'strings' are not translated in any way by the engine via unicode etc.
< Luke-Jr> jgarzik: nobody is going to maintain consensus-critical behaviour
< wumpus> sql injections? *ducks*
< jgarzik> I used sqlite to store binary DNS records
< sipa> sqlite is fully transactional
< Luke-Jr> no matter what we use, we will need to maintain it
< Luke-Jr> even if that just means reviewing each upstream release
< sipa> or just reviewing their test practices
< jgarzik> sqlite is very well hammered
< wumpus> (no, sqlite has proper parametrized queries, and from my experience its BLOBs are binary clean)
< jgarzik> few maintain software to consensus critical standards though
< sipa> agree, but a database interface clearly should... it's perfectly specified in both directions: it should find every record that exists, and not find any record that doesn't exist
< jgarzik> another bit - make sure there are no limitations on rows-modified-at-once
< sipa> so anything that is not consensus compatible is a bug
< jgarzik> otherwise we re-introduce the BDB fork issue (locks)
< Luke-Jr> sipa: fixing bugs is a consensus compatibility bug!
< sipa> jgarzik: the "bug" in the bdb case was that we didn't chefk tue bdb return value, and regarded failure to write as a block validation failure
< sipa> Luke-Jr: fully agree there
< Luke-Jr> anyhow, I need to go to sleep, but I think we will regret it if we import sqlite to consensus-critical code.
< jgarzik> sipa, well there was an issue where we were hitting BDB max lock
< jgarzik> sipa, the point still stands - sql engines do sometimes have max-rows-updated-in-one-transaction type limits
< sipa> jgarzik: sure, but that wouldn't have been a consensus failure threat if we didn't treat out-of-locks as invalid-block
< sipa> jgarzik: it was us who turned an administrative restriction into a consensus problem
< jgarzik> so, sql transaction limits is another one for the eval list
< sipa> if we would just have gone "oops! can't write! quitting", like we do for out of disk, bdb would not have caused forks
< wumpus> right, if it would treat database errors as a fatal error instead of rejection, it'd wouldn't have been as bad
< jgarzik> nod
< sipa> it would have been a bad DoS attack, though
< jgarzik> yep
< sipa> but not a fork
< wumpus> but fairly easy to resolve
< wumpus> right
< sipa> of course, you are absolutely right we need to be aware of such limits in sqlite or whatever we're incestigating
< sipa> eh, investigating
< jgarzik> lol
< sipa> the keys are like right nezt to each other
< sipa> also, sqlite databases are single files, right?
< jgarzik> yes*
< jgarzik> * some temporary files such as journals are created along the way, and must exist for recovery post-crash
< sipa> ok, similar to bdb
< sipa> that's fine for application database stuff
< sipa> maybe not wanted for wallets
< jgarzik> sqlite files do want periodic maintenance via 'VACUUM'
< sipa> we can do that in our flushtodisk function
< jgarzik> they get fragmented, old records cluttered, performance degrades a bit over time
< jgarzik> it is a very, very heavyweight operation
< jgarzik> auto_vacuum handles large deletes, but not fragments over time
< jgarzik> could just run it at startup and then auto_vacuum
< sipa> ok
< btcdrak> I've not seen everyone get so excited about something for a long time.
< wumpus> "The author disclaims copyright to this source code. In place ofa legal notice, here is a blessing: May you do good and not evil. May you find forgiveness for yourself and forgive others. May you share freely, never taking more than you give." I love sqlite's appraoch to licensing
< btcdrak> wumpus: if only there were more like that
< jgarzik> At https://www.sqlite.org/cvstrac/wiki?p=KeyValueDatabase sqlite seems across the board worse... except for large numbers of records (our use case)
< jgarzik> I'm happy to create an sqlite branch for bitcoind, if nobody else is working on it
< btcdrak> jgarzik: go for it
< sipa> how mature is sqlite4?
< jgarzik> good question
< jgarzik> vast majority in field is 3
< sipa> its design seems much closer to leveldb
< sipa> as it is... a key/value store internally
< sipa> i wonder if there is an api to access that key/value store interface
< sipa> i have read before that you can
< wumpus> I was starting on it, but will happily leave it to you jgarzik
< sipa> The default built-in storage engine is a log-structured merge database. It is very fast, faster than LevelDB, supports nested transactions, and stores all content in a single disk file.
< sipa> ^ from the sqlite4 design
< wumpus> where do you see anything about sqlite4? at least the download page only has 3
< sipa> ok, sqlite4 is far from released
< wumpus> oh, "trunk" :-)
< sipa> not an option at this point
< wumpus> sounds great though
< sipa> also, sqlite3 does not promise backward write compatibility... so it is like bdb, and we need to guarantee only increases in version
< wumpus> it does promise backward read compatibility?
< sipa> yes
< wumpus> well then it's not too bad, if it detects a downgrade, it could export|import the database
< sipa> wait
< wumpus> berkeleydb doesn't even guarantee that much
< sipa> databases created by 3.3 can't be read by earlier 3.x versions
< sipa> lol, and this caused so many problems they reverted to the old format by default in 3.3.6
< jgarzik> note - single file db implies large file support req
< sipa> is that not available on every platform these days?
< wumpus> is that unreasonable these days?
< jgarzik> every platform, but some filesystems may be e.g. fat32 for weird reasons
< wumpus> ... and has been for 10 years or so
< jgarzik> I think it's reasonable
< sipa> oh: a database created by version X will always be read/write compatible with version X, even if modified by later versions
< jgarzik> just noting
< wumpus> fat32 is pretty much dead
< sipa> so it's not an auto-update to whatever the format in the later version is, like bdb
< jgarzik> sipa, yeah, kernel filesystem rules
< wumpus> it also doesn't support bigger disks
< jgarzik> wumpus, sadly very much alive on USB sticks
< sipa> put bitcoind's chainstate on a USB stick: you are eaten by a grue
< wumpus> *small* USB sticks
< sipa> also: for now the chainstate still fits in 2 GB...
< sipa> ... for now
< jgarzik> as I said... just making a note because it's something of which people should be aware. continuing to hack on it :)
< sipa> awesome, thanks
< wumpus> great
< wumpus> 32GB is pretty much the limit for FAT32 devices, it's possible to have larger volumes, but it becomes really inefficient and at least windows doesn't allow formatting them
< GitHub122> [bitcoin] MarcoFalke closed pull request #6866: [trivial] fix white space in rpc help messages (master...MarcoFalke-2015-rpcWhitespace) https://github.com/bitcoin/bitcoin/pull/6866
< morcos> sipa: i'm trying to think about how to preserve the state of the mempool while generating block templates
< sipa> morcos: how about being able to mark mempool txn as locked?
< sipa> individual transactions, that is
< morcos> sipa: how do you mean? the block template generation code needs to mark all of them as locked?
< morcos> or do you mean if you need to delete something, such as with RBF, then you mark it as deleted and go ahead and add the new one?
< sipa> morcos: while the template generator runs, it locks transactions one by one as they are added to the template
< sipa> so the next transactions it fetches are guaranteed to be not conflicting with the ones it already has
< sipa> you want a new incoming block to override that though, and cancel the generator to make it start over
< morcos> yes agreed with that last part, i'll tackle that later
< morcos> but i'm not sure how to just lock txs one by one, you'd have to lock the mempool.cs each time you went to look up a tx to decide to add it to the mempool
< morcos> i'm starting off by trying to modify the existing CNB to hold locks a lot less
< morcos> so i'm imagining it basically copies ptrs to all the txs in the mempool and the score it needs for them
< morcos> then it doesn't need to hold mempool.cs any more as long as the ptr does not become invalid
< morcos> which would only happen if the txn is deleted
< sipa> i guess transactions would get a refcount
< sipa> and the refcount value itself would be protected by mempool.cs
< morcos> but i think that has to be for all of the txs as you're running the logic of figuring out which ones need to be in the template
< morcos> yeah, but i was imagining it was a singular refcount
< sipa> hmm, does the mempool not hold some index sorted by template inclusion preference?
< morcos> instead of per tx
< morcos> yeah so thats where my approach might not be the best approach b/c the new template generation code will have such an index
< sipa> another idea is to remove the storage of txn from the mempool entirely
< morcos> but even still you need to iterate far past the size of a block
< sipa> you just get a transaction store manager, which you give transactions, and it returns a refcounted pointer
< sipa> the mempool stores the pointers
< sipa> i guess that's just smart pointers
< sipa> so the template generator would grab pointers to the top N (some multiple of a block), increase their refcount, release mempool.cs
< sipa> then goes off to build a block, verify if wanted, ...
< sipa> and then release the pointers
< sipa> the mempool can change during that time... you just know that the set you grabbed at that point while holding the lock is internally consistent
< sipa> and consistent with the block it was building on
< morcos> yes thats what i'm going for
< morcos> i guess it depends on how big the set is compared to the whole mempool
< morcos> so my idea was that rather than marking individual txs
< morcos> you just tell the mempool, hey, i'm referring to things, don't delete
< morcos> and then occasionally that gets freed, and anything marked for deletion can happen
< sipa> hmm
< sipa> not sure
< morcos> since deletes are rarer and smaller than adds
< sipa> sounds more complicated
< morcos> ok, maybe i'll explore both
< morcos> next question and this might be related to CSV
< morcos> really annoying that CheckInputs needs cs_main
< morcos> doesn't seem like it should have to?
< sipa> so i think it's relatively easy this way: turn the storage of txn in the mempool into smart pointers, and you can very cheaply and efficiently ask the mempool for the set of all its transactions
< morcos> you're storing the hashblock that its valid at, and that can't change height
< sipa> it shouldn't be needed, indeed
< morcos> sipa: not that easy, you need stuff from the mempool entries, thats where fees are stored
< morcos> so you still have to iterate to figure out which pointers you need and copy the meta information
< sipa> ok, so that fee information needs to be inside the smart pointed-to objects
< morcos> so all you've done now is taken the multi-index approach and say ehh forget that, lets just have multiple indexes referring to this same set of pointers
< sipa> i may be missing something
< morcos> i think the only complication from my idea is damn RBF... otherwise you just don't call any eviction code except periodically... and there is no problem running it only periodically
< morcos> and you don't even have to track anything
< sipa> what i'm suggesting is a way to make cheap snapshots of subsets of the mempool at a given time
< sipa> those snapshots themselves don't have an index, but can be ordered by one at the time they are created
< sipa> isn't that enough?
< morcos> yeah maybe, i think i'm just leery of the double indirection now where the multi-index is a multi-index of ptrs, but maybe thats something stupid to worry about
< sipa> the transactions inside the mempool already have a dozen or so indirections
< sipa> ok, more like 6
< sipa> but still!
< morcos> so something like a boost::shared_ptr
< sipa> yes, that specifically :)
< sipa> sorry, i thought it was called smart_ptr :)
< sipa> you probably want to wrap CTransaction inside something that also contains its (direct) fee and perhaps other immutable statistics
< sipa> and then have shared_ptrs to those inside the mempool entries
< sipa> heh, we could even serialize the transactions inside to reduce memory usage...
< morcos> yes, so then it'll get a bit more complicated with ancestor package tracking when you have mutable state like ancestor fee rate that you want to copy to your template generation code, but that was going to be difficult period
< sipa> why do you need it in the template generator code?
< morcos> thats the sort the logic uses
< morcos> yikes
< sipa> sort the list of copied pointers according to the sorting criterion you want while creating it (so while holding mempool.cs)
< sipa> then release mempool.cs and you can forget it
< morcos> ok, but see thats the complicated part that takes a long time
< sipa> ah, ok
< morcos> the tx thats sorted at the top really links to a lot of other txs, whose ptrs you have to grab too
< sipa> hmm, right
< morcos> but then how do you solve the problem of once you include that in your template
< morcos> everything elses sort changes
< sipa> i wouldn't bother with changing sorts
< morcos> well that'll be the magic of the algorithm, it'll probably have to be some heuristic, but its very easy to have expensive chain A-B-C and so C is sorted first, but then cheap C2 is still sorted quite high.
< morcos> uh not exactly but close
< morcos> but yes, i agree we don't have to be perfect
< morcos> ok you've given me some ideas... let me see what i come up with
< sipa> great
< sipa> feel free to completely disregard my ideas :)
< jgarzik> RE locking transactions one-by-one - that is the typical parallelism solution
< jgarzik> just need a gatekeeper (core struct lock) for insert/delete/move
< GitHub181> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/a09297010e17...2cd020d054d6
< GitHub181> bitcoin/master 3cb56f3 Daniel Cousens: *: alias -h for --help
< GitHub181> bitcoin/master 2cd020d Wladimir J. van der Laan: Merge pull request #6846...
< GitHub119> [bitcoin] laanwj closed pull request #6846: [Trivial] bitcoind: alias -h for -help (master...aliash) https://github.com/bitcoin/bitcoin/pull/6846
< GitHub145> [bitcoin] laanwj pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/2cd020d054d6...f2c869aef2e7
< GitHub145> bitcoin/master c6824f8 J Ross Nicoll: Add DERSIG transaction test cases...
< GitHub145> bitcoin/master f2c869a Wladimir J. van der Laan: Merge pull request #6848...
< GitHub77> [bitcoin] laanwj closed pull request #6848: Add DERSIG transaction test cases (master...bip66-tests) https://github.com/bitcoin/bitcoin/pull/6848
< gavinandresen> FYI: git HEAD master isn't working for me on OSX. Running make check gets me:
< gavinandresen> libc++abi.dylib: terminating with uncaught exception of type boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::lock_error> >: boost: mutex lock failed in pthread_mutex_lock: Invalid argument
< gavinandresen> I'm very confused, because git bisect blames a commit that works....
< cfields> gavinandresen: sounds like your boost may be built against the wrong stdlib? checking master on osx here now.
< gavinandresen> cfields: I thought that too, did a brew uninstall / brew install of boost
< btcdrak> morcos: sorry was afk. My other question was how do mempool evictions play into the mix? I assume one doesn't want to evict a txn that you're trying to mine.
< morcos> btcdrak: yes, thats what my concern was. the only way a pointer to a tx becomes invalid now is if the tx is deleted. A) through being mined, in which case you want to abort block template generation anyway or B) through eviction or RBF
< morcos> for B1) eviction, i thought it would just be easy to just skip eviction if the template_tx_lock is held, and just make sure eviction gets to run every so often.
< morcos> but B2) RBF, doesn't have an ideal solution, i wanted to just flag those txs, but maybe sipas idea with shared_ptrs is cleaner
< cfields> gavinandresen: ok here. What compiler/boost/osx version?
< gavinandresen> cfields: Apple LLVM version 7.0.0 (clang-700.0.72), boost /usr/local/Cellar/boost/1.58.0, osx 10.10.5 ....
< gavinandresen> ... but it is very possible my machine is in some weird state, I upgraded my XCode yesterday.
< btcdrak> gavinandresen: first rule of fight club - "never upgrade XCode"
< jgarzik> heh
< jgarzik> just did last night...
< cfields> gmaxwell / jgarzik: ping. I must be going crazy. From what I can tell experimentally (and backed by man pages), on Linux, our incoming connections are being treated as blocking.
< jgarzik> cfields, post-accept(2) you set sockets to non-blocking
< cfields> because O_NONBLOCK isn't inherited by accept
< jgarzik> nod
< cfields> jgarzik: should, sure
< cfields> but we don't
< jgarzik> oops
< jgarzik> seems surprising, seems like it would have been well noticed before now
< sipa> that would imply that the "opportunistic write" always succeeds
< cfields> for the most part it should be ok because reads wait for select(), but i'm surprised it hasn't caused issues
< sipa> that should be trivial to test?
< jgarzik> no wait - send/recv have per-call NB flags
< cfields> jgarzik: ah right
< cfields> jgarzik: still though, i've read about select() acting funny on linux with blocking sockets
< jgarzik> like haha, that's a laugh funny?
< * jgarzik> attempts to implement db iteration without brute force 'select * from table' into RAM ;p
< cfields> "Under Linux, select() may report a socket file descriptor as "ready for reading", while nevertheless a subsequent read blocks... Thus it may be safer to use O_NONBLOCK on sockets that should not block."
< jgarzik> recv() will fail in that case
< cfields> Either way, forcing them to nonblock after accept can't hurt, right?
< cfields> hmm
< sipa> jgarzik: i guess you can get some sort of object/state which you can ask for "more results" ?
< cfields> it wouldn't just block?
< GitHub132> [bitcoin] MarcoFalke opened pull request #6870: [trivial] Misc cleanup and translations (master...MarcoFalke-2015-trivial3) https://github.com/bitcoin/bitcoin/pull/6870
< cfields> jgarzik: er, the DONTWAIT. nm.
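[editor's note] The fix cfields is describing, explicitly marking the fd returned by accept(2) as nonblocking since Linux does not carry O_NONBLOCK over from the listening socket, is a short fcntl dance. A minimal POSIX sketch; the helper name is made up, and a socketpair stands in for an accepted connection:

```cpp
#include <cassert>
#include <fcntl.h>
#include <sys/socket.h>
#include <unistd.h>

// Set O_NONBLOCK on a descriptor, e.g. one freshly returned by accept(2),
// which on Linux does NOT inherit the listening socket's nonblocking flag.
// Returns false if either fcntl call fails.
static bool SetNonBlocking(int fd) {
    int flags = fcntl(fd, F_GETFL, 0);
    if (flags == -1) return false;
    return fcntl(fd, F_SETFL, flags | O_NONBLOCK) != -1;
}
```

With the flag set, a recv() on an empty socket returns -1 with EAGAIN/EWOULDBLOCK immediately instead of blocking, which guards against the select()-reports-ready-but-read-blocks case quoted from the man page above.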
< jgarzik> sipa, you can 'select *' and iterate in one direction (forwards; next-result), whereas the CDBIterator wants Next, Prev and Seek operations
< * jgarzik> is studying the callers/users of CDBIterator now
< sipa> jgarzik: i don't think we use prev
< jgarzik> so it seems
< sipa> and seek would be equivalent to just starting the search
< cfields> gavinandresen: that's no good. i'm confused about that error, though
< jgarzik> sipa, well seek-start, seek-end and seek-key are all different
< sipa> jgarzik: i think the abstraction can be changed so we have a few 'entry types' (which would translate to separate sqlite tables)
< sipa> jgarzik: and only have iterators for all-data-within-one-entry-type
< jgarzik> sure
< sipa> which would be compatible with both leveldb and sqlite
< jgarzik> ah a few of these db ops are like Prev(), unused. job easier++
< jgarzik> hrm
< jgarzik> sipa, lex order?
< jgarzik> seems so
< sipa> some things need lex order, yes
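[editor's note] The narrowed iterator contract sipa proposes (forward-only, lexicographic order, no Prev(), seek just restarts the scan) is easy to state as an interface either backend could satisfy. A sketch using std::map as a stand-in store; the class name and the SQL in the comment are illustrative assumptions, not actual CDBIterator code:

```cpp
#include <cassert>
#include <map>
#include <string>

// A narrowed iterator contract that both leveldb and sqlite could satisfy:
// forward-only, lexicographic key order, and Seek() simply starts a fresh
// scan at the first key >= the target. In sqlite this would map to e.g.
//   SELECT key, value FROM <entry_type> WHERE key >= ?1 ORDER BY key;
// stepped forward one row at a time; Prev() is deliberately absent.
class ForwardKVIterator {
    using Store = std::map<std::string, std::string>;  // lex-ordered, like leveldb
    const Store& store;
    Store::const_iterator it;
public:
    explicit ForwardKVIterator(const Store& s) : store(s), it(s.begin()) {}
    void Seek(const std::string& key) { it = store.lower_bound(key); }
    bool Valid() const { return it != store.end(); }
    void Next() { ++it; }
    const std::string& Key() const { return it->first; }
    const std::string& Value() const { return it->second; }
};
```

Splitting the data into a few entry types (one table/prefix each) then gives each type its own independent forward scan, as suggested above.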
< gavinandresen> FYI: my build-failing-on-OSX problem went away with a 'git clean -dxf' then re-autogen/configure/build....
< * jonasschnelli> did accidentally send untracked files to nirvana several times with `git clean -dxf`...
< * sipa> too
< jonasschnelli> heh
< gmaxwell> I haven't tried it on bitcoin core yet, but the latest functionality in rr (replay debugger) looks pretty great: http://robert.ocallahan.org/2015/10/rr-40-released-with-reverse-execution.html
< jcorgan> did i hear rumblings about switching to sqlite?
< maaku> i wish there was a 'git stage --clean' command
< btcdrak> meeting over at #bitcoin-dev now
< jcorgan> isn't it git checkout . ?
< btcdrak> jcorgan: yes, jgarzik is working on it now
< jcorgan> oh sweet. then we can use sqlcipher and get full db ondisk encryption
< jcorgan> it's a drop-in replacement/fork for sqlite that operates at the db page level, like dmcrypte for block devices
< sipa> why do you need encryption for public data? :)
< maaku> sipa: i presume he's talking about wallet?
< sipa> i wouldn't use sqlite for the wallet
< gmaxwell> jcorgan: you saw that discussion, but I expect it won't happen (At least with sqllite3) as-- if I'm not mistaken-- btcd tried it and found the performance unacceptable. (but we should anyways, both to have a comparison point, and because sqllite4 might be faster even if 3 is too slow)
< jcorgan> encrypt ALL THE THINGS
< gavinandresen> RE: switching to sqlite: this CACM article was very interesting: http://cacm.acm.org/magazines/2015/10/192379-crash-consistency/abstract
< gavinandresen> ... bashes on leveldb a bit, praises sqlite ....
< * maaku> is very confused as to what the benefit of switching to sqlite would be for this particular use case
< jgarzik> maaku: it's maintained and reliable
< gavinandresen> maaku: assuming sqlite is better about crash consistency (and from all I've read, it is), would cut down on the "core sucks, because my PC crashed and then eleven days to reindex..."
< sipa> yes, that's the only reason
< sipa> sqlite is known to be rock solid
< sipa> however, performance would need to be acceptable
< maaku> sipa: well, also interop and the ease of using a different sql backend
< jcorgan> i would definitely advocate making a sql abstraction layer with pluggable back ends, with sqlite as the default
< CodeShark__> I'm using ODB in my stack for that
< sipa> jcorgan: to be clear, i think SQL is a downside
< sipa> we're not looking for a database, and turning things into a database is against what we're trying to achieve
< jgarzik> nod
< jcorgan> my secret plan revealed, again
< sipa> our database is not designed to be accessible by multiple processes
< sipa> nor do we need any fancy indexes or queries
< gavinandresen> if there's a key-val store as well-supported and rock solid as sqlite..... that's the right answer.
< jgarzik> I still like the custom db approach, but I'm highly biased
< sipa> if sqlite offered a mechanism to just use its low-level key-value store, i would
< sipa> (because it's a database layer built on top of a key-value store)
< jgarzik> program in sqlite VM assembler and you can...
< * jgarzik> runs
< maaku> imho the answer would be to switch to a better maintained descendant of leveldb (there are many)
< maaku> also minimizes risk of fork
< gmaxwell> Its unhelpful to have this conversation in parallel with the meeting.
< kanzure_> upstream maintainers, apparently
< kanzure_> oops, bad scrollback
< bsm1175321> Don't want to interject in the meeting, but I re-ran the SQLite vs. libdb benchmarks sipa mentioned (because they were 6 years old).
< kanzure_> huh? why should that not be mentioned?
< bsm1175321> Because the topic has moved on, and the conclusion was to continue research.
< bsm1175321> Above link updated with 500k records, transactional DB. (Took a few minutes to run)
< BlueMatt> IIRC gmaxwell was gonna ask the crash consistency guys for their test harness?
< BlueMatt> so we could test other dbs
< BlueMatt> or maybe I'm crazy
< BlueMatt> someone should do that, though
< BlueMatt> before we decide
< sipa> maaku: sqlite is better at large writes than leveldb
< maaku> btcdrak: sqlite4 maybe matches leveldb ... because it's basically using leveldb's datastructure under the hood
< sipa> as leveldb writes everything twice to disk
< btcdrak> gmaxwell: this was the discussion earlier today about sqlite: https://botbot.me/freenode/bitcoin-core-dev/2015-10-22/?msg=52492157&page=2
< gmaxwell> btcdrak: yes I read it. I am confident from experience that its incorrect, but I still think jeff's work is very useful.
< btcdrak> gmaxwell: yeah, I was also surprised, but fingers crossed something impressive comes out of jeff's research.
< gmaxwell> (in part because sqllite4 may be faster)
< btcdrak> sqlite4 is still in development though isnt it?
< gmaxwell> sure, not useful yet.
< btcdrak> i wonder how far on they are and if they have a release schedule yet
< Luke-Jr> am I missing something? why is this being treated as a non-consensus thing?
< sipa> it obviously is consensus critical
< sipa> it's not a consensus change though
< Luke-Jr> we don't and can't know that.
< sipa> recompiling with a different libc is also consensus critical
< Luke-Jr> replacing 26kLOC with 140kLOC is impossible to be sure about
< BlueMatt> Luke-Jr: we should be confident that it is not before anything moves forward
< sipa> or running on a new linux kernel
< Luke-Jr> BlueMatt: I don't think we can.
< helo> i really appreciate the meetings, great idea (btcdrak?)
< btcdrak> helo
< sipa> we are making certain assumptions from other software we are using
< BlueMatt> Luke-Jr: then we're stuck with a shitty db that corrupts randomly and isnt self-consistent (ie not consensus-compatible)
< Luke-Jr> best I can see happening, is knowing it's unlikely to have any accidentally-triggered failures
< gmaxwell> Luke-Jr: it's not being treated as anything right now, it's just a test.
< sipa> that includes compilers, libraries, and operating systems with _well specified_ behaviour
< gmaxwell> Luke-Jr: by that basis we don't know that leveldb is self-consistent, or consistent between windows / unix, etc.
< maaku> Other than Windows corruption errors, is this solving any real problems right now?
< btcdrak> Luke-Jr: research is a good thing.
< gmaxwell> maaku: we have durability problems on all platforms, though the windows ones are very severe.
< Luke-Jr> btcdrak: yes, if this is just research fine
< gmaxwell> The performance of leveldb is also problematic already, (well reported for very large databases), enough that our fat caching layer is very very critical for performance, and thats causing us other problems.
< gmaxwell> maaku: durability problems in this database are incompatible with the survival of a network where puning is widely used.
< sipa> puny network
< gmaxwell> hah
< gmaxwell> pruning*
< btcdrak> LOL
< wumpus> in general leveldb works fine; greatest concern is that it seems to be no longer maintained, so any bugs (like corruption on crashes in windows) are unlikely to get solved
< maaku> i thought gmaxwell was declaring sipa a risk to bitcoin there
< gmaxwell> maaku: well that much is obvious.
< sipa> we could try to find another win env for leveldb
< sipa> there may be bugs in that
< jonasschnelli> i know this sounds harsh: but did we ever consider dropping support for windows? Would this be so wrong?
< wumpus> I'm sure it is possible to fix the windows issue, but it's not really nice to inherit a dependency and suddenly have to maintain it
< Luke-Jr> jonasschnelli: I think it would be problematic to do so right now.
< Luke-Jr> jonasschnelli: as bad as it is, many people still use it
< gmaxwell> jonasschnelli: I think it would be a bad idea, its a very widely used platform and dropping it would be a major move against the ability of people to independently run, audit, etc. the bitcoin system.
< wumpus> jonasschnelli: well I've been close to declaring that. it wouldn't be pretty but we have zero active developers for windows.
< gmaxwell> The security problems of windows stink, but hardware wallets are a tidy tool to address that.
< wumpus> jonasschnelli: then again, the software is working for a lot of people also on windows
< jonasschnelli> not even speaking for the wallet. Just the node/core itself.
< gmaxwell> we are in a state right now where it is almost constructively non-supported by our failure to do a great job about it.
< maaku> well to be clear, gmaxwell's concerns about a pruned network wouldn't apply if the durability issues were wholly isolated to the windows platform
< Luke-Jr> jonasschnelli: then people will just stop running a node :/
< wumpus> gmaxwell: I disagree
< Luke-Jr> before we drop Windows support, we would ideally want the alternative to be "run a node on Linux", not "stop running a node altogether"
< gmaxwell> maaku: they are not wholly isolated to windows, though perhaps its safe enough elsewhere that my concern is less of an issue.
< jonasschnelli> wumpus: Agree. It's running. But i think we should not move away from leveldb because of windows...
< wumpus> gmaxwell: many people are running it on windows, and not complaining at all
< wumpus> gmaxwell: no need to blow this issue out of proportion
< gmaxwell> wumpus: and many people have stopped running it too.
< wumpus> gmaxwell: obviously
< gmaxwell> I believe more people have stopped running bitcoin core on windows than currently run it.
< gmaxwell> (stopped due to corruption issues of various forms)
< wumpus> well this is an open source project - if we don't have developers supporting windows, we can't support windows
< jonasschnelli> are most corruption issues caused by leveldbs layer?
< gmaxwell> (not just leveldb but also AV)
< sipa> jonasschnelli: we don't know
< gmaxwell> We know there is leveldb corruption on unclean shutdown, and we know about antivirus mediated corruption. But there may be more things we don't know about.
< wumpus> if the corruption issues are as bad as you say they are, and no one is fixing them, then we should indeed drop windows support
< Luke-Jr> maybe if we stop supporting Windows "officially", but leave it open, others will step up to do it
< wumpus> but I don't believe you
< gmaxwell> wumpus: or we should get more windows developers.
< wumpus> gmaxwell: well, good luck with that
< gmaxwell> Which can be done.
< gmaxwell> wumpus: I mean I can _hire_ people for this, but it hasn't been on my radar.
< wumpus> in any case, this is not constructive *goes back to sleep*
< jonasschnelli> we could ship it together with a vmbox ubuntu. *duck*
< * Luke-Jr> wonders if Diapolo is good enough at C++ that he could transition to more than just GUI if he gets funding
< * jonasschnelli> looks strange at Luke-Jr
< gmaxwell> wumpus: :-/ I think you're misunderstanding my perspective, sorry I've communicated poorly.
< Luke-Jr> jonasschnelli: got a better suggestion? :P
< Luke-Jr> I don't know many Windows developers..
< jonasschnelli> Luke-Jr: hah. No. Not really. :)
< jonasschnelli> serious: what would be the overhead to run bitcoin-qt/core in a vm on windows?
< Luke-Jr> maybe the people who did those libconsensus bindings?
< wumpus> gmaxwell: it's not impossible to just solve the windows crash corruption issue
< jonasschnelli> 10% CPU loss, 512MB ram overhead?
< Luke-Jr> jonasschnelli: I would be surprised if a VM kill reproduced the problem
< wumpus> gmaxwell: I even proposed to help with it today if someone can send me a corrupted database
< gmaxwell> wumpus: yup. did anyone do that?
< wumpus> gmaxwell: no
< jonasschnelli> Luke-Jr: the idea would be to bundle bitcoin-qt/core on windows together with vbox and a tiny distro. This would eliminate some ugly platform dependencies.
< Luke-Jr> gmaxwell: I tried to, but couldn't reproduce it with IBD, and shortly thereafter my USB Armory died entirely. So it seems likely my problem was a bad microSD
< Luke-Jr> jonasschnelli: that sounds like a terrible UX
< jonasschnelli> Luke-Jr: the UX could still be native (once it's decoupled)
< wumpus> Luke-Jr: yes, your problem is likely a different one. You're the only one reporting it on linux
< wumpus> jonasschnelli: if you go that far to make a bundle you may as well fix the windows version :-) (it would be about as much development work)
< gmaxwell> jonasschnelli: doesn't jive really well with the resource costs of running a node.... VM with the constantly expanding storage is no fun. :P plus overheads. :P
< Luke-Jr> does UML support Windows? ;)
< gmaxwell> Fixing the windows issue shouldn't be a big deal once we get enough repros and data, similar to OSX.
< gmaxwell> sounded like it might be the same issue.
< wumpus> yeah, let's just solve the database problem instead of trying to work around it
< wumpus> even if we have to put a flush after every line in leveldb ...
< gmaxwell> I considered the exploration of other options to be orthorgonal with windows being on fire FWIW.
< jonasschnelli> gmaxwell: nowadays VM overhead is tiny... it's not perfect. But better a solution that works with a loss of 10-20% in overhead than unsolved corrupted databases.
< wumpus> gmaxwell: sure, I still like the idea of exploring other databases
< wumpus> but yes, orthogonally
< sipa> i really like the idea of just being able to use a maintained and tested database
< gmaxwell> (also the attempt may turn up bugs elsewhere in our codebase)
< wumpus> don't we all
< sipa> and not a hack we had to pull together
< * wumpus> thought leveldb was that
< gavinandresen> a bug bounty worked for the last leveldb corruption issue we had (if i recall correctly). I'm still holding some bitcoin in the core dev expenses fund
< sipa> wumpus: leveldb windows certainly isn't
< Luke-Jr> jonasschnelli: if someone really wants to investigate VM stuff, I'd suggest instead a thin OS that runs *Windows* in the VM, and provides a hardwarewallet-like interface on top
< jonasschnelli> is there an approach to test sqlite's performance which represents our db-style?
< sipa> wumpus: we have local modifications to the win env
< wumpus> jonasschnelli: best would be to just try it
< gmaxwell> jonasschnelli: yes, testing bitcoin core using <alternative database> :P it shouldn't be that much work; presumably jeff will report back on that soon
< jonasschnelli> sqlite could initially be slower. But it's better maintained and much more portable.
< sipa> jonasschnelli: there is slower and slower
< jonasschnelli> Indeed
< sipa> if it's slower for things our caches compensate for, who cares
< gmaxwell> jonasschnelli: speed is a security consideration for us currently, if you were talking about 5% or something I'd agree. but thats now what I expect.
< gmaxwell> s/now/not/
< sipa> if it's slower to the point that it affects block propagation, it's a no go
< gmaxwell> s/currently/sadly/
< * wumpus> doubts it will make much of a difference
< jonasschnelli> Quote from sqlite4: The default built-in storage engine is a log-structured merge database. It is very fast, faster than LevelDB, supports nested transactions, and stores all content in a single disk file. Future versions of SQLite4 might also include a built-in B-Tree storage engine.
< jonasschnelli> promising. :)
< wumpus> there are so many factors influencing performance of bitcoind, that a slightly slower database won't be a big issue
< sipa> yes, sqlite4 sounds awesome _once_ it has had the same amount of testing and battle hardening as sqlite3
< wumpus> jonasschnelli: yes that was pasted before today :)
< jonasschnelli> ha.. okay. I'd love to play around with it. But my stack of things to work down is just too big right now.
< gmaxwell> we might also want to maintain a patch for sqllite4 and use it to try to get the sqllite devs to use us as a test harness. :P
< jonasschnelli> sipa: sqlite is widely used, also in almost every browser (local dbs) and smartphone (android and iOS IIRC). They're very likely to update to sqlite4 at some point...
< sipa> but quote from one of the sqlite devs a few months ago "sqlite4 is a dev toy"
< jonasschnelli> sipa: that sounds perfect for our bitcoin <1.0 version. :)
< sipa> jonasschnelli: if only we didn't have this pesky economy thing that relies on bitcoin
< jonasschnelli> sipa: na... those are just some fiat exchange bubbles.. :)
< gmaxwell> Lets fix that first then. :P
< wumpus> trying it out doesn't hurt, no one is talking about releasing with it...
< sipa> wumpus: exactly
< sipa> dev toy :p
< jonasschnelli> wumpus: +1 ... and such a PR would take at least half a year to get in.
< gmaxwell> well as I said, might even be possible to get the sqllite developers to test using us. We're a pretty cool load generator (esp with signatures off)
< sipa> gmaxwell: we're boring
< sipa> single-threaded access, only bulk writes and small random reads
< jcorgan> < Luke-Jr> jonasschnelli: if someone really wants to investigate VM stuff, I'd suggest instead a thin OS that runs *Windows* in the VM
< jcorgan> *cough* Qubes OS *cough*
< Luke-Jr> jcorgan: Qubes OS is not very thin, and doesn't do GPU passthrough
< jcorgan> didn't think about GPU
< jcorgan> but thinness is poorly defined
< btcdrak> sipa: are the tests in #6816 (versionbits) enough? I was thinking it would be good to have some rpc-tests there as well, generating blocks and running through a couple of scenarios.
< btcdrak> or is that overkill?
< CodeShark> btcdrak: I think we should probably do some regtests with the integration, not necessarily over RPC
< CodeShark> well...
< CodeShark> other than just generating blocks
< CodeShark> I guess just calling generate and getblockchaininfo
< btcdrak> jgarzik: That was fast
< CodeShark> jgarzik: nicely done :)
< sipa> yes, but does it run linux?
< CodeShark> btcdrak: the -blockversion thing will probably not be a good idea anymore using versionbits
< CodeShark> it will be better to provide a list of BIPs you don't want to set the bit for
< jgarzik> a few bits can be forwarded upstream immediately, like Seek* and Prev() removal
< btcdrak> CodeShark: well obviously the bipdersig.py tests are for an ISM sf.
< btcdrak> CodeShark: I'm just saying a set of tests like that which generate blocks and simulate a sf rollout but using versionbits protocol.
< CodeShark> I suppose we can just use VERSION_HIGH_BITS and VERSION_HIGH_BITS | 0x1 instead of versions 2 and 3 :)
< Luke-Jr> CodeShark: does versionbits sanitise the encoding btw? ;)
< Luke-Jr> ie, define the first 32 bits to be a big-endian number
< maaku> Luke-Jr: it's little endian
< Luke-Jr> would be nice to fix that at the same time
< CodeShark> encoding? versionbits doesn't deal with serialization stuff
< btcdrak> CodeShark: it makes the most conclusive tests that it works, and provides some protection against regressions.
< maaku> Luke-Jr: why? nothing else is big endian
< Luke-Jr> CodeShark: anything consensus-relevant must deal with encoding
< maaku> Luke-Jr: he means versionbits doesn't touch serialization
< CodeShark> Luke-Jr: versionbits deals with block header data that has already been deserialized
< Luke-Jr> maaku: big endian is standard for protocols; and hashes at least are
< sipa> Luke-Jr: please
< sipa> we're not changing the serialization of block headers
< jgarzik> big endian is dead
< sipa> no endianness flamewar please
< CodeShark> in any case, the serialization format is entirely a separate issue...versionbits will play no part in that either way
< Luke-Jr> so the bits are numbered 7 6 5 4 3 2 1 15 14 13 12 11 10 9 8 23 22 21 20 19 18 17 16 31 30 29 28 27 26 25 24 ?
< sipa> irrelevant
< Luke-Jr> …
< CodeShark> versionbits deals with ints, uint32_t, etc...
< Luke-Jr> it's relevant because if I set the wrong BIP, it won't work.
< sipa> the block header serialization defines nVersion as a 32-bit little-endian signed integer
< Luke-Jr> wrong bit*
< CodeShark> it doesn't care what the underlying representation is for the particular architecture
< sipa> versionbits changes the semantics of the nVersion integer
< CodeShark> by the time versionbits comes into play, the version field has already been decoded into an int
< Luke-Jr> CodeShark: that is an implementation detail. the whole point of version bits is that it is no longer an int, but a bit array
< CodeShark> to test a bit, you do (nVersion >> bit) & 0x1
< sipa> Luke-Jr: it's still an int; an int in which particular bits are set
< * Luke-Jr> sighs.
< sipa> now please stop this silly discussion, we're not redefining how integers work
< gmaxwell> but, base-3!!!
< Luke-Jr> fine, not worth the time to argue about this stupid design decision.
< CodeShark> base-3 would allow us to do yea/nay/abstain :p
< Luke-Jr> the annoying part will be having to answer questions in a few years why it's misordered such. like I had to do with the getwork data for years
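The exchange above can be made concrete with a small sketch. This is an illustrative Python example, not Bitcoin Core code: the `TOP_BITS`/`TOP_MASK` constants are taken from the BIP9 versionbits draft under discussion (top three bits set to `001` mark an nVersion that carries signal bits), and as sipa notes, the header serialization itself is unchanged, nVersion stays a 32-bit little-endian signed integer; versionbits only reinterprets the already-decoded integer.

```python
import struct

TOP_MASK = 0xE0000000  # upper three bits select the encoding
TOP_BITS = 0x20000000  # 001 prefix: "this nVersion carries signal bits"

def is_versionbits(n_version):
    # Block signals via versionbits only if the top bits match the prefix.
    return (n_version & TOP_MASK) == TOP_BITS

def bit_set(n_version, bit):
    # CodeShark's test above: shift and mask out a single deployment bit.
    return (n_version >> bit) & 0x1 == 1

def serialize_version(n_version):
    # Serialization is unchanged: still a little-endian int32 on the wire.
    return struct.pack("<i", n_version)

n = TOP_BITS | (1 << 0)  # signal the deployment assigned bit 0
assert is_versionbits(n)
assert bit_set(n, 0) and not bit_set(n, 1)
assert serialize_version(n) == b"\x01\x00\x00\x20"
```

The last assertion illustrates Luke-Jr's complaint: bit 0 of the integer lands in the first serialized byte, while the `001` prefix lands in the last, because the wire format is little-endian.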
< phantomcircuit> jgarzik, you have benchmarks for sqlite?
< phantomcircuit> my experience using it in the past has been that it's slow as hell for inserts or any kind of concurrent access
< phantomcircuit> probably time would be better spent on building an interface spec for databases
< CodeShark> for insertions we could just use a raw file - the blockchain is a linear structure anyhow. then all we would need to do in sqlite is keep track of file positions ;)
< phantomcircuit> CodeShark, utxo only
< CodeShark> yeah, so we maintain a utxo index
< CodeShark> or are you talking about a pruned node?
< phantomcircuit> the index would be roughly the same size as the utxo data :P
< CodeShark> lol
< CodeShark> ok, for a utxo database perhaps it doesn't make so much sense
< phantomcircuit> yeah the actual data is really small
< gmaxwell> CodeShark: txindex works like you suggest there.
< gmaxwell> but for utxo set we get benefits from reducing the working set. saving 20 bytes of scriptpubkey/value in exchange for another random seek to the middle of some huge block file where 2/3rd of the data is not utxo relevant, basically defeats disk caching entirely. :)
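The flat-file-plus-position-index layout gmaxwell says txindex uses can be sketched as follows. This is a hypothetical minimal model in Python (the class and method names are invented for illustration, not Bitcoin Core's actual txindex code): records are appended to one flat file, and the index stores only `(offset, length)` pairs.

```python
import io

class FlatFileIndex:
    """Append-only record store: data lives in one flat file,
    the index maps each key to an (offset, length) pair."""

    def __init__(self):
        self.data = io.BytesIO()   # stand-in for the on-disk block file
        self.index = {}            # key -> (offset, length)

    def append(self, key, record):
        # Record the current end-of-file position, then write the record.
        offset = self.data.tell()
        self.data.write(record)
        self.index[key] = (offset, len(record))

    def get(self, key):
        # One random seek into the flat file per lookup.
        offset, length = self.index[key]
        self.data.seek(offset)
        return self.data.read(length)

store = FlatFileIndex()
store.append("tx1", b"raw-transaction-bytes")
assert store.get("tx1") == b"raw-transaction-bytes"
```

This pays off when records are large relative to the index entry, which is phantomcircuit's point: UTXO entries are so small that the index would be roughly the same size as the data, and gmaxwell adds that the extra random seek into a mostly-irrelevant block file defeats disk caching for the UTXO working set.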
< GitHub160> [bitcoin] petertodd opened pull request #6871: Full-RBF with opt-out (master...2015-10-rbf-with-opt-out) https://github.com/bitcoin/bitcoin/pull/6871
< bsm1175321> G
< GitHub51> [bitcoin] TheBlueMatt opened pull request #6872: Remove UTXO cache entries when the tx they were added for is removed/does not enter mempool (master...limitucache) https://github.com/bitcoin/bitcoin/pull/6872
< dcousens> petertodd: I hope you don't mind my questions :)
< phantomcircuit> BlueMatt, the last commit on #6872 seems to evict from the cache things which other transactions in the mempool want cached
< phantomcircuit> ie AcceptToMemoryPool fails because it's a double spend of something with a higher feerate
< petertodd> dcousens: no worries!
< phantomcircuit> scratch that
< phantomcircuit> i missed the check
< petertodd> dcousens: I'm doing three things at once so the responses are probably sounding a bit terse :)
< BlueMatt> phantomcircuit: ahh, ok, good, i was about to say
< dcousens> petertodd: all good, I read everything in good spirit, just hope it comes across the same :)