#bitcoin-core-dev on 2018-06-06 — searchable irc log

02:50 < MarcoFalke> FYI, I have assigned all non-mergeable open pulls to a new label "Needs rebase"

02:50 < MarcoFalke> Makes it easier to sort

02:50 < MarcoFalke> e.g. https://github.com/bitcoin/bitcoin/pulls?q=is%3Apr+is%3Aopen+label%3A%22Needs+rebase%22

05:37 < bitcoin-git> [bitcoin] lucash-dev opened pull request #13404: [tests] speed up of tx_validationcache_tests by reusing of CTransaction. (master...speedup-tx_validationcache_tests) https://github.com/bitcoin/bitcoin/pull/13404

08:33 < wumpus> MarcoFalke: I hope that happens automatically? otherwise, it sounds like a nightmare to keep it up to date

08:33 < wumpus> MarcoFalke: also, it's already possible to use bitcoinacks.com to keep track of that

11:34 < wumpus> how are things with 0.16.0rc1? do we have anything that needs to be backported for rc2? I haven't heard any reports of bugs at least.

11:35 < wumpus> if not, we should do a very fast rc2 for the translations issue and then tag final

11:35 < wumpus> eh, 0.16.1 obvs

12:00 < bitcoin-git> [bitcoin] ken2812221 opened pull request #13406: travis: Add make step so that travis can build all executables for Mac. (master...travis_make_mac) https://github.com/bitcoin/bitcoin/pull/13406

13:18 < bitcoin-git> [bitcoin] MarcoFalke pushed 2 new commits to master: https://github.com/bitcoin/bitcoin/compare/a589f536b5e1...e4082d59f53d

13:18 < bitcoin-git> bitcoin/master 9d6c9db Ben Woosley: lint: Add linter to error on #include <*.cpp>...

13:18 < bitcoin-git> bitcoin/master e4082d5 MarcoFalke: Merge #13301: lint: Add linter to error on #include <*.cpp>...

13:19 < bitcoin-git> [bitcoin] MarcoFalke closed pull request #13301: lint: Add linter to error on #include <*.cpp> (master...lint-include-cpp) https://github.com/bitcoin/bitcoin/pull/13301

14:15 < jamesob_> anyone willing to trade reviews for #13168?

14:15 < gribble> https://github.com/bitcoin/bitcoin/issues/13168 | Thread names in logs and deadlock debug tools (take 2) by jamesob · Pull Request #13168 · bitcoin/bitcoin · GitHub

14:26 < ryanzim> https://github.com/bitcoin/bips/blob/master/bip-0141.mediawiki#p2wpkh says:

14:26 < ryanzim> > The HASH160 of the pubkey in witness must match the witness program.

14:27 < ryanzim> This seems to imply that the witness program is HASH160(pubkey), when it's actually HASH160(SHA256(pubkey))

14:27 < ryanzim> Shouldn't this be clarified?

14:31 < ryanzim> Ah, sorry; doing a little more research; seems I was confusing HASH160 and ripemd160; nvm

15:49 < promag> Currently there is a DbEnv can reference multiple Db instances right?

15:49 < promag> s/there is//

15:52 < promag> Also, BerkeleyDatabase::Flush calls BerkeleyEnvironment::Flush, is there a reason to keep it like that?

15:58 < promag> so that when unloading a wallet all wallets wouldn't have to be flushed

16:01 < bitcoin-git> [bitcoin] eudisd closed pull request #13373: Qt: Update Wallet Encryption Titles To Better Describe Process (master...feature/bitcoin-#13245) https://github.com/bitcoin/bitcoin/pull/13373

16:07 < promag> jnewbery: yes there is a couple of things to fix

16:07 < promag> jnewbery: one is as follow

16:07 < promag> bitcoind -regtest -debug -wallet=w1 -wallet=w2

16:07 < promag> bitcoin-cli -regtest unloadwallet w1

16:08 < promag> bitcoin-cli -regtest -rpcwallet=w2 getwalletinfo

16:08 < promag> see above questions

16:55 < MarcoFalke> wumpus: I know it is on bitcoinacks.com, but I don't like the idea to switch back and forth between websites when you could have it all in one place

16:55 < MarcoFalke> And yes, will be automated in the future

16:56 < MarcoFalke> ACK on the quick rc2

17:23 < bitcoin-git> [bitcoin] skeees opened pull request #13407: [refactor, move-only-ish] Refactor mempool accept/reject logic (master...atmp-p2p-refactor) https://github.com/bitcoin/bitcoin/pull/13407

17:31 < cfields> sipa: did you happen to bench sha2 with sse41 + avx?

17:37 < sipa> cfields: ah, forgot about that, good idea

17:38 < sipa> cfields: feel like fixing the proliferation of various crypto libs in the makefile?

17:38 < cfields> sipa: sure thing, will PR

17:48 < sipa> cfields: sse41 2.9ms, sse41 (compiled with -mavx) 2.0ms, avx2 1.1ms

17:49 < sipa> that's a very significant improvement...

17:49 < cfields> sipa: yep, same result on the aws instance I just fired up

17:51 < cfields> sipa: the current avx2 path falls back to sse41 for single transforms, right? So presumably that benefits as well?

17:51 < sipa> yup

17:52 < sipa> cfields: ah, no

17:52 < sipa> it falls back to sse41 for 4 elements

17:52 < sipa> there is also an asm sse4 single implementation... but that's not using intrinsics, so won't benefit from -mavx

17:53 < cfields> ah yes, ok

17:54 < sipa> we should try to convert that sse4 asm code to intrinsics, though

17:55 < cfields> so, how do you want to move forward? Seems to me it makes sense to compile different bundles, where all contents are built with the flags

17:56 < sipa> i guess we can compile the same source file twice, with a different -D specific to that build, which changes the namespace name?

17:57 < cfields> so for ex, libbitcoin_crypto-avx.a is sha256.cpp, sha256_sse41.cpp, etc. Just built with the avx flags...

17:57 < cfields> right

17:59 < cfields> ok. how about: I'll do the simple build changes, we can get shani and power8 in, then we can redo the structure

17:59 < sipa> sounds good

17:59 < cfields> ok, thanks for testing

18:02 < sipa> cfields: these numbers are on an 7th gen i7 CPU

18:03 < sipa> i'll also try on ryzen

18:04 < cfields> ok. based on the other tests, I'd be perfectly happy with no-change there :)

18:06 < gmaxwell> should figure out what AVX instruction's its substituting in...

18:08 < sipa> gmaxwell: my assumption is that it's just using the higher 128 bits of the 256 registers as extra register space

18:08 < cfields> gmaxwell: I did asm dumps of Round() split out, but don't speak enough asm to know what I'm looking at

18:08 < cfields> I can paste those if you'd like

18:10 < cfields> anything less focused than Round got too messy

18:13 < sipa> cfields: on Ryzen: sse41 2.8ms, sse41 w/ -mavx 2.2ms, avx2 2.1ms

18:13 < cfields> woohoo!

18:13 < sipa> (Ryzen actually only has 4-way parallel arithmetic, so avx2 doesn't have that much of a gain)

18:13 < cfields> interesting that it's almost the same as avx2

18:13 < cfields> ah

18:15 < jarthur> Yep, it's practically just "API compatible" with AVX2.

18:15 < sipa> well it also gives you 256-bit registers

18:15 < sipa> but i guess those exist at AVX already

18:17 < jarthur> sipa: gmaxwell mentioned that zen might be able to do parallel sha-ni runs if each step is loaded up side by side. Do you know if anyone has played with that yet? I volunteered at some point but didn't get around to it.

18:18 < sipa> jarthur: yup, 2-way SHA-NI is faster than 1-way on my system

18:18 < jarthur> nice! Is that code in your branch atm?

18:18 < sipa> yup

18:19 < jarthur> rockin

18:19 < sipa> (not quite 2x - the implementation needs 10 registers-ish, so 2-way needs 20, while there are only 16 addressable ones, resulting in spills)

18:19 < sipa> IIRC it took a benchmark from 0.83ms to 0.61ms by doing the 2-way

18:20 < jarthur> that's significant in bitcoin land

18:21 < sipa> jarthur: https://github.com/sipa/bitcoin/blob/bb80ab25963f56cad9bb560e59c77d40f351901b/src/crypto/sha256_shani.cpp#L151

18:21 < sipa> it just calls every round function twice in a row

18:22 < jarthur> Thanks. Looks nice and clean with those inlines.

18:54 < sipa> who is DrahtBot?

19:07 < wumpus> sipa: MarcoFalke's bot

19:07 < sipa> ah, nice

19:08 < MarcoFalke> [ ] I'm not a robot

19:08 < sipa> oh, i meant "who is running Drahtbot"

19:08 < sipa> i did realize it was a bot :)

19:08 < sipa> how does it figure out conflicts?

19:09 < sipa> does it try every combination of 2 PRs?

19:17 < MarcoFalke> sipa: Yes, rn. I might implement a smart solution when I have time. Though, the compute overhead is trivial compared to the latency by the github api for now...

19:26 < MarcoFalke> > i did realize it was a bot :)

19:26 < MarcoFalke> Thanks for the compliment :)

19:28 < bitcoin-git> [bitcoin] theuni opened pull request #13408: crypto: cleanup sha256 build (master...sha2-cleanup) https://github.com/bitcoin/bitcoin/pull/13408

19:28 < cfields> sipa: ^^

19:33 < sipa> cfields: i'd rather keep the explicit -D... in the makefile for the arch specific crypto libs

19:33 < sipa> rather than rely on config.h

19:34 < cfields> sipa: for what reason? We're already relying on config.h for endian/swap

19:34 < sipa> cfields: because different libs may be compiled with different flags

19:35 < sipa> (expecting the avx/sse4 split)

19:37 < cfields> sipa: hmm, I had a different approach in mind. But sure, I'll revert that and we can sha256 it out when we get there.

19:39 < jamesob> wumpus: how much more review does a bench-only change like #13219 need?

19:39 < gribble> https://github.com/bitcoin/bitcoin/issues/13219 | bench: Add block assemble benchmark by MarcoFalke · Pull Request #13219 · bitcoin/bitcoin · GitHub

19:40 < MarcoFalke> jamesob: I guess it is fine, but just covered/hidden by a ton of other open pull requests.

19:52 < sipa> cfields: if you have a different idea, sure

21:27 < cfields> sipa: https://github.com/theuni/bitcoin/commits/sha2-libs

21:27 < cfields> that has the libs split out, rebuilt for each isn set, and adds avx

21:28 < cfields> and yea, i see your point now, we need to pass in a flag for the namespace

21:46 < jarthur> sipa cfields, are we going down the direction that optimal instruction set would be picked at runtime, and default b86_64 build would have the lot of them compiled?

21:46 < cfields> jarthur: yes. that's already the case, we're just diving deeper.

21:50 < jarthur> thanks