
Tx cache #110

Draft: wants to merge 10 commits into new-index
Conversation

RCasatta (author):

Introduce a tx cache for transaction lookups.

The same cache crate was introduced in romanz/electrs; its main advantage is that it is a contiguous memory region capped in size.

The hypothesis is that with a big enough cache it's possible to run a public instance in lightmode, without duplicating tx data that is already present in the bitcoin node.
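The cache crate itself isn't named above, so as a rough, hypothetical illustration of the "capped in size" behaviour only (a plain `HashMap` sketch, not the contiguous memory layout the real crate uses):

```rust
use std::collections::{HashMap, VecDeque};

// Hypothetical sketch, not the actual crate: shows only the
// byte-capped FIFO eviction, not the contiguous storage.
struct FifoCache {
    map: HashMap<[u8; 32], Vec<u8>>, // txid -> raw tx bytes
    order: VecDeque<[u8; 32]>,       // insertion order, for eviction
    used: usize,                     // total cached bytes
    capacity: usize,                 // byte cap
}

impl FifoCache {
    fn new(capacity: usize) -> Self {
        FifoCache { map: HashMap::new(), order: VecDeque::new(), used: 0, capacity }
    }

    fn put(&mut self, txid: [u8; 32], raw_tx: Vec<u8>) {
        if self.map.contains_key(&txid) {
            return; // already cached
        }
        self.used += raw_tx.len();
        self.map.insert(txid, raw_tx);
        self.order.push_back(txid);
        // Evict oldest entries until we are back under the cap.
        while self.used > self.capacity {
            match self.order.pop_front() {
                Some(old) => {
                    if let Some(evicted) = self.map.remove(&old) {
                        self.used -= evicted.len();
                    }
                }
                None => break,
            }
        }
    }

    fn get(&self, txid: &[u8; 32]) -> Option<&Vec<u8>> {
        self.map.get(txid)
    }
}
```

Because eviction is strictly first-in-first-out, entries fall out in insertion order once the byte budget is exceeded, regardless of how often they are hit.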

it is less interesting to see how many rows are written to the db and
more interesting to know the last height indexed
@shesek left a comment:

The hypothesis is that with a big enough cache it's possible to run a public instance in lightmode, without duplicating tx data which is already present in the bitcoin node.

When lightmode is disabled, is there still an advantage to using a cache in addition to having the txs available in rocksdb?

@@ -424,7 +424,7 @@ impl Daemon {
loop {
match self.handle_request_batch(method, params_list) {
Err(Error(ErrorKind::Connection(msg), _)) => {
-                    warn!("reconnecting to bitcoind: {}", msg);
+                    warn!("reconnecting to bitcoind: {msg}\nmethod was:{method}\nwith_params:{params_list:?}");
shesek:

Did you mean to include this?

RCasatta (author):

No, just a draft...

RCasatta (author):

And by the way, I was trying to understand why it was failing in my test run, but the log, even changed, wasn't helping; in the end the error was "rpc queue limit reached".

shesek:

Were you running with #89? If so, note that it requires adjusting bitcoind's rpcworkqueue and rpcthreads options upwards.

@@ -838,7 +851,16 @@ impl ChainQuery {
pub fn lookup_raw_txn(&self, txid: &Txid, blockhash: Option<&BlockHash>) -> Option<Bytes> {
let _timer = self.start_timer("lookup_raw_txn");

if self.light_mode {
if let Ok(cache) = self.txs_cache.lock() {
shesek:

Is it expected that locking may sometime fail and that the failure should be ignored?

RCasatta (author):

I don't think it is expected to fail, but since everything here would work normally in case of failure (just perf degradation), I preferred this way

shesek:

A poisoned mutex is likely to indicate some underlying coding issue; wouldn't it be better to let it error visibly?

I would at least log a warn-level message instead of silently ignoring it, but generally my preference is the "fail fast" approach for unanticipated errors. Failing fast (and restarting) would also re-enable the tx cache, instead of having it continue with degraded performance (until the process eventually restarts for another reason).
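The trade-off being discussed can be sketched with hypothetical helpers (not code from this PR): a `std::sync::Mutex` becomes poisoned when a thread panics while holding it, and every later `lock()` then returns `Err(PoisonError)`.

```rust
use std::sync::{Arc, Mutex};
use std::thread;

/// "Ignore and degrade" style: skip the cache on poison, but warn loudly.
/// Hypothetical helper; the cache payload type is simplified to Vec<u32>.
fn try_use_cache(cache: &Mutex<Vec<u32>>) -> bool {
    match cache.lock() {
        Ok(_guard) => true, // cache is usable
        Err(_poisoned) => {
            eprintln!("warn: tx cache mutex poisoned, skipping cache");
            false // from here on, every lookup bypasses the cache
        }
    }
}

/// Simulate the underlying coding issue: panic while holding the lock.
fn poison(cache: Arc<Mutex<Vec<u32>>>) {
    let _ = thread::spawn(move || {
        let _guard = cache.lock().unwrap();
        panic!("simulated bug while holding the lock");
    })
    .join(); // the panic stays contained in the joined thread
}
```

The "fail fast" alternative is simply `cache.lock().unwrap()` (or `.expect("tx cache poisoned")`), which re-raises the poison as a panic so the process restarts with a fresh, working cache instead of running degraded indefinitely.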

@@ -309,6 +309,18 @@ impl Mempool {
txids.push(txid);
self.txstore.insert(txid, tx);
}

// Populate tx cache
shesek:

Is there an advantage to populating the cache with mempool transactions, which already are stored entirely in memory?

Also, it seems that the cache is only used when looking up on-chain txs (via ChainQuery), so the mempool transactions populated into the cache won't actually be used until they get confirmed.

RCasatta (author):

Well, when they are evicted from the mempool they can still be looked up from the tx cache

shesek:

Right, but why store them while they're still in the mempool and not wait until they get confirmed (and cached naturally when they're used)?

Also, it's possible that by the time they confirm, they will no longer be in the FIFO cache

RCasatta (author):

When lightmode is disabled, is there still an advantage to using a cache in addition to having the txs available in rocksdb?

Yes, it avoids hitting the disk.

RCasatta (author):

Thanks for the review; I wasn't expecting it while in draft, but it's useful to have early feedback...

shesek commented Sep 29, 2024:

When lightmode is disabled, is there still an advantage to using a cache in addition to having the txs available in rocksdb?

yes, avoiding to hit the disk

I guess my question was, if we have all transactions available in a reasonably fast in-process database and don't require network roundtrips to fetch them (as in romanz/electrs), would a cache still improve things for typical access patterns? How large does the cache have to be for it to be beneficial? I'm not saying that it wouldn't help, just not so sure how to analyze this.

Another difference from romanz/electrs is that it has to fetch full transactions in order to provide scripthash history, so a single history request can result in processing potentially hundreds or thousands of funding/spending transactions. In our implementation everything needed is already available in the history indexes, so we don't look up txs internally as much.

Another thing to consider is that in the Esplora setup, we also have caching implemented at the HTTP level by caching proxies. For transactions, once they reach 10 confirmations we set a max-age of 5 years. However this does not apply to the Electrum server or to internal transaction lookup like a tx cache would.
