
Higher level cache above the standard RocksDB cache #935

Open
jakubcech opened this issue Aug 13, 2018 · 2 comments

@jakubcech
Contributor

jakubcech commented Aug 13, 2018

Description

Currently RocksDB caches data as raw bytes. This means the same entry may be deserialized into two separate transaction objects, which can then be manipulated independently in different threads.

We want to create a cache layer above the DB implementation that keeps X K transactions in memory and only stores them to the actual DB on eviction. This should allow us to avoid most reads from the DB.

Motivation

Reducing I/O overhead.

Requirements

  • We store the transaction object in the cache.
  • Benchmark disabling the RocksDB block cache.
  • Every time we save or read a transaction, we store it in the cache, not in the DB.
  • Only write to disk when we evict from the cache.
  • Eviction policy: when the cache is full, we evict a block of X transactions, FIFO (see the sketch after this list).
  • The cache will hold Y transactions.
  • Purge the cache to the DB on shutdown.
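
A minimal sketch of this write-back design, assuming a hypothetical `DbWriter` batching interface in front of RocksDB (the names and API here are illustrative, not actual IRI code):

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/**
 * Write-back FIFO cache: saves and reads hit memory only; the DB is
 * touched when a block of entries is evicted or on shutdown.
 */
public class TransactionCache<K, V> {

    /** Hypothetical batching interface wrapping RocksDB. */
    public interface DbWriter<K, V> {
        void writeBatch(List<Map.Entry<K, V>> entries);
    }

    private final LinkedHashMap<K, V> cache = new LinkedHashMap<>(); // insertion order = FIFO
    private final int capacity;      // Y: total transactions the cache holds
    private final int evictionBlock; // X: transactions flushed per eviction
    private final DbWriter<K, V> db;

    public TransactionCache(int capacity, int evictionBlock, DbWriter<K, V> db) {
        this.capacity = capacity;
        this.evictionBlock = evictionBlock;
        this.db = db;
    }

    /** Saves go to the cache, not the DB. */
    public synchronized void put(K key, V value) {
        cache.put(key, value);
        if (cache.size() > capacity) {
            evictBlock();
        }
    }

    /** Reads come from the cache; on a miss the caller falls back to the DB and re-inserts. */
    public synchronized V get(K key) {
        return cache.get(key);
    }

    /** Writes the oldest X entries to the DB in one batch, then drops them from memory. */
    private void evictBlock() {
        List<Map.Entry<K, V>> batch = new ArrayList<>(evictionBlock);
        Iterator<Map.Entry<K, V>> it = cache.entrySet().iterator();
        while (it.hasNext() && batch.size() < evictionBlock) {
            Map.Entry<K, V> e = it.next();
            batch.add(new AbstractMap.SimpleEntry<>(e.getKey(), e.getValue()));
            it.remove();
        }
        db.writeBatch(batch);
    }

    /** Purges the whole cache to the DB, e.g. on shutdown. */
    public synchronized void flushAll() {
        List<Map.Entry<K, V>> batch = new ArrayList<>(cache.size());
        for (Map.Entry<K, V> e : cache.entrySet()) {
            batch.add(new AbstractMap.SimpleEntry<>(e.getKey(), e.getValue()));
        }
        cache.clear();
        db.writeBatch(batch);
    }
}
```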

Open Questions (optional)

We need to decide on the number of transactions that we want to store and calculate the amount of memory it would occupy on the node. Without any calculations, I'd like to be able to store an amount of transactions that supports 1000 TPS, but that is likely a ton of memory. We can start with 50-100 and see what that gets us. The eviction policy should evict a fraction of the pool: for example 1%, 3%, 10%, whatever makes sense for the given size.
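For a rough sense of scale (assuming a serialized IOTA transaction is on the order of 1.6 KB, before Java object overhead): 100,000 cached transactions would occupy roughly 160 MB, and a single minute at 1000 TPS already produces 60,000 transactions, i.e. roughly 96 MB. The actual figures would need to be measured against the real transaction object size.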

I'm open to making this configurable, unless we can squeeze a sufficient number of TXs into a very small memory footprint (which I reckon we can't). In that case I'd recommend adding a minimum value to the configuration parameter, e.g., at least 100-200 MB worth of transactions, which should at least somewhat help even low-resource nodes.

I'm a bit reserved about making the cache size dynamic, as we'd have to monitor/count the rate of incoming TXs and react based on that, meaning that if a large jump in TPS happened, we'd have to evict a lot of transactions very fast before the cache mechanism adjusts. But I'm happy for someone to prove me wrong with an approach that would work here.

@jakubcech added the C-Persistence and L-Groom (This issue needs to be groomed) labels on Aug 13, 2018
@GalRogozinski
Contributor

The problems are:

  1. The data is replicated from the block cache into the Java application layer, wasting memory.
  2. If a transaction is read from the cache more than once, several Transaction objects will be created, consuming even more memory.
  3. There can be race conditions between the objects created this way. Currently this is not too bad, since "Solidity" and "Validity" can only be changed from false to true and not the other way around; still, we may perform needless calculations.

I propose that we create a new cache that replaces the RocksDB block cache. It will either be based on Guava's cache or be a synchronized map of weak references like https://github.com/ehcache/ehcache3/blob/606c5dcba355f5ed1abb002d455ef05b5899f48e/core/src/main/java/org/ehcache/core/collections/ConcurrentWeakIdentityHashMap.java, but with concurrent purging.

Every time we store to the DB, we simply also write to the cache.
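
A minimal sketch of the Guava-based variant, assuming Guava is on the classpath; `TransactionStore` and the `Transaction` placeholder are hypothetical stand-ins for the IRI model, not actual IRI code:

```java
import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;

import java.util.concurrent.ExecutionException;

/** Write-through cache in front of RocksDB, replacing the RocksDB block cache. */
public class GuavaTransactionCache {

    /** Hypothetical wrapper around the RocksDB persistence layer. */
    public interface TransactionStore {
        void save(String hash, Transaction tx) throws Exception;
        Transaction load(String hash);
    }

    public static class Transaction { /* placeholder for the IRI transaction model */ }

    // One canonical Transaction object per hash avoids the duplicate-object races above.
    // Guava also supports .weakValues() for the weak-reference variant mentioned above.
    private final Cache<String, Transaction> cache = CacheBuilder.newBuilder()
            .maximumSize(100_000) // Y transactions; size-based eviction
            .build();

    private final TransactionStore db;

    public GuavaTransactionCache(TransactionStore db) {
        this.db = db;
    }

    /** Every store to the DB also writes to the cache. */
    public void store(String hash, Transaction tx) throws Exception {
        db.save(hash, tx);
        cache.put(hash, tx);
    }

    /** Reads go through the cache; a miss loads from the DB exactly once per key. */
    public Transaction load(String hash) throws ExecutionException {
        return cache.get(hash, () -> db.load(hash));
    }
}
```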

@GalRogozinski
Contributor

This is the cache Hans created, for whoever is interested:
https://github.com/iotadevelopment/iri/blob/dev/src/main/java/com/iota/iri/storage/cache/Cache.java

He said the change failed in tests, and fixing it proved problematic, which is why we didn't continue.
