Use `MemoryOnlyProvider` as fallback to allow the app to work without storage #439

chrispader · 2023-12-21T12:15:06Z

Details

This PR adds a MemoryOnlyProvider fallback solution and changes the interface of Storage (lib/storage/index.js) to allow multiple simultaneous storage providers.

This PR also improves the structure of everything in lib/storage and establishes a consistent structure for providers and order for all the functions within them.
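For readers unfamiliar with the idea, an in-memory provider can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the factory name `createMemoryOnlyProvider` and the exact set of methods are assumptions, chosen to resemble a typical key/value provider interface.

```javascript
// Hypothetical in-memory provider. The method names are assumptions that
// mimic a generic key/value storage interface; nothing is persisted.
function createMemoryOnlyProvider() {
    const store = new Map();

    return {
        name: 'MemoryOnlyProvider',

        // Start from an empty store.
        init() {
            store.clear();
        },

        // Resolves with the stored value, or null when the key is unknown.
        getItem(key) {
            return Promise.resolve(store.has(key) ? store.get(key) : null);
        },

        setItem(key, value) {
            store.set(key, value);
            return Promise.resolve();
        },

        removeItem(key) {
            store.delete(key);
            return Promise.resolve();
        },

        getAllKeys() {
            return Promise.resolve(Array.from(store.keys()));
        },

        clear() {
            store.clear();
            return Promise.resolve();
        },
    };
}
```

Because everything lives in a `Map`, data written through such a provider is gone after a page refresh, which is the trade-off discussed later in this thread.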

Related Issues

Expensify/App#29403

Automated Tests

This PR changes how the storage is used internally, but the library's behavior as seen from the outside is unchanged. Therefore, no new tests were added.

Manual Tests

Verify that no flows and functionality were broken by the changes. Check for console errors regarding Onyx.

Author Checklist

Screenshots/Videos

Android: Native

Android: mWeb Chrome

iOS: Native

iOS: mWeb Safari

MacOS: Chrome / Safari

Screen.Recording.2023-12-27.at.14.12.34.mov

MacOS: Desktop


tgolen · 2024-01-16T14:47:12Z

> I'm going to simplify this PR and remove any unnecessary overhead for now and just use the NoopProvider instead of actually using a working fallback provider.

@chrispader Is this ready to review again? I wasn't sure and I don't want to be the hold-up here.

chrispader · 2024-01-16T14:48:23Z

> I'm going to simplify this PR and remove any unnecessary overhead for now and just use the NoopProvider instead of actually using a working fallback provider.
>
> @chrispader Is this ready to review again? I wasn't sure and I don't want to be the hold-up here.

ah no, not yet. Going to finish this this week!

chrispader · 2024-01-19T13:00:11Z

@tgolen @marcaaron i just removed the additional "multi-storage" logic and now simply fall back to the NoopProvider in case the main platform storage provider can't be initialised.

This whole re-structuring and cleanup as well as this extra "storage" layer comes in handy though, since we can now handle the initialisation phase first, and only then start handling operations.
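A minimal sketch of the fallback described above, assuming each provider exposes a synchronous `init()` that throws on failure. The helper name `initWithFallback` and the shape of `noopProvider` are hypothetical and only illustrate the idea.

```javascript
// Hypothetical no-op provider: every operation resolves without persisting.
const noopProvider = {
    name: 'NoopProvider',
    init() {},
    getItem() {
        return Promise.resolve(null);
    },
    setItem() {
        return Promise.resolve();
    },
};

// Try the platform provider first and fall back to the no-op provider when
// its init() throws. Returns whichever provider ended up active.
function initWithFallback(platformProvider, fallbackProvider = noopProvider) {
    try {
        platformProvider.init();
        return platformProvider;
    } catch (error) {
        fallbackProvider.init();
        return fallbackProvider;
    }
}
```

With this shape, the rest of the storage layer only ever talks to the returned provider and does not need to know whether initialisation failed.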

I'm going to test this in the app now, but make sure to review this PR, as it should be ready!

marcaaron · 2024-01-19T19:59:52Z

lib/Onyx.js

-            cache.set(key, value);
-            keyChanged(key, value);
-        });
-    }


thought: Why was this moved?

chrispader · 2024-01-23

Just to keep the code that basically initiates the Storage layer together. Technically, it makes no difference.

marcaaron · 2024-01-19T20:03:33Z

lib/storage/index.js

+     * @return {Promise<*>}
+     */
+    getItem(key) {
+        return runAfterInit(() => provider.getItem(key));


confusion: Why do we need runAfterInit()?

chrispader · 2024-01-23

Since i moved the initialization of the provider to provider.init(), we need this to prevent any operations from running before the storage provider has actually been initialized/created.
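One way such a gate could look is sketched below. Only `runAfterInit` and `provider.init()` are names from the diff; `createStorage` and `initPromise` are hypothetical, and the real implementation in this PR may differ.

```javascript
// Sketch of an init gate: operations are deferred until provider.init()
// has completed. `createStorage` and `initPromise` are assumed names.
function createStorage(provider) {
    // Resolves once provider.init() has finished, whether it is sync or async.
    const initPromise = Promise.resolve().then(() => provider.init());

    // Runs `operation` only after initialization has completed.
    function runAfterInit(operation) {
        return initPromise.then(operation);
    }

    return {
        getItem: (key) => runAfterInit(() => provider.getItem(key)),
        setItem: (key, value) => runAfterInit(() => provider.setItem(key, value)),
    };
}
```

Note that in this sketch a failing `init()` makes every later operation reject, so it would need to be combined with some fallback handling.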

marcaaron · 2024-01-19T20:09:59Z

lib/storage/providers/IDBKeyValProvider.js

+    init() {
+        const newIdbKeyValStore = createStore('OnyxDB', 'keyvaluepairs');
+
+        if (newIdbKeyValStore == null) throw Error('IDBKeyVal store could not be created');


What problem specifically does this solve? Do we have any documented cases where createStore() returned null? What was the problem we saw? What about when an irrecoverable error happens when trying to save after initializing the store successfully?

(that second situation is why this issue was created in the first place - and I'm not convinced we have solved it)

chrispader · 2024-01-23

You're right. Until now, i assumed that this or any other error would only occur during initialization.

I updated the PR to handle errors during runtime as well and gracefully degrade the performance of the app. Lmk what you think of the general approach, before actually reviewing it :)

It's still very useful to have this extra storage layer and a cleaned up provider structure.

chrispader · 2024-01-23T17:44:09Z

One more time, i'm asking for your opinions on this new approach. @marcaaron @tgolen

(no need to completely review yet, i'm just curious what you think :))

The current approach handles errors during initialization and runtime, while also utilizing the extra storage layer and the cleaned up provider structure.

Still can't really test this in a real-life scenario, since we don't know how to trigger the `Internal error opening backing store for indexedDB.open` error, but it basically works with any error thrown for any operation.

In the tryOrDegradePerformance function we could additionally check for specific errors. As long as we don't know exactly which errors to expect, we can start adding all the problematic errors to a list of error types and check for them in the function's catch block...
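The idea of checking against a list of known errors could be sketched like this. It is an illustration under assumptions, not the code from this PR: `KNOWN_FATAL_ERRORS`, `isKnownFatalError`, and `createDegradableStorage` are hypothetical names, and retrying the failed operation on the fallback provider is one possible design choice.

```javascript
// Messages treated as fatal for the storage layer. Only this one message
// appears in the thread; the list mechanism itself is an assumption.
const KNOWN_FATAL_ERRORS = ['Internal error opening backing store for indexedDB.open'];

function isKnownFatalError(error) {
    const message = error && error.message ? error.message : String(error);
    return KNOWN_FATAL_ERRORS.some((known) => message.includes(known));
}

function createDegradableStorage(primaryProvider, fallbackProvider) {
    let provider = primaryProvider;

    // Runs `operation` against the currently active provider. On a known
    // fatal error it switches to the fallback and retries once; any other
    // error is rethrown unchanged.
    function tryOrDegradePerformance(operation) {
        const attempted = provider;
        return Promise.resolve()
            .then(() => operation(attempted))
            .catch((error) => {
                if (attempted === fallbackProvider || !isKnownFatalError(error)) {
                    throw error;
                }
                provider = fallbackProvider;
                return operation(fallbackProvider);
            });
    }

    return {
        getItem: (key) => tryOrDegradePerformance((p) => p.getItem(key)),
        setItem: (key, value) => tryOrDegradePerformance((p) => p.setItem(key, value)),
        isDegraded: () => provider === fallbackProvider,
    };
}
```

Matching on message substrings is fragile, but it keeps the list easy to extend as more failing errors are identified.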

tgolen · 2024-01-23T19:18:22Z

> it basically works with any error thrown for any operation.

My only concern with this is the chance of false positives. It seems like we are struggling between the extremes of not knowing which errors get thrown or only knowing one error that gets thrown with no way to reproduce it.

Maybe this hints that we need better error reporting first so that we can make a better assumption about which errors are actually occurring.

chrispader · 2024-01-25T09:12:38Z

> it basically works with any error thrown for any operation.
>
> My only concern with this is the chance of false positives. It seems like we are struggling between the extremes of not knowing which errors get thrown or only knowing one error that gets thrown with no way to reproduce it.

Yes you're right, it's either this or that extreme case. I don't see any other way right now to check for these errors, unless we're able to confidently reproduce it.

After doing some internet research, i'm like 80% sure the `Internal error opening backing store for indexedDB.open` error described in Expensify/App#29403 comes from the storage being full.

> Maybe this hints that we need better error reporting first so that we can make a better assumption about which errors are actually occurring.

My suggestion here would be to basically have this error check for initialization and every operation (tryOrDegradePerformance) and only degrade performance if we detect a known error that will prevent the storage from continuing to work. This way, we can easily add more errors on the go.

There really is no official way in idb-keyval or IndexedDB to detect these kinds of errors and handle them. I think the current approach is the most generic and will fix this issue at least for the known error.

marcaaron · 2024-01-25T22:08:38Z

On the subject of error reporting... I worked on improving it a bit in the past and an internal log search will show us when storage is failing and why it failed (some explanations more cryptic than others).

This will give us the ability to catch something like a "critical" vs. "benign" loss of client data (if there is such a thing).

With reliable updates, it seems extremely important to ensure storage failures don't lead to data loss at all e.g. if you are able to store one recent update, but for some reason could not store some previous update - the previous update is now lost forever. Maybe I am wrong, but the current system we have depends on a near perfect reliability. Anytime we fail to store something the update chain will essentially be broken.

So, I might actually argue that we should always stop storing things immediately when any error happens and address these errors with urgency.
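To make the "gap" concern concrete, here is a small sketch of the behaviour argued for above: once one write fails, all later writes are refused, so storage never contains a later update without the earlier one. The names `createHaltingWriter` and `isHalted` are hypothetical, and this is only an illustration of the argument, not a proposal from the thread.

```javascript
// Hypothetical write gate: after the first failed write, every later write
// is refused, so storage never holds update N+1 without update N.
function createHaltingWriter(provider) {
    let halted = false;
    let queue = Promise.resolve();

    function setItem(key, value) {
        // Chain writes so they are applied strictly in call order.
        const result = queue.then(() => {
            if (halted) {
                throw new Error('Storage halted after an earlier write failure');
            }
            return Promise.resolve()
                .then(() => provider.setItem(key, value))
                .catch((error) => {
                    halted = true;
                    throw error;
                });
        });
        // Keep the chain usable even when this write rejects.
        queue = result.catch(() => {});
        return result;
    }

    return {setItem, isHalted: () => halted};
}
```

In this sketch, if the second of three writes fails, the third is rejected as well, and only the first write ends up in storage.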

chrispader · 2024-01-27T18:52:40Z

> On the subject of error reporting... I worked on improving it a bit in the past and an internal log search will show us when storage is failing and why it failed (some explanations more cryptic than others).
>
> This will give us the ability to catch something like a "critical" vs. "benign" loss of client data (if there is such a thing).

unfortunately i don't have access to that link.. are you referring to some sort of analytics where we collect data about (the type of) errors occurring (in production)?

> With reliable updates, it seems extremely important to ensure storage failures don't lead to data loss at all e.g. if you are able to store one recent update, but for some reason could not store some previous update - the previous update is now lost forever. Maybe I am wrong, but the current system we have depends on a near perfect reliability. Anytime we fail to store something the update chain will essentially be broken.
>
> So, I might actually argue that we should always stop storing things immediately when any error happens and address these errors with urgency.

If i understand you correctly, this means we would need to implement some sort of "transaction" system, to basically roll back changes if an error occurs later in a chain of operations?

We can definitely check for initialization errors - probably like the original one described in the issue - and degrade performance before any operations are even triggered.

But if a fatal error occurs at some other point during Onyx's lifetime, we can only degrade performance from there... meaning after the failing operation. The data will still be stored in cache (in-memory), though that data will obviously be lost after a refresh. Without some other persisted fallback storage solution, i'm not sure if we can prevent this sort of data loss.

Maybe i got something wrong, because i'm not totally sure what you are referring to here and if it isn't extending the scope of this PR/issue massively.

chrispader · 2024-01-27T18:55:44Z

JFYI @marcaaron @tgolen

I'm going to be OOO from 31/01/2024 - 17/03/2024, which means i can either complete a (limited) version of this issue fix/PR by tomorrow/tuesday, or i'll have to hand it over to someone else to take over.

From my perspective, i don't think the current state of this PR affects the behaviour of Onyx negatively at all. The current implementation of tryOrDegradePerformance checks for only this specific error (`Internal error opening backing store for indexedDB.open`) and only then degrades performance.

This PR also includes lots of re-structuring and code quality improvements and paves the way for future TypeScript typings and clean-up.

Do you guys think we can merge this (as-is) and work on advanced error reporting and prevention in a follow-up PR?

tgolen · 2024-01-29T15:38:51Z

I think with such large changes in this PR, and you being gone for so long, I'd actually prefer holding this PR until you are back. Unless someone else is willing and able to take over maintaining it. We've had a lot of issues where Onyx PRs need to be reverted and then it causes a cascade effect on other PRs. I think the risk of causing a regression here is pretty high. The best way to combat that is to split this PR up into multiple smaller pieces.

chrispader · 2024-01-29T16:09:09Z

> I think with such large changes in this PR, and you being gone for so long, I'd actually prefer holding this PR until you are back. Unless someone else is willing and able to take over maintaining it. We've had a lot of issues where Onyx PRs need to be reverted and then it causes a cascade effect on other PRs. I think the risk of causing a regression here is pretty high. The best way to combat that is to split this PR up into multiple smaller pieces.

I understand! I can hand this PR over to another Margelo engineer and ask them to split it up into more easily reviewable PRs. Otherwise (if this is not a priority) i can also continue working on it in March.

marcaaron · 2024-01-29T18:52:37Z

> If i understand you correctly, this means we would need to implement some sort of "transaction" system, to basically roll back changes if an error occurs later in a chain of operations

Hmm no I wasn't trying to imply any kind of transaction system (unsure what that would entail). I am just stating a fact - which is that if you are unable to store an update, but allow storing a later update then you will have irrecoverable "gaps" in the client side data.

hannojg · 2024-02-01T08:27:14Z

Hey, as Chris will be OOO for quite a while, we are internally assigning someone new to carry on this PR.
The idea is, as Tim pointed out, to split the PR up into smaller PRs to reduce the risk of hard-to-fix regressions.

tgolen · 2024-02-02T15:47:33Z

Thanks @hannojg. I'm going to close this PR for now then.

Christoph Pader added 20 commits December 5, 2023 13:01

bring provider functions in same order and create consistent provider interface

0b74731

create MemoryOnlyProvider

71f2246

use MemoryOnlyProvider in jest mocks

d5f864b

add object-sizeof package

40513bd

Merge branch 'main' into @chrispader/memory-only-provider

a20282e

prettier

5de8741

fix: tests with mock

bab2d65

implement MultiStorage interface

0a45abc

use MultiStorage

0eea75f

move MultiStorage and replace mock

735ff1e

init MultiStorage

c96c0d6

fix: tests

b69780f

move storage files

a093836

fix: tests

87c9a5c

fix: remaining imports

e3351d0

fix: jestSetup

6f233cf

rename

f2ea32d

rename

8b1661a

fix: tests

c03468b

remove logs

3d1a558

chrispader mentioned this pull request Dec 26, 2023

Allow the app to work without storage in extreme cases where it fails completely Expensify/App#29403

Closed

Christoph Pader added 9 commits December 27, 2023 12:35

update how multiple storage providers are handled

714e70b

fix: jest check

e44cea5

fix: param type

3684485

fix: keepInstancesSync

eba29ce

fix: call to keepInstancesSync

30844c0

fix: keepInstancesSync

d0f3f01

only allow calls after init

40e756d

fix: initialization

4d68287

remove logs

880ffc0

chrispader requested review from marcaaron and tgolen January 8, 2024 15:56

Christoph Pader added 2 commits January 19, 2024 13:48

use noop provider

d0f74fc

simplify runAfterInit function

919d0b3

fix: return types

75fd6f6

marcaaron requested changes Jan 19, 2024


Christoph Pader added 2 commits January 23, 2024 18:08

Merge branch 'main' into @chrispader/memory-only-provider

67dd00a

try or degrade performance

156938d

chrispader requested a review from marcaaron January 23, 2024 17:44

update tryOrDegradePerformance function

6febd90

Christoph Pader added 2 commits January 25, 2024 10:01

Merge branch 'main' into @chrispader/memory-only-provider

65543ee

add comments

26ce592

tgolen closed this Feb 2, 2024

This was referenced Feb 27, 2024

refactor: unify storage/providers (for further InMemory storage integration) [1/3] #475

Merged

feat: fallback to NoopProvider if we run into OOM [2/3] #483

Merged

feat: fallback to NoopProvider if OOM happens #485

Merged
