
Add support for reviving Dates #829

Closed
michaelkrog opened this issue Aug 6, 2024 · 8 comments

Comments

@michaelkrog
Contributor

If objects in collections contain Date objects, they are converted to strings when read back from a persistence adapter.

This is true for the OPFS, localStorage, and filesystem adapters. All of them use plain JSON.stringify and JSON.parse, which converts Dates to strings.

A simple solution could be to add a reviver function to JSON.parse:

// Revive strings that look like ISO 8601 timestamps (heuristic)
const isoDatePattern = /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d{3})?Z$/;

function dateReviver(key, value) {
  if (typeof value === 'string' && isoDatePattern.test(value)) {
    return new Date(value);
  }
  return value;
}

// Parse the JSON string with the reviver function
const parsedObj = JSON.parse(jsonString, dateReviver);

Better yet, it could be encapsulated in a shared helper that all persistence adapters can use.

Suggested solution

I suggest that serializing a Date to JSON should resolve to an object like this:

{
  "createdAt": {
    "$type": "Date",
    "data": "2024-08-01T12:00:00Z"
  }
}

Doing that requires a replacer function for JSON.stringify and a reviver for JSON.parse. Here is a rough example.

stringify

function dateReplacer(key, value) {
  // `value` already holds the ISO string here, because JSON.stringify
  // applies Date#toJSON before calling the replacer; check the original
  // value via `this[key]` instead.
  if (this[key] instanceof Date) {
    return { $type: 'Date', data: this[key].toISOString() };
  }
  return value;
}

const obj = {
  createdAt: new Date('1990-01-01')
};

const jsonString = JSON.stringify(obj, dateReplacer);

parse

function dateReviver(key, value) {
  if (value && typeof value === 'object' && value.$type === 'Date') {
    return new Date(value.data);
  }
  return value;
}

const parsedObj = JSON.parse(jsonString, dateReviver);
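To confirm the format round-trips, including Dates nested in arrays and sub-objects, a quick check (the replacer and reviver above are repeated so the snippet runs on its own):

```javascript
// Replacer and reviver from above, repeated so this snippet is standalone
function dateReplacer(key, value) {
  if (this[key] instanceof Date) {
    return { $type: 'Date', data: this[key].toISOString() };
  }
  return value;
}
function dateReviver(key, value) {
  if (value && typeof value === 'object' && value.$type === 'Date') {
    return new Date(value.data);
  }
  return value;
}

// Round trip: Dates survive even when nested in arrays and sub-objects
const original = {
  createdAt: new Date('1990-01-01'),
  history: [{ at: new Date('2024-08-01T12:00:00Z') }],
};
const restored = JSON.parse(JSON.stringify(original, dateReplacer), dateReviver);

console.log(restored.createdAt instanceof Date); // true
console.log(restored.history[0].at instanceof Date); // true
```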
@maxnowack
Owner

maxnowack commented Aug 6, 2024

Good idea! I know EJSON from Meteor, which provides similar functionality.
I did a bit of research and found superjson, which looks almost complete.
I'm unsure right now whether it's a good idea to introduce such a dependency to SignalDB, but since this is a clear issue, I think there is no way around it.
One additional thought I had is to split the default persistence providers out of the signaldb package.
Do you have an opinion on that?

@jamesgibson14
Contributor

My five cents: I use EJSON because I come from Meteor, but I only use it for Dates. I think a simple out-of-the-box solution for data persistence is nice, but I am also looking into syncing large collections (or subsets of large collections), and it looks like sync and persistence need to be separated from SignalDB collections.

@obedm503
Contributor

obedm503 commented Aug 7, 2024

Superjson could be an optional dependency since not everyone will use the storage adapters that store json.

@michaelkrog
Contributor Author

Interesting how superjson solves this (with extra meta on the side). 🤔 It does seem very complete. An important factor is also performance, and I noticed some reports of poor performance in superjson:

blitz-js/legacy-framework#778 (comment)

flightcontrolhq/superjson#25

I think it would make sense to split the persistence providers into their own package, along with any extra dependencies.
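For illustration only, the "meta on the side" idea can be sketched like this: serialize with plain JSON and record where the non-JSON types were, then use that record to revive them. This is a toy, not superjson's actual wire format, and all names here are made up:

```javascript
// Toy sketch of the "meta on the side" approach. Note: recording bare
// keys only works for flat objects; a real library records full paths.
function serializeWithMeta(obj) {
  const datePaths = [];
  const json = JSON.stringify(obj, function (key, value) {
    if (this[key] instanceof Date) {
      // Remember where the Date was, store it as a plain ISO string
      datePaths.push(key);
      return this[key].toISOString();
    }
    return value;
  });
  return { json, meta: { dates: datePaths } };
}

function deserializeWithMeta({ json, meta }) {
  return JSON.parse(json, (key, value) =>
    meta.dates.includes(key) ? new Date(value) : value
  );
}
```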

@michaelkrog
Contributor Author

devalue is mentioned in one of the links above as another (fast) JSON converter.

@maxnowack
Owner

I’ve just created a pull request (#832) that adds the ability to pass custom serialization and deserialization functions to the local storage, file system, and OPFS persistence adapters. This is an intermediate solution until we can separate the persistence adapters into their own packages (planned for v1).

Thanks to @michaelkrog for highlighting the performance issues with superjson. Performance is definitely a crucial factor in selecting a library for this purpose. I’m still undecided on whether this functionality should be integrated into signaldb or would be more appropriate in a custom persistence adapter.

Additionally, we need to consider how to handle multiple versions of different serialized data formats and possibly migrate them during upgrades.
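For illustration, here's the rough shape such custom functions could take, reusing the `$type` convention from earlier in this thread. The `serialize`/`deserialize` option names and the adapter call are assumptions based on the description of #832, not a confirmed API:

```javascript
// dateReplacer / dateReviver follow the $type convention from earlier
// in this thread (repeated here so the snippet runs standalone)
function dateReplacer(key, value) {
  return this[key] instanceof Date
    ? { $type: 'Date', data: this[key].toISOString() }
    : value;
}
function dateReviver(key, value) {
  return value && typeof value === 'object' && value.$type === 'Date'
    ? new Date(value.data)
    : value;
}

// The shape the custom serialization functions could take
// (option names `serialize`/`deserialize` are assumed from #832):
const serialize = (items) => JSON.stringify(items, dateReplacer);
const deserialize = (itemsString) => JSON.parse(itemsString, dateReviver);

// Hypothetical adapter usage:
// createLocalStorageAdapter('contacts', { serialize, deserialize })
```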

@michaelkrog
Contributor Author

michaelkrog commented Aug 7, 2024

Additionally, we need to consider how to handle multiple versions of different serialized data formats and possibly migrate them during upgrades.

I see 2 different angles to migration here.

When the replicated source changes its model

I noticed in one of Linear's videos that what they do when migration is needed (e.g. models change server-side) is simply pulling everything again. Even if that means loading a lot of data again, I prefer this direction for replicated cases. I can't even imagine the complexity of synchronizing proper client-side migrations with the server-side ones.

When the persistence layer changes its storage format

I think your comment was from this viewpoint, namely the migration needed when the persistence layer in signaldb changes how it stores data.

As I see it signaldb is the in-memory database - FULL STOP! 😀 signaldb by itself needs no migration, because it has no persistence.

Persistence layers and replication are, from my point of view, just "add-ons", and migration lies with them.

My recommendation

For replication migration (when a replication source changes its model), I would simply explain in the documentation that the preferred method is to re-pull everything on the client. Down the road that could come with some recommendations on how to keep track of it. What we plan to do in Previsto is to add the version of the REST API to the name of the persistence layer, e.g. 2024-01-04.contacts for version 2024-01-04 and the contacts entities.

When the application starts and the version does not match the last used API version, we wipe everything and pull it all again.

For changes to the persistence layer data format, I think your intermediate solution is quite flexible and can go a long way. It leaves the migration up to the consumer of signaldb, but in reality I would imagine that most users would be using signaldb with replication, and then they would probably always be able to just re-pull everything. I guess what I am saying is: let's just wait and see what is needed.
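The version check described above can be sketched in a few lines. All names here are hypothetical, and a Map stands in for localStorage:

```javascript
// Minimal sketch of the wipe-and-repull check: if the stored API version
// differs from the current one, clear persisted data so the next sync
// re-pulls everything. `storage` stands in for localStorage.
const CURRENT_API_VERSION = '2024-01-04';
const storage = new Map();

function ensureFreshPersistence(collectionName) {
  const lastVersion = storage.get('lastApiVersion');
  if (lastVersion !== CURRENT_API_VERSION) {
    // Version mismatch: wipe everything and remember the new version
    storage.clear();
    storage.set('lastApiVersion', CURRENT_API_VERSION);
  }
  // Version-prefixed name to use for the persistence adapter
  return `${CURRENT_API_VERSION}.${collectionName}`;
}
```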

@maxnowack
Owner

I just released the custom serialization (#832) in v0.15.0 🙂

I noticed in one of Linear's videos that what they do when migration is needed (e.g. models change server-side) is simply pulling everything again.

I totally agree with you. It isn't worth running and maintaining backend database migrations on the client 😀

I think your comment was from this viewpoint, namely the migration needed when the persistence layer in signaldb changes how it stores data.

Yes, exactly. It’s only an issue in a client-only application with persistent storage when the format in which the data is saved changes.

As I see it signaldb is the in-memory database - FULL STOP! 😀 signaldb by itself needs no migration, because it has no persistence.
Persistence layers and replication are, from my point of view, just "add-ons", and migration lies with them.

Absolutely! I don't want to have any migration related stuff in the core package 😀
I had in mind that if we separate the persistence adapters into their own packages and integrate a tool such as devalue or superjson to serialize complex types, we must stick with that selection or devise a clever plan to manage potential migrations if we switch formats. Of course, there are solutions; all have their pros and cons. For example, if we include the migration directly in the code, that part of the code is completely unneeded except in the case where the data actually needs to be migrated.

What we plan to do in Previsto is to add the version of the REST API to the name of the persistence layer, e.g. 2024-01-04.contacts for version 2024-01-04 and the contacts entities.

That's actually a neat idea! It even makes it possible to send not-yet-committed persisted data from a previous version to an endpoint that still accepts that version, before deleting it and pulling the new data.
Perhaps it is possible to include such functionality in a future and more sophisticated version of the ReplicatedCollection, so that it is done automatically and you don't have to rely on parsing the name and extracting the version.

For changes to the persistence layer data format, I think your intermediate solution is quite flexible and can go a long way. It leaves the migration up to the consumer of signaldb, but in reality I would imagine that most users would be using signaldb with replication, and then they would probably always be able to just re-pull everything. I guess what I am saying is: let's just wait and see what is needed.

You're right. I think this is our best option so far 🙂
