Read-only secondary indexes should get a tailored in-memory data structure #624

JohannesLichtenberger · 2023-06-19T20:14:19Z

Currently, we're reading the red-black tree nodes from the disk on the first load and putting the nodes into a global buffer manager/cache. Still, we could, for instance, also use the adaptive radix tree or an even better data structure for read-only...

anmol797 · 2024-06-19T21:13:28Z

Still open ?

JohannesLichtenberger · 2024-06-19T21:23:27Z

Yes

JohannesLichtenberger · 2024-06-21T17:15:53Z

@anmol797 we should probably chat about this

anmol797 · 2024-06-21T17:26:25Z

Yes

can i take it up ?
i am new to open source contribution

anmol797 · 2024-06-21T17:26:41Z

@anmol797 we should probably chat about this

Sure

JohannesLichtenberger · 2024-06-21T17:38:59Z

Are you familiar with basic data structures, especially functional/persistent structures? I'd like to discuss this first with someone already familiar to these topics :-)

anmol797 · 2024-06-21T17:40:47Z

Are you familiar with basic data structures, especially functional/persistent structures? I'd like to discuss this first with someone already familiar to these topics :-)

yes i am aware of all these
i am a fresher working professional in IT sector

JohannesLichtenberger · 2024-06-21T18:02:59Z

Ok, so basically the main datastructure in Sirix is a "keyed", persistent trie to fetch pages (full pages) or page fragments (e.g. only changed records plus records which fall out of a sliding window). The data is stored in the leaf pages of the tries. Currently, we store JSON nodes or XML nodes in a trie. Secondary indexes based on Red/Black balanced binary trees are currently stored in other tries.

Now, as red/black trees are not cache-line friendly (and hopefully Valhalla bears some fruits better sooner than later) we could alternatively store the secondary indexes as Adaptive Radix trees (or the newer variant Height Optimized trees). That said to incorporate also the sliding snapshot algorithm used to version the current leaf pages is a lot of work...

I'm currently not sure if the rotations due to balancing a binary tree lead to a lot of copied tree nodes (which are currently stored in the trie leaf pages). Instead of implementing a persistent ART with also versioning the leaf pages it would of course be much simpler to simply read the stored red black tree nodes into an ART, but not sure if that really would make sense.

On the other hand it's currently "nice to have", but a lot of work (maybe half a year or maybe even much more as it's done in spare time).

Another pressing issue is, why a full scan of a bigger resource (3,8Gb JSON file stored in Sirix) in parallel traversed by N read-only trxs is much slower than with only one trx (it's not at all obvious for me currently when profiling what's the issue).

JohannesLichtenberger · 2024-06-21T18:04:04Z

https://sirix.io/docs/concepts

anmol797 · 2024-06-22T14:55:01Z

Ok, so basically the main datastructure in Sirix is a "keyed", persistent trie to fetch pages (full pages) or page fragments (e.g. only changed records plus records which fall out of a sliding window). The data is stored in the leaf pages of the tries. Currently, we store JSON nodes or XML nodes in a trie. Secondary indexes based on Red/Black balanced binary trees are currently stored in other tries.

Now, as red/black trees are not cache-line friendly (and hopefully Valhalla bears some fruits better sooner than later) we could alternatively store the secondary indexes as Adaptive Radix trees (or the newer variant Height Optimized trees). That said to incorporate also the sliding snapshot algorithm used to version the current leaf pages is a lot of work...

I'm currently not sure if the rotations due to balancing a binary tree lead to a lot of copied tree nodes (which are currently stored in the trie leaf pages). Instead of implementing a persistent ART with also versioning the leaf pages it would of course be much simpler to simply read the stored red black tree nodes into an ART, but not sure if that really would make sense.

On the other hand it's currently "nice to have", but a lot of work (maybe half a year or maybe even much more as it's done in spare time).

Another pressing issue is, why a full scan of a bigger resource (3,8Gb JSON file stored in Sirix) in parallel traversed by N read-only trxs is much slower than with only one trx (it's not at all obvious for me currently when profiling what's the issue).

okay okay , understood , so basically the optimization of Read need to be done ""

" Instead of implementing a persistent ART with also versioning the leaf pages it would of course be much simpler to simply read the stored red black tree nodes into an ART, but not sure if that really would make sense." can you please explain a bit more ?
and this is the approach you want me to use ? "Now, as red/black trees are not cache-line friendly (and hopefully Valhalla bears some fruits better sooner than later) we could alternatively store the secondary indexes as Adaptive Radix trees"

JohannesLichtenberger · 2024-06-22T15:33:34Z

Point is it's currently more of a "nice to have" thing with a huge possibility that work is not going to be finished and it's more like "this could be better" ;-)

However, I think what would be valuable in any case would be to separate the PageReadOnlyTrx and the PageTrx from the current trie implementation. IMHO these should be the StorageEngineReader,StorageEngineWriter with separate KeyedTrieReader,KeyedTrieWriter classes. Maybe you'd like to start with this "subtask" to get familiar with the code base?

JohannesLichtenberger · 2024-06-22T16:20:44Z

TreeModifierImpl and the interface would probably be the KeyedTrieWriter.

JohannesLichtenberger · 2024-06-22T16:32:20Z

Oh and BTW: we have to fix this first as currently the CI build always fails due to an old docker image format used by Keycloak 7.0.1, which is of course super old and should be updated: #711

anmol797 · 2024-06-22T22:29:12Z

Point is it's currently more of a "nice to have" thing with a huge possibility that work is not going to be finished and it's more like "this could be better" ;-)

However, I think what would be valuable in any case would be to separate the PageReadOnlyTrx and the PageTrx from the current trie implementation. IMHO these should be the StorageEngineReader,StorageEngineWriter with separate KeyedTrieReader,KeyedTrieWriter classes. Maybe you'd like to start with this "subtask" to get familiar with the code base?

yeah sure , that will be more effective and helpful for me

please let me know what to start with

anmol797 · 2024-06-22T22:30:43Z

Oh and BTW: we have to fix this first as currently the CI build always fails due to an old docker image format used by Keycloak 7.0.1, which is of course super old and should be updated: #711

okay
i can help with that too

JohannesLichtenberger · 2024-06-23T08:25:35Z

You could start with the Keycloak issue and then with the proposed refactoring?

anmol797 · 2024-06-24T16:31:06Z

You could start with the Keycloak issue and then with the proposed refactoring?

ok ,
please tell me how to start with that

JohannesLichtenberger · 2024-06-24T17:15:57Z

You have to update the docker files and check that it works again, as startup scripts to import the realm are not supported anymore

anmol797 · 2024-06-25T15:47:49Z

You have to update the docker files and check that it works again, as startup scripts to import the realm are not supported anymore

how to test that change ? any reference ?

JohannesLichtenberger added enhancement good first issue help wanted Hacktoberfest labels Jun 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Read-only secondary indexes should get a tailored in-memory data structure #624

Read-only secondary indexes should get a tailored in-memory data structure #624

JohannesLichtenberger commented Jun 19, 2023

anmol797 commented Jun 19, 2024

JohannesLichtenberger commented Jun 19, 2024

JohannesLichtenberger commented Jun 21, 2024

anmol797 commented Jun 21, 2024

anmol797 commented Jun 21, 2024

JohannesLichtenberger commented Jun 21, 2024

anmol797 commented Jun 21, 2024

JohannesLichtenberger commented Jun 21, 2024

JohannesLichtenberger commented Jun 21, 2024

anmol797 commented Jun 22, 2024

JohannesLichtenberger commented Jun 22, 2024

JohannesLichtenberger commented Jun 22, 2024

JohannesLichtenberger commented Jun 22, 2024

anmol797 commented Jun 22, 2024

anmol797 commented Jun 22, 2024

JohannesLichtenberger commented Jun 23, 2024

anmol797 commented Jun 24, 2024

JohannesLichtenberger commented Jun 24, 2024

anmol797 commented Jun 25, 2024

Read-only secondary indexes should get a tailored in-memory data structure #624

Read-only secondary indexes should get a tailored in-memory data structure #624

Comments

JohannesLichtenberger commented Jun 19, 2023

anmol797 commented Jun 19, 2024

JohannesLichtenberger commented Jun 19, 2024

JohannesLichtenberger commented Jun 21, 2024

anmol797 commented Jun 21, 2024

anmol797 commented Jun 21, 2024

JohannesLichtenberger commented Jun 21, 2024

anmol797 commented Jun 21, 2024

JohannesLichtenberger commented Jun 21, 2024

JohannesLichtenberger commented Jun 21, 2024

anmol797 commented Jun 22, 2024

JohannesLichtenberger commented Jun 22, 2024

JohannesLichtenberger commented Jun 22, 2024

JohannesLichtenberger commented Jun 22, 2024

anmol797 commented Jun 22, 2024

anmol797 commented Jun 22, 2024

JohannesLichtenberger commented Jun 23, 2024

anmol797 commented Jun 24, 2024

JohannesLichtenberger commented Jun 24, 2024

anmol797 commented Jun 25, 2024