Clean md store objects #8700

Open: vh05 wants to merge 1 commit into master
Conversation

@vh05 vh05 (Contributor) commented Jan 20, 2025

Clean md store objects once the number of deleted objects reaches the limit.

Fixes: https://issues.redhat.com/browse/DFBUGS-1339

Signed-off-by: Vinayakswami Hariharmath <[email protected]>
@@ -58,23 +58,39 @@ async function clean_md_store(last_date_to_remove) {
${total_objects_count} objects - Skipping...`);
return;
}
const objects_to_remove = await clean_md_store_objects(last_date_to_remove, config.DB_CLEANER_DOCS_LIMIT);
Contributor:

Did you rearrange the same code into smaller functions? I'm not sure it's needed.

Contributor Author (@vh05):

Yes. We are looking to clean only the deleted md objects, and the other calls in the function are not needed, so I divided the function into 3 sub-functions.

dbg.log2('DB_CLEANER: list objects:', objects_to_remove);
if (objects_to_remove.length) {
if (objects_to_remove.length > config.MD_STORE_MAX_DELETED_OBJECTS_LIMIT) {
Contributor:

Why are we adding this limit? Why not delete less?

Contributor Author (@vh05):

What would be a good number?

Contributor:

I think the number is arbitrary, but we can go with 10 or 20, for example.

@@ -42,6 +43,9 @@ class ObjectsReclaimer {
if (has_errors) {
return config.OBJECT_RECLAIMER_ERROR_DELAY;
}

await clean_md_store_objects(Date.now());
@jackyalbo jackyalbo (Contributor) commented Jan 28, 2025:

I don't think this belongs here... why should we run db_cleaner inside object_reclaimer? db_cleaner will run on objects with no regard to the list that was reclaimed (and of course, we don't want to completely db-delete objects that were just marked as deleted!). I think what we wanted here is new functionality that, for each of the objects, will check how many deleted objects we have with the same key and, if there are more than X, delete the older copies. @dannyzaken to keep me honest here...

Contributor Author (@vh05):

The Jira issue (https://issues.redhat.com/browse/DFBUGS-1339) specifically mentioned that we can clean the deleted objects from the md store in the object reclaimer, so I thought this is the place. @dannyzaken please correct me here.

Member:

@vh05 I didn't intend for the description in Jira to be specific instructions on how to fix it. I wanted to provide some general context.
I don't see a reason to call the db_cleaner from the object_reclaimer. These are two separate bg_workers.

@dannyzaken (Member):

@vh05. I think this PR does not do what it is supposed to do. We want to keep up to SOME_CONSTANT deleted object_md per key, not keep SOME_CONSTANT of total deleted object_mds.
This is mainly to handle the example of overwriting an object. Let's say you upload an object with the key foo every few seconds. You will have many rows with the same key in a short time (all except one should be deleted). We want to limit the number of these rows (I think 100 is a reasonable limit to start with).

In your implementation, you delete any object_md that is deleted and only keep 100 in total. This is a bit too aggressive, in my opinion.

Another thing to consider is that you probably want to ignore objectmds that are not marked as reclaimed.

I strongly suggest creating a similar case on a local deployment. You can easily produce a dataset with >100 overwrites of the same key, so you can test your code.
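To make the suggestion concrete, below is a minimal sketch of the per-key cleanup described above. The function and helper names (clean_overwritten_object_mds, aggregate_deleted_objects_per_key, db_delete_object_mds) and the constant are assumptions for illustration only, not existing md_store APIs.

// A minimal sketch of the "keep up to N deleted object_mds per key" idea.
// All helper names and the constant below are hypothetical, for illustration only.
const MAX_DELETED_MDS_PER_KEY = 100;

async function clean_overwritten_object_mds(md_store, system_id) {
    // Hypothetical aggregation: group soft-deleted (and reclaimed) object_mds
    // by (bucket, key) and return their _ids sorted oldest-first per group.
    const groups = await md_store.aggregate_deleted_objects_per_key(system_id);
    for (const group of groups) {
        if (group.ids_oldest_first.length <= MAX_DELETED_MDS_PER_KEY) continue;
        // Keep the newest MAX_DELETED_MDS_PER_KEY rows for this key, hard-delete the rest.
        const ids_to_remove = group.ids_oldest_first.slice(0, group.ids_oldest_first.length - MAX_DELETED_MDS_PER_KEY);
        await md_store.db_delete_object_mds(ids_to_remove);
    }
}

The aggregation itself is the hard part; the count_objects_per_bucket example in the next comment shows the general shape such an md_store query would take.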

@dannyzaken (Member):

You will probably need to implement new functions in md_store. You can take this function as an example:

    async count_objects_per_bucket(system_id) {
        // TODO check which index is needed to cover this aggregation
        const res = await this._objects.groupBy({
            system: system_id,
            deleted: null,
            delete_marker: null,
            version_past: null
        }, {
            _id: '$bucket',
            count: {
                $sum: 1
            }
        });
        const buckets = {};
        let total_count = 0;
        _.forEach(res, r => {
            buckets[r._id] = r.count;
            total_count += r.count;
        });
        buckets[''] = total_count;
        return buckets;
    }

It is probably not sufficient, and you need to aggregate more data than this (for example, you will need to keep track of the _ids of objects to delete/keep).
Notice that the md_store is written in Mongo query language and is translated to Postgres JSONB queries in the postgres_client:

    async groupBy(match, group) {
        const WHERE = mongo_to_pg('data', encode_json(this.schema, match), { disableContainmentQuery: true });
        const P_GROUP = this._prepare_aggregate_group_query(group);
        try {
            const res = await this.single_query(`SELECT ${P_GROUP.SELECT} FROM ${this.name} WHERE ${WHERE} GROUP BY ${P_GROUP.GROUP_BY}`);
            return res.rows.map(row => { // this is a temp fix, as all the keys are supposed to be ints except _id
                const new_row = {};
                for (const key of Object.keys(row)) {
                    if (key === '_id') {
                        new_row._id = new mongodb.ObjectID(row[key]);
                    } else {
                        new_row[key] = parseInt(row[key], 10);
                    }
                }
                return new_row;
            });
        } catch (err) {
            dbg.error('groupBy failed', match, group, WHERE, P_GROUP, err);
            throw err;
        }
    }

This translation is far from perfect, and we should verify that the generated queries work as expected and perform reasonably well.
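As a rough illustration of the kind of md_store aggregation this bug might need, here is a sketch in the same style as count_objects_per_bucket above. The method name, the filter on deleted/reclaimed, and the compound _id grouping are all assumptions; in particular, the groupBy translation shown above assumes _id is an ObjectId and every other returned column is an integer, so a compound key (or a $push of _ids) would need changes in postgres_client.

    async count_deleted_objects_per_key(system_id) {
        // Hypothetical: count soft-deleted, reclaimed object_mds per (bucket, key).
        const res = await this._objects.groupBy({
            system: system_id,
            deleted: { $ne: null },   // only soft-deleted rows (assumed filter shape)
            reclaimed: { $ne: null }  // ignore rows not yet reclaimed (assumed field name)
        }, {
            _id: { bucket: '$bucket', key: '$key' },
            count: { $sum: 1 }
        });
        // Expected shape, if the translation supports it: [{ _id: { bucket, key }, count }, ...]
        return res;
    }

An alternative to collecting _ids inside the aggregation is to run the groupBy only for counts and then issue a second, ordinary find per offending key to fetch the _ids to delete.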

@vh05 vh05 (Contributor) commented Jan 30, 2025:

> @vh05. I think this PR does not do what it is supposed to do. We want to keep up to SOME_CONSTANT deleted object_md per key, not keep SOME_CONSTANT of total deleted object_mds. This is mainly to handle the example of overwriting an object. Let's say you upload an object with the key foo every few seconds. You will have many rows with the same key in a short time (all except one should be deleted). We want to limit the number of these rows (I think 100 is a reasonable limit to start with).
>
> In your implementation, you delete any object_md that is deleted and only keep 100 in total. This is a bit too aggressive, in my opinion.
>
> Another thing to consider is that you probably want to ignore objectmds that are not marked as reclaimed.
>
> I strongly suggest creating a similar case on a local deployment. You can easily produce a dataset with >100 overwrites of the same key, so you can test your code.

  1. If we delete the objects that are already deleted (keeping only reclaimed), would that harm us, or is there any reason we keep the deleted objects in the md_store?

  2. If the number of deleted objects is unusually high on a regular basis, there is a high chance of overwrites on a particular key, and if we delete them after a certain limit, we are actually deleting the deleted objects that resulted from overwrites. Isn't it?

  3. As you said, deleting the object_mds of a particular key requires tracking the bucket and then the object. Isn't that too costly an operation?

@dannyzaken (Member):

> 1. If we delete the objects that are already deleted (keeping only reclaimed), would that harm us, or is there any reason we keep the deleted objects in the md_store?

We perform a soft delete (mark the deleted timestamp) since it can help debug or fix issues with deleted objects. The data is still in the DB and not gone forever.
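For context, a minimal sketch of what the soft delete amounts to conceptually; the update call and field layout here are assumptions, not the actual md_store code:

// Soft delete sketch: the row stays in the objectmds collection and only gains
// a 'deleted' timestamp; db_cleaner later hard-deletes old rows by date.
async function soft_delete_object_md(md_store_objects, obj_id) {
    await md_store_objects.updateOne(
        { _id: obj_id },
        { $set: { deleted: new Date() } }
    );
}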

> 2. If the number of deleted objects is unusually high on a regular basis, there is a high chance of overwrites on a particular key, and if we delete them after a certain limit, we are actually deleting the deleted objects that resulted from overwrites. Isn't it?

I'm not sure I understand your point. We don't want to remove deleted rows arbitrarily after reaching some limit. We specifically want to handle this overwrite use case, where a single object is frequently overwritten.
For the general cleanup of deleted rows, the db_cleaner deletes them by date, which is good enough for now, and we can tweak it if necessary. This is not in the scope of this bug.

> 3. As you said, deleting the object_mds of a particular key requires tracking the bucket and then the object. Isn't that too costly an operation?

Of course, we need to profile it once the code is ready. We can run EXPLAIN queries once we have a working query and analyze the performance.
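For the profiling step, one option is to wrap the generated SQL with EXPLAIN. A sketch, assuming access to the same helpers used in the groupBy above (mongo_to_pg, encode_json, _prepare_aggregate_group_query, single_query); the wrapper function itself is hypothetical:

    async explain_group_by(match, group) {
        const WHERE = mongo_to_pg('data', encode_json(this.schema, match), { disableContainmentQuery: true });
        const P_GROUP = this._prepare_aggregate_group_query(group);
        const sql = `SELECT ${P_GROUP.SELECT} FROM ${this.name} WHERE ${WHERE} GROUP BY ${P_GROUP.GROUP_BY}`;
        // EXPLAIN (ANALYZE, BUFFERS) actually runs the query and reports timing,
        // buffer usage, and whether the JSONB predicates hit an index or a seq scan.
        const res = await this.single_query(`EXPLAIN (ANALYZE, BUFFERS) ${sql}`);
        for (const row of res.rows) dbg.log2('explain_group_by:', row['QUERY PLAN']);
    }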

@vh05 vh05 (Contributor) commented Jan 31, 2025:


Sure, Danny. That clarifies my queries.
