Implement the migrate iterator for iterating key values #1989

git-hulk · 2024-01-05T10:42:28Z

Currently, we need to iterate all keys in the database in different places like the cluster migration and kvrocks2redis, but don't have an iterator for this purpose. It's very error-prone to implement this in different places since Kvrocks may add a new column family in the future, and we must be careful to iterate all keys in all column families. This would be a burden for maintenance, So I think we could add an iterator to iterate all keys in all column families.

Proposal

For the API, we could generally comply with the rocksdb's iterator API, but we allow to add some extra functions if needed.

class Iterator {
public:
    Iterator(Storage *storage, const rocksdb::ReadOptions &options, const int slot = -1);
    ~KeyIterator();

    bool Valid() const;
    void Next();

    rocksdb::WriteBatch *Batch() const;
    Slice Key() const;
    Slice Value() const;
    RedisType Type() const;
};

And when implementing this iterator, it will iterate the metadata column family first and check its type, if it's not a string, then it will iterate the corresponding column family to get subkeys. That said, if we have a key foo with type hash, then the iterator will iterate foo and foo:field1, foo:field2, and so on.

This solution can bring those benefits:

The codes look more intutive
Can reuse this iterator if we want to iterate keys only

The text was updated successfully, but these errors were encountered:

git-hulk · 2024-01-10T12:22:04Z

@caipengbo Would you mind taking a look while you're free.

caipengbo · 2024-01-10T12:57:22Z

Would you mind taking a look while you're free.

Good job, LGTM!

Currently, we need to iterate all keys in the database in different places like the cluster migration and kvrocks2redis, but don't have an iterator for this purpose. It's very error-prone to implement this in different places since Kvrocks may add a new column family in the future, and we must be careful to iterate all keys in all column families. This would be a burden for maintenance, So we want to implement an iterator for iterating keys. ```C++ DBIter iter(storage, read_option); for (iter.Seek(); iter.Valid(); iter.Next()) { if (iter.Type() == kRedisString || iter.Type() == kRedisJSON) { // the string/json type didn't have subkeys continue; } auto subkey_iter = iter.GetSubKeyIterator(); for (subkey_iter.Seek(); subkey_iter.Valid(); subkey_iter.Next()) { // handle its subkey and value here } } ``` When using this iterator, it will iterate the metadata column family first and check its type, if it's not a string or JSON, then it will iterate the corresponding column family to get subkeys. That said, if we have a key foo with type hash, then the iterator will iterate foo and foo:field1, foo:field2, and so on. This solution can bring those benefits: - The codes look more intuitive - Can reuse this iterator if we want to iterate keys only This closes #1989

git-hulk mentioned this issue Jan 5, 2024

Improve slot migration speed and resource consumption using raw key values #1223

Open

5 tasks

git-hulk self-assigned this Jan 5, 2024

git-hulk mentioned this issue Jan 11, 2024

Implement an unify key-value iterator for Kvrocks #2004

Merged

git-hulk closed this as completed in #2004 Jan 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement the migrate iterator for iterating key values #1989

Implement the migrate iterator for iterating key values #1989

git-hulk commented Jan 5, 2024 •

edited

Loading

git-hulk commented Jan 10, 2024

caipengbo commented Jan 10, 2024

Implement the migrate iterator for iterating key values #1989

Implement the migrate iterator for iterating key values #1989

Comments

git-hulk commented Jan 5, 2024 • edited Loading

Proposal

git-hulk commented Jan 10, 2024

caipengbo commented Jan 10, 2024

git-hulk commented Jan 5, 2024 •

edited

Loading