[server] Add the leader of bucket to metadata cache #386

luoyuxia · 2025-02-12T04:09:03Z

Purpose

Linked issue: close #386
Currnetly, to know the leader of a TableBucket, we always need to get from zk..When the buckets increase, it will cost too many time which will cause the job fail to startup just like the issue describes

In this pr, we only add the leader of a TableBucket to metadata for simplicity, maybe in the future, we can add more to metadata cache.

In ServerMetadataCache, cache the mapping from tableBucket to leader
Modify UpdateMetadataRequest in rpc proto to add PbTableBucketMetadata, deleted_table_id, `deleted_partition_id.
When the leader of any bucket changes, coordinator sends UpdateMetadataRequest with the updated leader in PbTableBucketMetadata to tablet servers, then tablet servers can update their own cache.
When table/partition is dropped, coordinator sends UpdateMetadataRequest with deleted_table_id/deleted_partition_id to tablet servers, then tablet servers can remove these tables from their own cache

Tests

ServerMetadataCacheImplTest to verify the ServerMetadataCache works as expected.
MetadataUpdateITCase to verify when table created, bucket leader changed, server down/up, the cache is as expected.

API and Format

N/A

Documentation

N/A

luoyuxia · 2025-02-13T12:23:05Z

@wuchong Add the leader of bucket to metadata cache to avoid get table metadata cost too much time.. Pr ready

wuchong · 2025-03-03T08:57:49Z

...onnector-flink/src/test/java/com/alibaba/fluss/connector/flink/catalog/FlinkCatalogTest.java

@@ -130,7 +129,7 @@ void beforeEach() throws Exception {
        } catch (CatalogException e) {
            // the auto partitioned manager may create the db zk node
            // in an another thread, so if exception is NodeExistsException, just ignore
-            if (!ExceptionUtils.findThrowable(e, KeeperException.NodeExistsException.class)
+            if (!ExceptionUtils.findThrowableWithMessage(e, "KeeperException$NodeExistsException")


Could you extract this into a separate hotfix pull request? This seems quite unstable in these days. https://github.com/alibaba/fluss/actions/runs/13626469089/job/38084662575

add metadata cache

8621768

luoyuxia force-pushed the add-metadata-cache-final branch 3 times, most recently from 20fbae3 to c6aca6e Compare February 12, 2025 12:10

luoyuxia marked this pull request as ready for review February 13, 2025 02:41

luoyuxia force-pushed the add-metadata-cache-final branch 4 times, most recently from 96ba9ea to 8432333 Compare February 13, 2025 08:23

refactor

07bfb61

luoyuxia force-pushed the add-metadata-cache-final branch from 8432333 to 07bfb61 Compare February 13, 2025 11:55

luoyuxia changed the title ~~add metadata cache~~ [server] Add the leader of bucket to metadata cache Feb 13, 2025

luoyuxia requested review from wuchong, swuferhong and loserwang1024 February 13, 2025 12:22

wuchong reviewed Mar 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[server] Add the leader of bucket to metadata cache #386

[server] Add the leader of bucket to metadata cache #386

luoyuxia commented Feb 12, 2025 •

edited

Loading

luoyuxia commented Feb 13, 2025

wuchong Mar 3, 2025

[server] Add the leader of bucket to metadata cache #386

Are you sure you want to change the base?

[server] Add the leader of bucket to metadata cache #386

Conversation

luoyuxia commented Feb 12, 2025 • edited Loading

Purpose

Tests

API and Format

Documentation

luoyuxia commented Feb 13, 2025

wuchong Mar 3, 2025

Choose a reason for hiding this comment

luoyuxia commented Feb 12, 2025 •

edited

Loading