
[Feature Discuss] Partition-Level Bucket Index #12699

Open
zhangyue19921010 opened this issue Jan 24, 2025 · 7 comments
@zhangyue19921010 (Contributor) commented Jan 24, 2025

Hi Hudi community:

As we know, Hudi proposed and introduced the Bucket Index in RFC-29. The Bucket Index unifies the indexes of Flink and Spark well, i.e., Spark and Flink can upsert the same Hudi table using the bucket index.

However, the Bucket Index is limited to a fixed number of buckets. To solve this problem, RFC-42 proposed consistent hashing, which achieves bucket resizing by dynamically splitting or merging local buckets.

But from production experience, sometimes a Partition-Level Bucket Index plus an offline way to rescale buckets is good enough, without introducing additional machinery (multiple writers, clustering, automatic resizing, etc.). The more complex the architecture, the more error-prone it is and the greater the operation and maintenance pressure.

In this regard, we could upgrade the traditional Bucket Index into a Partition-Level Bucket Index, so that users can set a specific number of buckets for different partitions through a rule engine (such as regular expression matching). On the other hand, for existing partitions, an offline command is provided to reorganize the data using insert overwrite (this requires stopping data writes to the partition being rescaled).
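To make the rule-engine idea concrete, here is a minimal sketch (in Python, purely illustrative; all names, patterns, and defaults are hypothetical, not an actual Hudi API): an ordered list of (partition-path regex, bucket number) rules where the first match wins and a default applies otherwise.

```python
import re

# Hypothetical rule table: (partition-path regex, bucket number).
# First matching rule wins; DEFAULT_BUCKETS applies when nothing matches.
RULES = [
    (re.compile(r"dt=2025-01-.*"), 256),  # hot partitions get more buckets
    (re.compile(r"dt=2024-.*"), 32),      # colder history needs fewer buckets
]
DEFAULT_BUCKETS = 64

def resolve_bucket_number(partition_path: str) -> int:
    """Resolve the bucket number for a partition via first-match-wins rules."""
    for pattern, buckets in RULES:
        if pattern.fullmatch(partition_path):
            return buckets
    return DEFAULT_BUCKETS
```

With such a table, `resolve_bucket_number("dt=2025-01-24")` would pick 256 while an unmatched partition falls back to the default.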

More importantly, the existing Bucket Index table can be upgraded to Partition-Level Bucket Index smoothly and seamlessly.

Any thoughts on this change? Feedback would be greatly appreciated!

@danny0405 (Contributor)

Are you saying to support configuring the bucket number at the partition level? Can the bucket number be changed after an explicit configuration?

@zhangyue19921010 (Contributor, Author) commented Jan 30, 2025

> Are you saying to support configuring the bucket number at the partition level? Can the bucket number be changed after an explicit configuration?

Hi @danny0405

Thanks for your attention. Yes, we will provide a new mechanism to specify the number of buckets for different partitions through an expression. For existing partitions, we can change a partition's bucket number through an offline job (insert overwrite) with the new bucket number. For new partitions, the initial bucket number is derived from the expression. Updates to the expression are also supported, but an updated expression only takes effect for new partitions.

The functionality is similar to https://paimon.apache.org/docs/master/maintenance/rescale-bucket/
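The "expression updates only affect new partitions" semantics can be sketched as follows (a hypothetical model; in practice the recorded bucket number would be persisted in table metadata, not an in-memory dict):

```python
# partition -> pinned bucket number; persisted in table metadata in practice.
recorded = {}

def bucket_number_for_write(partition: str, expr_result: int) -> int:
    """Pin the bucket number from the current expression on first write;
    later expression changes do not affect an already-recorded partition."""
    if partition not in recorded:
        recorded[partition] = expr_result
    return recorded[partition]

def offline_rescale(partition: str, new_buckets: int) -> None:
    """Models the offline insert-overwrite job: data is rewritten into
    new_buckets buckets, then the recorded value is updated."""
    recorded[partition] = new_buckets
```

So a partition first written with 16 buckets keeps 16 even after the expression starts yielding 32; only the explicit offline rescale changes it.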

@danny0405 (Contributor)

Can the bucket number be dynamically inferred with the inputs? Hudi has a partition metadata file under each partition, maybe we can record the bucket number there.

@zhangyue19921010 (Contributor, Author) commented Feb 7, 2025

> Can the bucket number be dynamically inferred with the inputs? Hudi has a partition metadata file under each partition, maybe we can record the bucket number there.

Hi @danny0405, thanks for your reply.
We could abstract a strategy to control how the bucket number is calculated, for example:

  1. Calculate the bucket number based on a regular expression.
  2. Calculate the bucket number based on a fixed value set by the user.
  3. Calculate the bucket number based on metadata (daily input size and average bucket size) from the metadata table (MDT).

As for where to record the partition bucket number: how about .hoodie_partition_metadata? Then users who disable the MDT can also use this partition-level bucket index.
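The third, metadata-driven strategy could be as simple as dividing daily input size by a target bucket size. A minimal sketch (function and parameter names are hypothetical):

```python
def buckets_from_stats(daily_input_bytes: int,
                       target_bucket_bytes: int,
                       min_buckets: int = 1) -> int:
    """Derive a bucket number from size stats: round up so that each
    bucket stays at or below the target average size."""
    n = -(-daily_input_bytes // target_bucket_bytes)  # ceiling division
    return max(min_buckets, n)
```

For instance, 10 GiB of daily input with a 1 GiB target bucket size yields 10 buckets; tiny partitions bottom out at the configured minimum.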

@xiarixiaoyao (Contributor)

@zhangyue19921010 We have implemented dynamic partition bucketing, which supports regular expressions, similar to your idea. The only difference is that we store the bucket information in the .hoodie/.bucket directory. Since the bucket information is minimal, it is efficient to store it in a single file. This approach simplifies retrieving partition-level bucket counts and performing bucket pruning. At the same time, with the help of Hudi's timeline, we can easily ensure the consistency of the bucket information.
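As a purely illustrative guess (not the actual structure used in that implementation), a single bucket-info file versioned by the committing instant might look like:

```python
import json

# Hypothetical shape of a single bucket-info file: regex rules plus the
# commit instant that wrote this version, so readers can use the timeline
# to pick the latest committed version consistently.
bucket_info = {
    "instant": "20250124120000000",  # commit time of this version (made up)
    "rules": [
        {"regex": "dt=2025-.*", "buckets": 256},
        {"regex": ".*", "buckets": 64},
    ],
}

serialized = json.dumps(bucket_info)
restored = json.loads(serialized)  # readers parse the latest committed file
```

The key point is that the file is tiny, so rewriting it whole on every change and letting the timeline arbitrate the current version is cheap.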

@zhangyue19921010 (Contributor, Author)

> @zhangyue19921010 We have implemented dynamic partition bucketing, which supports regular expressions, similar to your idea. The only difference is that we store the bucket information in the .hoodie/.bucket directory. Since the bucket information is minimal, it is efficient to store it in a single file. This approach simplifies retrieving partition-level bucket counts and performing bucket pruning. At the same time, with the help of Hudi's timeline, we can easily ensure the consistency of the bucket information.

Hi @xiarixiaoyao, thanks for your reply! It seems that a dynamic partition-level bucket index is indeed a common requirement.
The .hoodie/.bucket directory is a good idea. But how do we handle two jobs writing concurrently? In that case multiple tasks may operate on the partition meta file (even if they write to different partitions).

@danny0405 (Contributor)

@xiarixiaoyao Can you share the data structure under .hoodie/.bucket?
