You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are using Merge on Hive Acid tables (non-bucketed) to maintain SCD Type-1 data with incremental updates from our sources. Over a period of time, regular merge statements on this table resulted in lot small files (even after compaction). This has a direct impact on dependent ETLs and queries. However this issue could be avoided with support for bucketed tables since the data would always be split into pre-defined # buckets post compaction.
@amoghmargoor above limitation is causing a lot of instability in the hive warehouse. Kindly check the possibility of supporting bucketed tables in MERGE.
The text was updated successfully, but these errors were encountered:
We are using Merge on Hive Acid tables (non-bucketed) to maintain SCD Type-1 data with incremental updates from our sources. Over a period of time, regular merge statements on this table resulted in lot small files (even after compaction). This has a direct impact on dependent ETLs and queries. However this issue could be avoided with support for bucketed tables since the data would always be split into pre-defined # buckets post compaction.
@amoghmargoor above limitation is causing a lot of instability in the hive warehouse. Kindly check the possibility of supporting bucketed tables in MERGE.
The text was updated successfully, but these errors were encountered: