[Bug]: MAJOR Optimizing is running repeatedly #1924

celltobig · 2023-09-06T03:28:10Z

What happened?

In the absence of the need for optimizing, a Iceberg format table are still undergoing major optimizing repeatedly.

The table have enable full optimizing with configuration:

'self-optimizing.full.trigger.interval'='86400000'

Affects Versions

master

What engines are you seeing the problem on?

AMS, Optimizer

How to reproduce

Create a Iceberg v2 partition table
Set set table property 'self-optimizing.full.trigger.interval'='86400000'
Insert overwrite some data into one partition.

CREATE TABLE spark_catalog.dl_ods.ods_iceberg_t1 (
channel_id INT,
label STRING,
price_sign BIGINT,
item_id BIGINT,
item_type INT,
is_maintain INT,
cur_name STRING,
adjust_code STRING,
platform_id BIGINT,
business_id BIGINT NOT NULL,
price DECIMAL(20,4),
price_type_name STRING,
price_type_id BIGINT,
price_type_code STRING,
sku_id BIGINT,
goods_no STRING,
spu_id BIGINT,
store_id BIGINT,
gmt_update TIMESTAMP,
gmt_create TIMESTAMP,
id BIGINT NOT NULL,
store_status STRING,
store_no STRING,
com_id STRING,
store_name STRING)
USING iceberg
PARTITIONED BY (business_id)
LOCATION 'hdfs://xxxxx/user/hive/warehouse/datalake/dl_ods/ods_iceberg_t1'
TBLPROPERTIES(
'clean-independent-delete-files.enabled' = 'true',
'clean-orphan-file.enabled' = 'true',
'clean-orphan-file.min-existing-time-minutes' = '1440',
'current-snapshot-id' = '5211867258629833319',
'engine.hive.enabled' = 'true',
'flink.max-continuous-empty-commits' = '2147483647',
'format' = 'iceberg/parquet',
'format-version' = '2',
'identifier-fields' = '[id,business_id]',
'self-optimizing.enabled' = 'true',
'self-optimizing.full.trigger.interval' = '-1',
'self-optimizing.group' = 'external-group',
'self-optimizing.quota' = '0.1',
'snapshot.base.keep.minutes' = '60',
'table-expire.enabled' = 'true',
'write.distribution-mode' = 'hash',
'write.metadata.delete-after-commit.enabled' = 'true',
'write.metadata.previous-versions-max' = '1',
'write.upsert.enabled' = 'true')
;

Relevant log output

No response

Anything else

No response

Are you willing to submit a PR?

Yes I am willing to submit a PR!

Code of Conduct

I agree to follow this project's Code of Conduct

The text was updated successfully, but these errors were encountered:

wangtaohz · 2023-09-08T07:53:23Z

Thanks for your report! I will add this issue to the roadmap for version 0.5.1 and look forward to your PR.👍

celltobig added the type:bug Something isn't working label Sep 6, 2023

wangtaohz mentioned this issue Sep 8, 2023

Release-0.5.1 roadmap #1930

Closed

56 tasks

wangtaohz added the priority:major label Sep 8, 2023

wangtaohz mentioned this issue Sep 13, 2023

[AMORO-1924]Fix Iceberg tables undergoing major optimizing repeatedly #1976

Merged

3 tasks

wangtaohz changed the title ~~[Bug]: MAJOR Optimizing is running~~ [Bug]: MAJOR Optimizing is running repeatedly Sep 13, 2023

zhoujinsong closed this as completed in #1976 Sep 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: MAJOR Optimizing is running repeatedly #1924

[Bug]: MAJOR Optimizing is running repeatedly #1924

celltobig commented Sep 6, 2023 •

edited by zhoujinsong

Loading

wangtaohz commented Sep 8, 2023

[Bug]: MAJOR Optimizing is running repeatedly #1924

[Bug]: MAJOR Optimizing is running repeatedly #1924

Comments

celltobig commented Sep 6, 2023 • edited by zhoujinsong Loading

What happened?

Affects Versions

What engines are you seeing the problem on?

How to reproduce

Relevant log output

Anything else

Are you willing to submit a PR?

Code of Conduct

wangtaohz commented Sep 8, 2023

celltobig commented Sep 6, 2023 •

edited by zhoujinsong

Loading