Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

rptest/datalake: test metadata interoperability with 3rd party system #24643

Open
wants to merge 7 commits into
base: dev
Choose a base branch
from

Conversation

nvartolomei
Copy link
Contributor

@nvartolomei nvartolomei commented Dec 23, 2024

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v24.3.x
  • v24.2.x
  • v24.1.x

Release Notes

  • none

@nvartolomei nvartolomei force-pushed the nv/iceberg-3rdparty-rewrite branch 2 times, most recently from 6f1f148 to 6e607dd Compare December 23, 2024 20:21
@vbotbuildovich
Copy link
Collaborator

Retry command for Build#60089

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/enterprise_features_license_test.py::EnterpriseFeaturesTest.test_enable_features@{"disable_trial":false,"feature":5,"install_license":true}

@vbotbuildovich
Copy link
Collaborator

vbotbuildovich commented Dec 23, 2024

CI test results

test results on build#60089
test_id test_kind job_url test_status passed
rptest.tests.datalake.partition_movement_test.PartitionMovementTest.test_cross_core_movements.cloud_storage_type=CloudStorageType.S3 ducktape https://buildkite.com/redpanda/redpanda/builds/60089#0193f576-ce17-455b-91e6-2f265aaa9185 FLAKY 2/6
rptest.tests.enterprise_features_license_test.EnterpriseFeaturesTest.test_enable_features.feature=Feature.oidc.install_license=True.disable_trial=False ducktape https://buildkite.com/redpanda/redpanda/builds/60089#0193f576-ce16-4642-b44b-4eb53b3c9ee5 FAIL 0/1
test results on build#60100
test_id test_kind job_url test_status passed
rptest.tests.datalake.partition_movement_test.PartitionMovementTest.test_cross_core_movements.cloud_storage_type=CloudStorageType.S3 ducktape https://buildkite.com/redpanda/redpanda/builds/60100#0193f642-b1a2-4bf2-af5d-e55ed5e9b6f4 FLAKY 1/6
test results on build#60178
test_id test_kind job_url test_status passed
rptest.tests.datalake.partition_movement_test.PartitionMovementTest.test_cross_core_movements.cloud_storage_type=CloudStorageType.S3 ducktape https://buildkite.com/redpanda/redpanda/builds/60178#01941785-f08b-45dd-85b4-2851b2d2bb86 FLAKY 2/6
rptest.tests.e2e_topic_recovery_test.EndToEndTopicRecovery.test_restore_with_aborted_tx.recovery_overrides=.retention.local.target.bytes.1024.redpanda.remote.write.True.redpanda.remote.read.True.cloud_storage_type=CloudStorageType.ABS ducktape https://buildkite.com/redpanda/redpanda/builds/60178#01941785-f089-4e4a-9d39-60eb4a8babe0 FAIL 0/1

@nvartolomei nvartolomei force-pushed the nv/iceberg-3rdparty-rewrite branch 2 times, most recently from 1292bb1 to 6e1e926 Compare December 24, 2024 00:06
Comment on lines +155 to +169
def run_sample_maintenance_task(self, namespace, table) -> None:
# Metadata query
# https://iceberg.apache.org/docs/1.6.1/spark-queries/#files
initial_parquet_files = self.run_query_fetch_one(
f"SELECT count(*) FROM {namespace}.{table}.files")[0]

# Want at least 2 files to be able to assert that optimization did something.
assert initial_parquet_files >= 2, f"Expecting at least 2 files, got {initial_parquet_files}"

# Spark Procedures provided by Iceberg SQL Extensions
# https://iceberg.apache.org/docs/1.6.1/spark-procedures/#rewrite_data_files
self.run_query_fetch_one(
f"CALL `redpanda-iceberg-catalog`.system.rewrite_data_files(\"{namespace}.{table}\")"
)

optimized_parquet_files = self.run_query_fetch_one(
f"SELECT count(*) FROM {namespace}.{table}.files")[0]
assert optimized_parquet_files < initial_parquet_files, f"Expecting fewer files after optimize, got {optimized_parquet_files}"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO these service classes should only hold operations, not validations. To that end, can we split this into some count_parquet_files() and optimize_parquet_files()? There are many different kinds of maintenance https://iceberg.apache.org/docs/1.5.1/maintenance/ and it'll be much easier to reuse if we're more descriptive and have smaller, tighter primitives.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@andrwng I see what you mean but here I wanted to give more freedom to these maintenance tasks implementation, i.e. it might as well be a metadata only maintenance. I documented the expectations in the code doc

def run_sample_maintenance_task(self, namespace, table) -> None:

optimize_parquet_files and count_parquet_files would work for the 2 engines we have now, do you reckon all of them expose the same primitives so that we could have a single test for all of them?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree that run_sample_maintenance_task() seems a bit vague, whereas the iceberg docs Andrew provided has a concrete list of tasks we could enumerate, these being:

Maybe these are the primitives we should be working with and leave the assertions to the user.

tests/rptest/tests/datalake/query_engine_base.py Outdated Show resolved Hide resolved
tests/rptest/tests/datalake/3rdparty_maintenance_test.py Outdated Show resolved Hide resolved
@nvartolomei nvartolomei force-pushed the nv/iceberg-3rdparty-rewrite branch 2 times, most recently from 4b21a17 to 92f74aa Compare December 30, 2024 10:39
@vbotbuildovich
Copy link
Collaborator

Retry command for Build#60178

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/e2e_topic_recovery_test.py::EndToEndTopicRecovery.test_restore_with_aborted_tx@{"cloud_storage_type":2,"recovery_overrides":{"redpanda.remote.read":true,"redpanda.remote.write":true,"retention.local.target.bytes":1024}}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants