Fix the write-to-es feature #53821
Open
Owen-CH-Leung wants to merge 60 commits into apache:main from Owen-CH-Leung:fix-write-to-es-feature
Commits
2233ddc  Introduce ElasticsearchRemoteLogIO (Owen-CH-Leung)
9bf1cf4  Fix mypy error. Fix ruff format error. Remove test_write_to_es. Fix b… (Owen-CH-Leung)
68c5546  Fix mypy error. Fix ruff format error. Fix spelling mistakes (Owen-CH-Leung)
b407075  Fix ruff check (Owen-CH-Leung)
65f78c5  Fix ruff check on testEsTaskhandler (Owen-CH-Leung)
960166b  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
3e59474  Add ElasticsearchRemoteIO into TaskHandler. Refactor to handle read/w… (Owen-CH-Leung)
a60a315  Fix existing unit tests (Owen-CH-Leung)
aff9a30  Add unit test for ElasticsearchRemoteIO. Add testcontainer as new dep… (Owen-CH-Leung)
b1cb760  Fix provider test (Owen-CH-Leung)
f3ce8e7  Fix typo (Owen-CH-Leung)
d28d40a  Fix long running provider test (Owen-CH-Leung)
998aeee  Remove leftover comments. Move testcontainer dependency to devel-common (Owen-CH-Leung)
c17147d  Patch REMOTE_TASK_LOG for airflow 3. Return message / event field dep… (Owen-CH-Leung)
3c6d652  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
9e815bd  Use _render_log_id for both reading & writing remote log (Owen-CH-Leung)
a74d7dc  use hasattr to return hit field or message field, instead of relying … (Owen-CH-Leung)
f1562c0  Move concat_logs under ESTaskHandler. Revise sample_log_response to h… (Owen-CH-Leung)
a7f6dd5  Rebase (Owen-CH-Leung)
4c3a303  Fix ruff. get log id before creating runtime instance (Owen-CH-Leung)
136ee8f  Fix failing CI (Owen-CH-Leung)
d86b92a  Use global session instead of creating a new session (Owen-CH-Leung)
19a264c  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
2fe3f6a  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
b0aede8  Do not fetch log_id_template from DB. Get it from config instead. (Owen-CH-Leung)
98af510  Remove DB session. Fix compatibility CI (Owen-CH-Leung)
d456f2c  fix getting log_id_template_from config (Owen-CH-Leung)
fbdc3c6  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
f850dd8  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
1c797d1  fix timeout error when connecting to elasticsearch (Owen-CH-Leung)
f4a02dd  Revert "fix timeout error when connecting to elasticsearch" (Owen-CH-Leung)
f9f359a  Set higher memory for testcontainer (Owen-CH-Leung)
a01e7ad  Do not refresh all index. Extend timeout to 30 seconds (Owen-CH-Leung)
6dad32c  Prove es to be write-ready before starting the test (Owen-CH-Leung)
4ae74d5  Revert "Prove es to be write-ready before starting the test" (Owen-CH-Leung)
e8bd766  Use fileLock to prevent contention (Owen-CH-Leung)
6cc9cf8  Increase client timeout to 60s with 5 retry (Owen-CH-Leung)
9b2e05c  Set TESTCONTAINERS_HOST_OVERRIDE to host.docker.internal (Owen-CH-Leung)
d2808a5  Revert "Set TESTCONTAINERS_HOST_OVERRIDE to host.docker.internal" (Owen-CH-Leung)
52fcfd4  Add BulkIndexError to debug timeout (Owen-CH-Leung)
ce4a161  Fix mypy. Add TESTCONTAINER hostname override in docker compose (Owen-CH-Leung)
f3ff086  revert docker-compose changes. Add mem_reservation for testcontainer. (Owen-CH-Leung)
f0a893e  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
059906e  Increase timeout to 120. Disable unnecessary modules of ES (Owen-CH-Leung)
132922d  Extend pytest execution and setup timeout to 300 (Owen-CH-Leung)
f822f86  Combine read write es into one test. Pre-create index and wait for sh… (Owen-CH-Leung)
e4731ed  Print container stats and indices to debug (Owen-CH-Leung)
aec7374  Disable disk-based allocation of shard (Owen-CH-Leung)
1ad3376  Remove debug lines (Owen-CH-Leung)
e598b4e  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
0711079  Merge branch 'apache:main' into fix-write-to-es-feature (Owen-CH-Leung)
b5d7e01  Fix infinite read (Owen-CH-Leung)
765e7bc  Fix failing CI (Owen-CH-Leung)
18d39a6  Fix failing CI (Owen-CH-Leung)
3694219  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
49fca58  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
d9b2444  Update default helm value file (Owen-CH-Leung)
dcc8926  Merge branch 'main' into fix-write-to-es-feature (Owen-CH-Leung)
6a0e355  Fix failing ruff (Owen-CH-Leung)
4b648e4  Merge branch 'main' into fix-write-to-es-feature (jason810496)
Is this a breaking change for the chart?
cc @jedcunningham
Yes, but we no longer have an execution date.
So maybe the chart needs to choose the default template string based on whether it is deploying Airflow 2 or Airflow 3?
Can someone refresh my memory: does this config item control writing, reading, or both?
In other words, if the chart default changes, or varies between 2.x and 3.x, is there a risk that valid logs written by Airflow 2.x tasks won't be readable once a stack moves to Airflow 3.x?
(I think I recall that the task run table has a column for its log_id template, so the correct log_id can always be reconstructed for any task run. However, I also remember somebody mentioning that this column might be removed. I just don't remember the exact details here.)
Anyway, earlier logs vanishing from the UI because of a changed chart default would be an uncomfortable surprise 😅
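To make the concern above concrete, here is a minimal sketch of what could go wrong if the same template is used for both writing and reading and the configured value later changes. Everything in it is an assumption for illustration: the two template strings, the `render_log_id` helper, and the in-memory `store` dict standing in for an Elasticsearch index are hypothetical and are not taken from this PR or from the chart.

```python
# Hypothetical templates, for illustration only (not the chart's real defaults).
OLD_TEMPLATE = "{dag_id}-{task_id}-{execution_date}-{try_number}"
NEW_TEMPLATE = "{dag_id}-{task_id}-{run_id}-{map_index}-{try_number}"


def render_log_id(template: str, **fields) -> str:
    """Render a log_id from a template; unused fields are simply ignored."""
    return template.format(**fields)


# Fields describing one task try (made-up values).
fields = {
    "dag_id": "example_dag",
    "task_id": "extract",
    "execution_date": "2024-01-01T00:00:00+00:00",
    "run_id": "scheduled__2024-01-01T00:00:00+00:00",
    "map_index": -1,
    "try_number": 1,
}

# In-memory stand-in for a log index keyed by log_id.
store: dict[str, list[str]] = {}

# Write path: the log is stored under the id rendered from the old template.
written_id = render_log_id(OLD_TEMPLATE, **fields)
store[written_id] = ["task started", "task finished"]

# Read path with the same template: the log is found.
read_same = store.get(render_log_id(OLD_TEMPLATE, **fields))

# Read path after the configured template changed: the lookup misses,
# even though the log is still sitting in the store.
read_changed = store.get(render_log_id(NEW_TEMPLATE, **fields))

print(read_same)
print(read_changed)
```

In this sketch the second lookup misses only because the reader renders a different id than the writer did. Whether the real handler behaves this way depends on whether the per-run template is still recoverable at read time, which is exactly the open question in the comment above.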