-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Only write airbyte messages to duckdb on read
#38647
Conversation
The latest updates on your projects. Learn more about Vercel for Git ↗︎ 1 Ignored Deployment
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just added a note on that @alafanechere - I downgraded to 10.0.2 and it didn't fix the problem perhaps a bigger downgrade will work. Checking. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🙏 Can you please bump the package version?
@@ -168,7 +168,7 @@ async def _run( | |||
http_dump=await self.http_proxy.retrieve_http_dump() if self.http_proxy else None, | |||
executed_container=executed_container, | |||
) | |||
await execution_result.save_artifacts(self.output_dir, self.duckdb_path) | |||
await execution_result.save_artifacts(self.output_dir, self.duckdb_path if airbyte_command == "read" else None) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add a mapping like:
persist_command_to_duck_db = {
"read": True,
"write": True,
"spec": False,
"discover": False
}
await execution_result.save_artifacts(self.output_dir, self.duckdb_path if airbyte_command == "read" else None) | |
await execution_result.save_artifacts(self.output_dir, self.duckdb_path if persist_command_to_duck_db[airbyte_command] else None) |
With a comment saying why we disable persistance on other commands?
@alafanechere downgrading to 0.10.1 fixed the issue so I think that's the way to go here. Sorry for the back and forth. |
85fa9d0
to
11d9b7e
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Cool! Definitely a preferable fix!
Weird that a minor version bump on duckdb causes problems for us. What’s
the plan long-term?
Also, Augustin, why introduce a mapping when we only need to check for one
specific operation?
|
This fixes an issue that @askarpets encountered during regression test runs for multiple connection IDs for
source-pinterest
(report here).I was able to reproduce the issue locally.
I'm still not entirely sure why this is just coming up now for source-pinterest, but don't see a major advantage to storing the results for non-
read
commands in duckdb so am proposing this change. I haven't tried settingignore_errors=true
because it didn't feel like quite the right thing to do e.g. duringread
, but am open to other opinions.Note: I found this issue that may be related, but rolling back the version didn't help.