Hubspot -> Postgres Not Syncing Incrementally #2028
Comments
@staufman thanks for reporting the issue! I believe this is correct, albeit confusing, behavior from the Hubspot source. The Hubspot source uses a cursor value (like ...). There are two potentially confusing parts here:
If you want a deduplicated view of your records based on their primary key, you'll need to create a view that dedupes on the primary key and maxes by
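For anyone following along, here is a rough sketch of what such a dedup view could look like. It uses Python's built-in sqlite3 module with an in-memory database so it runs anywhere. The table name `contacts_raw`, the view name `contacts_latest`, and the columns `email` and `emitted_at` are made up for illustration (only `vid` comes from this issue), so adapt them to whatever your destination schema actually contains.

```python
import sqlite3


def latest_snapshot(rows):
    """Return one row per primary key (vid), keeping the most recently emitted one.

    rows: iterable of (vid, email, emitted_at) tuples. All table, view and
    column names other than vid are hypothetical.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE contacts_raw (vid INTEGER, email TEXT, emitted_at INTEGER)"
    )
    conn.executemany("INSERT INTO contacts_raw VALUES (?, ?, ?)", rows)

    # The view joins each row against the newest emitted_at for its vid.
    # DISTINCT collapses rows that are exact duplicates of each other.
    conn.execute(
        """
        CREATE VIEW contacts_latest AS
        SELECT DISTINCT c.vid, c.email, c.emitted_at
        FROM contacts_raw AS c
        JOIN (
            SELECT vid, MAX(emitted_at) AS max_emitted_at
            FROM contacts_raw
            GROUP BY vid
        ) AS m
          ON c.vid = m.vid AND c.emitted_at = m.max_emitted_at
        """
    )
    result = conn.execute(
        "SELECT vid, email, emitted_at FROM contacts_latest ORDER BY vid"
    ).fetchall()
    conn.close()
    return result
```

The join against a grouped subquery is plain SQL, so the same view definition should carry over to Postgres with little or no change. If two rows for the same key share a timestamp but differ in other columns, both will still show up, so you may need an extra tie-breaker.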
I see. That makes sense. Given the data in an eventual consistency model can grow unbounded, do people typically write cleanup jobs to discard unneeded data? Also, I'm very much looking forward to this PR (#1491) being released. As it stands, to get the details of a Contact, it looks like we would need to join a
That is correct -- we are actively working on #1491 and hope to deliver it soon so you don't have to do this rigamarole :)
Yes, having a "vacuum" query that is run periodically is recommended. It might make sense for us to offer this at the end of each sync (or every N syncs) or as a separate "job" type in Airbyte, but it requires knowledge of the primary key, which is something we'll be working on in the near-ish future.
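As a rough illustration of such a periodic "vacuum", the sketch below deletes every row that has a newer row for the same primary key. This is not something Airbyte ships; it is only one way to do it by hand. It again uses sqlite3 so it is runnable as-is, and the table `contacts_raw` and column `emitted_at` are hypothetical names (only `vid` appears in this issue).

```python
import sqlite3


def vacuum_old_rows(conn):
    """Delete superseded rows, keeping the newest row(s) per vid.

    Assumes a hypothetical table contacts_raw(vid, email, emitted_at).
    Returns the number of rows deleted.
    """
    cur = conn.execute(
        """
        DELETE FROM contacts_raw
        WHERE emitted_at < (
            SELECT MAX(n.emitted_at)
            FROM contacts_raw AS n
            WHERE n.vid = contacts_raw.vid
        )
        """
    )
    conn.commit()
    return cur.rowcount


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE contacts_raw (vid INTEGER, email TEXT, emitted_at INTEGER)"
    )
    conn.executemany(
        "INSERT INTO contacts_raw VALUES (?, ?, ?)",
        [(1, "a@example.com", 100), (1, "a@example.com", 200), (2, "b@example.com", 100)],
    )
    print(vacuum_old_rows(conn), "row(s) removed")
```

Running the delete on a schedule (cron or similar) keeps the table from growing without bound. Rows that tie on the newest timestamp are left alone, so this does not remove exact duplicates.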
@staufman I added an example deduplication query in this doc under the "Getting the latest snapshot" section: https://docs.airbyte.io/architecture/incremental#getting-the-latest-snapshot-of-data
Awesome! Just a heads up that I had to write the query as follows for Postgres (also, I adapted it for my particular situation with contacts). Note: it has been a long time since I last wrote some SQL :)
Super nice!
Expected Behavior
I perform a sync from Hubspot -> Postgres. After it completes, I sync again without changing anything in Hubspot. In this case, I expect nothing new to be synced over.
Current Behavior
Instead, I am seeing duplicate Contact and Company records synced over.
Steps to Reproduce
vid or companyid, respectively.
Severity of the bug for you
High
Airbyte Version
VERSION=0.14.1-alpha
Additional context
N/A