This repository has been archived by the owner on Sep 6, 2023. It is now read-only.

Multi company parallel exports session queue #50

DuttaSoumya
Contributor

The first version of this pull request still starts as many table exports as the operating limits permit. If the table set is fairly large, or if too many companies are exported at the same time, not all exports are likely to run in the same batch. To minimize (not eliminate) this issue, a provision starts up the exports for those tables that were not activated because of the operating limits.
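
The behaviour described above can be sketched roughly as follows. This is an illustrative Python model only, not the extension's AL code: the `ExportScheduler` class, its `max_sessions` parameter (a stand-in for the operating limit), and the company and table names are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class ExportScheduler:
    """Toy model: start exports up to a session limit, queue the rest."""

    max_sessions: int  # hypothetical stand-in for the operating limit
    active: set = field(default_factory=set)
    pending: deque = field(default_factory=deque)

    def request(self, company: str, table: str) -> bool:
        """Start the export if a session is free; otherwise mark it pending."""
        job = (company, table)
        if len(self.active) < self.max_sessions:
            self.active.add(job)
            return True
        self.pending.append(job)
        return False

    def finish(self, company: str, table: str) -> None:
        """Release a session and start the oldest pending export, if any."""
        self.active.discard((company, table))
        if self.pending and len(self.active) < self.max_sessions:
            self.active.add(self.pending.popleft())


scheduler = ExportScheduler(max_sessions=2)
jobs = [("CRONUS", "Customer"), ("CRONUS", "Item"), ("Contoso", "Customer")]
started = [scheduler.request(company, table) for company, table in jobs]
# The third request exceeds the limit and waits in the pending queue.
scheduler.finish("CRONUS", "Customer")
# The freed session is handed to the pending ("Contoso", "Customer") export.
```

In this sketch a pending export is picked up only when another export finishes, which is one plausible reading of "minimize (not eliminate)": tables that miss the first batch still run, just later.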

@DuttaSoumya DuttaSoumya merged commit 2897803 into MultiCompanyParallelExports Oct 6, 2022
@DuttaSoumya DuttaSoumya deleted the MultiCompanyParallelExports-SessionQueue branch October 6, 2022 15:32
DuttaSoumya added a commit that referenced this pull request Oct 12, 2022
* Intermediate changes

* second draft

* Adding SessionID as a secondary key on the session table

* Working for simultaneous exports

* Code Cop

* Remove obsolete tags

* Documentation

* More documentation

* improve picture

* more fixes

* Multi company parallel exports session queue (#50)

* Changed field name

* Adding pending tables feature

* restoring the deleted fields as obsolete

* when data cannot be exported, JQ fails

Co-authored-by: Soumya Dutta <soudutta@microsoft.com>

* Elaborated on the usage of Multi-company export

* Made the execution logs viewable

* broke the comment line for readability

* Made the messaging more elaborate

* Do export for tables that failed last time, even though the data may not have changed since.

* Explain the concept when duplicate data has been exported

* Fix for entity JSONs not being written during export

* removing comments

Co-authored-by: Soumya Dutta <soudutta@microsoft.com>
HenriSchulte-MS added a commit that referenced this pull request Nov 2, 2022
Fixes irregularities in the system audit fields: (1) Very old records sometimes do not appear in the lake because their SystemCreatedAt field is null. This field is now artificially initialized to a date so that such records appear in the lake. (2) The SystemID field may be repeated across records belonging to different companies in the same table, so the uniqueness constraint has been fixed.

* Intermediate changes

* second draft

* Adding SessionID as a secondary key on the session table

* Working for simultaneous exports

* Code Cop

* Remove obsolete tags

* Documentation

* More documentation

* improve picture

* more fixes

* Multi company parallel exports session queue (#50)

* Changed field name

* Adding pending tables feature

* restoring the deleted fields as obsolete

* when data cannot be exported, JQ fails

Co-authored-by: Soumya Dutta <soudutta@microsoft.com>

* Elaborated on the usage of Multi-company export

* Made the execution logs viewable

* broke the comment line for readability

* Made the messaging more elaborate

* Do export for tables that failed last time, even though the data may not have changed since.

* Explain the concept when duplicate data has been exported

* Fix for entity JSONs not being written during export

* Handling the case when SystemCreatedAt is null for records.

* Documenting change

* Fixing the uniqueness criteria

* more readable

* Adding Company field to CurrentData as well, doc

* beautification of notable change

* Added derived column called UniqueRecordKey, Determine if entity is company specific

* Moved around expressions to eliminate SelectAllExceptUniqueRecordKeyColumn. Column selection is now done in sink. Does not affect behavior.

Co-authored-by: Soumya Dutta <soudutta@microsoft.com>
Co-authored-by: Henri Schulte <henri.schulte@microsoft.com>
DuttaSoumya added a commit that referenced this pull request Nov 18, 2022
* Intermediate changes

* second draft

* Adding SessionID as a secondary key on the session table

* Working for simultaneous exports

* Code Cop

* Remove obsolete tags

* Documentation

* More documentation

* improve picture

* more fixes

* Multi company parallel exports session queue (#50)

* Changed field name

* Adding pending tables feature

* restoring the deleted fields as obsolete

* when data cannot be exported, JQ fails

Co-authored-by: Soumya Dutta <soudutta@microsoft.com>

* Elaborated on the usage of Multi-company export

* Made the execution logs viewable

* broke the comment line for readability

* Made the messaging more elaborate

* Do export for tables that failed last time, even though the data may not have changed since.

* Explain the concept when duplicate data has been exported

* Fix for entity JSONs not being written during export

* Linked to the replica seeker

* Fixing App ID

* update docs

* Compilation warnings

Co-authored-by: Soumya Dutta <soudutta@microsoft.com>