With the DirectRunner, the number of Parquet files created for each resource type used to equal the `parallelism` parameter passed in `FhirEtlOptions`. After the changes made in this PR, the number of Parquet files created equals the number of FHIR resources; for example, for a total of 12K Observations we now create 12K Parquet files. This can cause performance issues when reading the data back, and it has to be fixed.
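To make the difference in file counts concrete, here is a minimal sketch that is not taken from the pipeline code. It assumes each output shard becomes one Parquet file and that records are spread round-robin across shards; the function name `shard_counts` and the `PARALLELISM` value of 4 are hypothetical, and only the 12K Observation count comes from this issue.

```python
from collections import defaultdict


def shard_counts(num_records: int, num_shards: int) -> dict[int, int]:
    """Assign record indices to shards round-robin; return records per shard."""
    counts: dict[int, int] = defaultdict(int)
    for i in range(num_records):
        counts[i % num_shards] += 1
    return dict(counts)


NUM_OBSERVATIONS = 12_000  # figure quoted in this issue
PARALLELISM = 4            # hypothetical value, for illustration only

# Current behavior as described: one output file per FHIR resource.
per_resource = shard_counts(NUM_OBSERVATIONS, NUM_OBSERVATIONS)

# Earlier behavior as described: output files bounded by the parallelism setting.
bounded = shard_counts(NUM_OBSERVATIONS, PARALLELISM)

print(len(per_resource), "files, one record each")
print(len(bounded), "files,", bounded[0], "records in the first file")
```

Under these assumptions a reader opens 12,000 single-record files in the first case and 4 files in the second, which is where the read-side overhead would come from.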
bashir2 added the P1:must label (an issue that definitely needs to be implemented in the near future) and removed the P2:should label (an issue to be addressed in a quarter or so) on May 22, 2024.