What is a Dataset? #226
ghalimi started this conversation in Documentation
A Dataset is a data source that has been loaded through the Data Integration Engine and imported into a table. It is registered on the Data Catalog and stored in the Object Store as partitions.

A partitioned dataset usually has a single partition index, but some datasets are stored using multiple partition indexes. In that case, the dataset is replicated on the Object Store, with one replica per partition index. Such datasets are uniquely identified on the Data Catalog using the [...]

Related Topic: What is the difference between a Data Source, a Dataset, and a Master Table?
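To make the replication rule above concrete, here is a minimal Python sketch of "one replica per partition index". It is only an illustration, not how the Data Integration Engine or the Object Store actually work: it assumes a partition index is a column whose values decide which partition a record lands in, and the names `partition_records` and `replicate` and the sample records are all hypothetical.

```python
from collections import defaultdict


def partition_records(records, index_column):
    """Group records into partitions keyed by the value of index_column.

    Assumption: a partition index is a column name; the post does not
    say how partitions are actually keyed.
    """
    partitions = defaultdict(list)
    for record in records:
        partitions[record[index_column]].append(record)
    return dict(partitions)


def replicate(records, partition_indexes):
    """Build one full replica of the dataset for every partition index."""
    return {
        index: partition_records(records, index)
        for index in partition_indexes
    }


# Hypothetical sample data, for illustration only.
records = [
    {"country": "FR", "year": 2020, "value": 1},
    {"country": "FR", "year": 2021, "value": 2},
    {"country": "US", "year": 2020, "value": 3},
]

# Two partition indexes give two replicas, each holding every record.
replicas = replicate(records, ["country", "year"])
```

Each replica contains the whole dataset, only grouped differently, which is why storage grows with the number of partition indexes.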
Definition
Dataset