You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: format/spec.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -221,7 +221,7 @@ Supported primitive types are defined in the table below. Primitive types added
221
221
||**`fixed(L)`**| Fixed-length byte array of length L ||
222
222
||**`binary`**| Arbitrary-length byte array ||
223
223
|[v3](#version-3)|**`geometry(C)`**| Geospatial features from [OGC – Simple feature access][1001]. Edge-interpolation is always linear/planar. See [Appendix G](#appendix-g-geospatial-notes). Parameterized by CRS C. If not specified, C is `OGC:CRS84`. ||
224
-
|[v3](#version-3)|**`geography(C, A)`**| Geospatial features from [OGC – Simple feature access][1001]. See [Appendix G](#appendix-g-geospatial-notes). Parameterized by CRS C and edge-interpolation algoritm A. If not specified, C is `OGC:CRS84` and A is `spherical`. |
224
+
|[v3](#version-3)|**`geography(C, A)`**| Geospatial features from [OGC – Simple feature access][1001]. See [Appendix G](#appendix-g-geospatial-notes). Parameterized by CRS C and edge-interpolation algorithm A. If not specified, C is `OGC:CRS84` and A is `spherical`. |
225
225
226
226
Notes:
227
227
@@ -391,7 +391,7 @@ The set of metadata columns is:
391
391
|**`2147483544 row`**|`struct<...>`| Deleted row values, used in position-based delete files |
392
392
|**`2147483543 _change_type`**|`string`| The record type in the changelog (INSERT, DELETE, UPDATE_BEFORE, or UPDATE_AFTER) |
393
393
|**`2147483542 _change_ordinal`**|`int`| The order of the change |
394
-
|**`2147483541 _commit_snapshot_id`**|`long`| The snapshot ID in which the change occured|
394
+
|**`2147483541 _commit_snapshot_id`**|`long`| The snapshot ID in which the change occurred|
395
395
|**`2147483540 _row_id`**|`long`| A unique long assigned for row lineage, see [Row Lineage](#row-lineage)|
396
396
|**`2147483539 _last_updated_sequence_number`**|`long`| The sequence number which last updated this row, see [Row Lineage](#row-lineage)|
397
397
@@ -832,7 +832,7 @@ The `first_row_id` for existing manifests must be preserved when writing a new m
832
832
833
833
The first manifest without a `first_row_id` is assigned a value that is greater than or equal to the `first_row_id` of the snapshot. Subsequent manifests without a `first_row_id` are assigned one based on the previous manifest to be assigned a `first_row_id`. Each assigned `first_row_id` must increase by the row count of all files that will be assigned a `first_row_id` via inheritance in the last assigned manifest. That is, each `first_row_id` must be greater than or equal to the last assigned `first_row_id` plus the total record count of data files with a null `first_row_id` in the last assigned manifest.
834
834
835
-
A simple and valid approach is to estimate the number of rows in data files that will be assigned a `first_row_id` using the the manifest's `added_rows_count` and `existing_rows_count`: `first_row_id = last_assigned.first_row_id + last_assigned.added_rows_count + last_assigned.existing_rows_count`.
835
+
A simple and valid approach is to estimate the number of rows in data files that will be assigned a `first_row_id` using the manifest's `added_rows_count` and `existing_rows_count`: `first_row_id = last_assigned.first_row_id + last_assigned.added_rows_count + last_assigned.existing_rows_count`.
836
836
837
837
### Scan Planning
838
838
@@ -880,7 +880,7 @@ Notes:
880
880
881
881
1. An alternative, *strict projection*, creates a partition predicate that will match a file if all of the rows in the file must match the scan predicate. These projections are used to calculate the residual predicates for each file in a scan.
882
882
2. For example, if `file_a` has rows with `id` between 1 and 10 and a delete file contains rows with `id` between 1 and 4, a scan for `id = 9` may ignore the delete file because none of the deletes can match a row that will be selected.
883
-
3. Floating point partition values are considered equal if their IEEE 754 floating-point "single format" bit layout are equal with NaNs normalized to have only the the most significant mantissa bit set (the equivalent of calling `Float.floatToIntBits` or `Double.doubleToLongBits` in Java). The Avro specification requires all floating point values to be encoded in this format.
883
+
3. Floating point partition values are considered equal if their IEEE 754 floating-point "single format" bit layout are equal with NaNs normalized to have only the most significant mantissa bit set (the equivalent of calling `Float.floatToIntBits` or `Double.doubleToLongBits` in Java). The Avro specification requires all floating point values to be encoded in this format.
884
884
4. Unknown partition transforms do not affect partition equality. Although partition fields with unknown transforms are ignored for filtering, the result of an unknown transform is still used when testing whether partition values are equal.
0 commit comments