Clarify documentation about gathering statistics for parquet files (#16157)
* Improve CREATE EXTERNAL TABLE documentation, add note about statistics
* Add comments to SessionContext::read_parquet and register_parquet
* fmt
* Apply suggestions from code review
Co-authored-by: Oleks V <comphead@users.noreply.github.com>
Note that this statement actually reads data from a fixed-size file, so a better example would read from a FIFO file. Nevertheless, once DataFusion sees the `UNBOUNDED` keyword in a data source, it tries to execute queries that refer to that source in a streaming fashion. If the query cannot be executed this way, plan generation fails with an error stating that the query cannot be executed in a streaming fashion. Queries that can run with unbounded sources (i.e. in streaming mode) are a subset of those that can run with bounded sources, so a query that fails with unbounded source(s) may still work with bounded source(s).
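As a rough illustration of the distinction described above, the sketch below declares an unbounded source backed by a FIFO and contrasts two queries. The path `/tmp/events.fifo`, the table name `events`, and its columns are hypothetical, and the exact error text DataFusion reports for the second query is not shown because it may differ between versions.

```sql
-- Hypothetical FIFO path and column names, for illustration only.
CREATE UNBOUNDED EXTERNAL TABLE events (
    ts BIGINT,
    value DOUBLE
)
STORED AS CSV
LOCATION '/tmp/events.fifo';

-- A row-at-a-time filter and projection can be evaluated incrementally,
-- so it is the kind of query that can run in streaming mode.
SELECT ts, value FROM events WHERE value > 10;

-- A full sort on a column with no declared order needs all of its input
-- before it can emit anything, so planning is expected to fail for an
-- unbounded source. The same query should work if the table is bounded.
SELECT ts, value FROM events ORDER BY value;
```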
+### Example: `WITH ORDER` Clause
+
When creating an output from a data source that is already ordered by
an expression, you can pre-specify the order of the data using the
`WITH ORDER` clause. This applies even if the expression used for
@@ -190,7 +219,7 @@ WITH ORDER (sort_expression1 [ASC | DESC] [NULLS { FIRST | LAST }]
[, sort_expression2 [ASC | DESC] [NULLS { FIRST | LAST }] ...])
```
-### Cautions when using the WITH ORDER Clause
+#### Cautions when using the WITH ORDER Clause
- It's important to understand that using the `WITH ORDER` clause in the `CREATE EXTERNAL TABLE` statement only specifies the order in which the data should be read from the external file. If the data in the file is not already sorted according to the specified order, then the results may not be correct.
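
To make the caution concrete, here is a minimal sketch of a table declared with `WITH ORDER`. The table name, columns, and file path are hypothetical. It assumes the file is already sorted by `ts` ascending; DataFusion is told this and does not verify it, so if the file were not sorted this way, queries that rely on the declared order could return incorrect results.

```sql
-- Hypothetical table, columns, and path, for illustration only.
-- Assumption: the CSV file is already sorted by ts in ascending order.
CREATE EXTERNAL TABLE readings (
    ts BIGINT,
    sensor_id INT,
    value DOUBLE
)
STORED AS CSV
WITH ORDER (ts ASC)
LOCATION '/path/to/readings_sorted_by_ts.csv';

-- The planner may use the declared order to skip a sort here.
-- That shortcut is only valid if the file really is sorted by ts.
SELECT ts, value FROM readings ORDER BY ts ASC LIMIT 10;
```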