Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Arrow2 test fix #1733

Merged
merged 31 commits into from
Feb 8, 2022
Merged

Arrow2 test fix #1733

merged 31 commits into from
Feb 8, 2022

Commits on Jan 27, 2022

  1. Configuration menu
    Copy the full SHA
    7b8d72c View commit details
    Browse the repository at this point in the history

Commits on Jan 28, 2022

  1. Configuration menu
    Copy the full SHA
    18ced8d View commit details
    Browse the repository at this point in the history
  2. Move information_schema tests out of execution/context.rs to `sql_i…

    …ntegration` tests (apache#1684)
    
    * Move tests from context.rs to information_schema.rs
    
    * Fix up tests to compile
    alamb authored Jan 28, 2022
    Configuration menu
    Copy the full SHA
    ed1de63 View commit details
    Browse the repository at this point in the history
  3. Move timestamp related tests out of context.rs and into sql integrati…

    …on test (apache#1696)
    
    * Move some tests out of context.rs and into sql
    
    * Move support test out of context.rs and into sql tests
    
    * Fixup tests and make them compile
    alamb authored Jan 28, 2022
    Configuration menu
    Copy the full SHA
    ab145c8 View commit details
    Browse the repository at this point in the history

Commits on Jan 29, 2022

  1. Fix parquet projection

    Igosuki committed Jan 29, 2022
    Configuration menu
    Copy the full SHA
    39632dd View commit details
    Browse the repository at this point in the history
  2. fix pruning casting

    Igosuki committed Jan 29, 2022
    Configuration menu
    Copy the full SHA
    a34213e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    530f4f4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b95044e View commit details
    Browse the repository at this point in the history
  5. Add MemTrackingMetrics to ease memory tracking for non-limited memo…

    …ry consumers (apache#1691)
    
    * Memory manager no longer track consumers, update aggregatedMetricsSet
    
    * Easy memory tracking with metrics
    
    * use tracking metrics in SPMS
    
    * tests
    
    * fix
    
    * doc
    
    * Update datafusion/src/physical_plan/sorts/sort.rs
    
    Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
    
    * make tracker AtomicUsize
    
    Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
    yjshen and alamb authored Jan 29, 2022
    Configuration menu
    Copy the full SHA
    641338f View commit details
    Browse the repository at this point in the history

Commits on Jan 30, 2022

  1. Implement TableProvider for DataFrameImpl (apache#1699)

    * Add TableProvider impl for DataFrameImpl
    
    * Add physical plan in
    
    * Clean up plan construction and names construction
    
    * Remove duplicate comments
    
    * Remove unused parameter
    
    * Add test
    
    * Remove duplicate limit comment
    
    * Use cloned instead of individual clone
    
    * Reduce the amount of code to get a schema
    
    Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
    
    * Add comments to test
    
    * Fix plan comparison
    
    * Compare only the results of execution
    
    * Remove println
    
    * Refer to df_impl instead of table in test
    
    Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
    
    * Fix the register_table test to use the correct result set for comparison
    
    * Consolidate group/agg exprs
    
    * Format
    
    * Remove outdated comment
    
    Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
    cpcloud and alamb authored Jan 30, 2022
    Configuration menu
    Copy the full SHA
    0d6d1ce View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    75c7578 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    a7f0156 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    fecce97 View commit details
    Browse the repository at this point in the history
  5. Incorporate dyn scalar kernels (apache#1685)

    * Rebase
    
    * impl ToNumeric for ScalarValue
    
    * Update macro to be based on
    
    * Add floats
    
    * Cleanup
    
    * Newline
    matthewmturner authored Jan 30, 2022
    Configuration menu
    Copy the full SHA
    3494e9c View commit details
    Browse the repository at this point in the history

Commits on Jan 31, 2022

  1. Configuration menu
    Copy the full SHA
    2512608 View commit details
    Browse the repository at this point in the history
  2. Support create_physical_expr and ExecutionContextState or `Defaul…

    …tPhysicalPlanner` for faster speed (apache#1700)
    
    * Change physical_expr creation API
    
    * Refactor API usage to avoid creating ExecutionContextState
    
    * Fixup ballista
    
    * clippy!
    alamb authored Jan 31, 2022
    Configuration menu
    Copy the full SHA
    1caf52a View commit details
    Browse the repository at this point in the history
  3. Fix can not load parquet table form spark in datafusion-cli. (apache#…

    …1665)
    
    * fix can not load parquet table form spark
    
    * add Invalid file in log.
    
    * fix fmt
    Ted-Jiang authored Jan 31, 2022
    Configuration menu
    Copy the full SHA
    f849968 View commit details
    Browse the repository at this point in the history
  4. add upper bound for pub fn (apache#1713)

    Signed-off-by: remzi <13716567376yh@gmail.com>
    HaoYang670 authored Jan 31, 2022
    Configuration menu
    Copy the full SHA
    d01d8d5 View commit details
    Browse the repository at this point in the history
  5. Create SchemaAdapter trait to map table schema to file schemas (apach…

    …e#1709)
    
    * Create SchemaAdapter trait to map table schema to file schemas
    
    * Linting fix
    
    * Remove commented code
    thinkharderdev authored Jan 31, 2022
    Configuration menu
    Copy the full SHA
    7bec762 View commit details
    Browse the repository at this point in the history
  6. approx_quantile() aggregation function (apache#1539)

    * feat: implement TDigest for approx quantile
    
    Adds a [TDigest] implementation providing approximate quantile
    estimations of large inputs using a small amount of (bounded) memory.
    
    A TDigest is most accurate near either "end" of the quantile range (that
    is, 0.1, 0.9, 0.95, etc) due to the use of a scalaing function that
    increases resolution at the tails. The paper claims single digit part
    per million errors for q ≤ 0.001 or q ≥ 0.999 using 100 centroids, and
    in practice I have found accuracy to be more than acceptable for an
    apprixmate function across the entire quantile range.
    
    The implementation is a modified copy of
    https://github.com/MnO2/t-digest, itself a Rust port of [Facebook's C++
    implementation]. Both Facebook's implementation, and Mn02's Rust port
    are Apache 2.0 licensed.
    
    [TDigest]: https://arxiv.org/abs/1902.04023
    [Facebook's C++ implementation]: https://github.com/facebook/folly/blob/main/folly/stats/TDigest.h
    
    * feat: approx_quantile aggregation
    
    Adds the ApproxQuantile physical expression, plumbing & test cases.
    
    The function signature is:
    
    	approx_quantile(column, quantile)
    
    Where column can be any numeric type (that can be cast to a float64) and
    quantile is a float64 literal between 0 and 1.
    
    * feat: approx_quantile dataframe function
    
    Adds the approx_quantile() dataframe function, and exports it in the
    prelude.
    
    * refactor: bastilla approx_quantile support
    
    Adds bastilla wire encoding for approx_quantile.
    
    Adding support for this required modifying the AggregateExprNode proto
    message to support propigating multiple LogicalExprNode aggregate
    arguments - all the existing aggregations take a single argument, so
    this wasn't needed before.
    
    This commit adds "repeated" to the expr field, which I believe is
    backwards compatible as described here:
    
    	https://developers.google.com/protocol-buffers/docs/proto3#updating
    
    Specifically, adding "repeated" to an existing message field:
    
    	"For ... message fields, optional is compatible with repeated"
    
    No existing tests needed fixing, and a new roundtrip test is included
    that covers the change to allow multiple expr.
    
    * refactor: use input type as return type
    
    Casts the calculated quantile value to the same type as the input data.
    
    * fixup! refactor: bastilla approx_quantile support
    
    * refactor: rebase onto main
    
    * refactor: validate quantile value
    
    Ensures the quantile values is between 0 and 1, emitting a plan error if
    not.
    
    * refactor: rename to approx_percentile_cont
    
    * refactor: clippy lints
    domodwyer authored Jan 31, 2022
    Configuration menu
    Copy the full SHA
    cfb655d View commit details
    Browse the repository at this point in the history
  7. suppport bitwise and as an example (apache#1653)

    * suppport bitwise and as an example
    
    * Use $OP in macro rather than `&`
    
    * fix: change signature to &dyn Array
    
    * fmt
    
    Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
    liukun4515 and alamb authored Jan 31, 2022
    Configuration menu
    Copy the full SHA
    940d4eb View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    b6ace16 View commit details
    Browse the repository at this point in the history

Commits on Feb 1, 2022

  1. Configuration menu
    Copy the full SHA
    bacf10d View commit details
    Browse the repository at this point in the history
  2. Convert boolean case expressions to boolean logic (apache#1719)

    * Convert boolean case expressions to boolean logic
    
    * Review feedback
    tustvold authored Feb 1, 2022
    Configuration menu
    Copy the full SHA
    b9a8f15 View commit details
    Browse the repository at this point in the history

Commits on Feb 2, 2022

  1. substitute parking_lot::Mutex for std::sync::Mutex (apache#1720)

    * Substitute parking_lot::Mutex for std::sync::Mutex
    
    * enable parking_lot feature in tokio
    xudong963 authored Feb 2, 2022
    Configuration menu
    Copy the full SHA
    46879f1 View commit details
    Browse the repository at this point in the history
  2. Add Expression Simplification API (apache#1717)

    * Add Expression Simplification API
    
    * fmt
    alamb authored Feb 2, 2022
    Configuration menu
    Copy the full SHA
    e4a056f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    469731b View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5ad5f7c View commit details
    Browse the repository at this point in the history

Commits on Feb 3, 2022

  1. Configuration menu
    Copy the full SHA
    b8f9bc2 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    80078b5 View commit details
    Browse the repository at this point in the history
  3. fix parquet file format adapted projection by providing the proper sc…

    …hema to the RecordBatch
    Igosuki committed Feb 3, 2022
    Configuration menu
    Copy the full SHA
    f2debbb View commit details
    Browse the repository at this point in the history