All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Problem with pip and cryptography and updated packages' versions
- Enable multi-line option for append load
- Make init condensation optional, but true by default.
- Modify append load to support more complex partitioning strategies without file_regex
- Added support for configuring write load mode and num output files in append load
- Support for specifying the quote and escape characters. More info on how to specify those here: https://spark.apache.org/docs/latest/api/java/org/apache/spark/sql/DataFrameReader.html
- Parameter source_dir_suffix was added to enable us to read simultaneously from several nested "folders" inside data/. Ex: data/20200101/filename.parquet
- Support for multiple partition attributes (non date-derived) and single non date-derived partition attributes.