This repository has been archived by the owner on May 18, 2022. It is now read-only.

Demo trame #21

Open

arnaudframmery opened this issue Mar 16, 2022 · 0 comments


arnaudframmery commented Mar 16, 2022

On-premises (small dataset):

Load a CSV as a dataframe (CsvReader.read)
Show columns and types (Discovery.discoverDataframe)
Show a dataframe slice (ListQuery.list)
Select some rows with a condition (AggregateQuery.aggregate)
Aggregate data (AggregateQuery.aggregate)
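
To make the on-premises sequence concrete, here is a minimal sketch of the same five steps using only the Python standard library. It does not call the project's `CsvReader`, `Discovery`, `ListQuery` or `AggregateQuery` APIs, whose signatures are not given in this issue. The function names and the sample data are hypothetical stand-ins chosen for illustration.

```python
import csv
import io

# Made-up sample data standing in for a small on-premises CSV file.
SAMPLE_CSV = """city,product,quantity,price
Paris,apple,3,1.5
Lyon,pear,5,2.0
Paris,pear,2,2.0
Lyon,apple,4,1.5
"""


def read_csv(text):
    """Load CSV text as a list of row dicts (all values are still strings)."""
    return list(csv.DictReader(io.StringIO(text)))


def infer_type(values):
    """Return the narrowest of int, float, str that parses every value."""
    for caster in (int, float):
        try:
            for v in values:
                caster(v)
            return caster
        except ValueError:
            continue
    return str


def discover(rows):
    """Map each column name to its inferred Python type."""
    return {col: infer_type([r[col] for r in rows]) for col in rows[0]}


def cast_rows(rows, schema):
    """Convert every value to the type inferred for its column."""
    return [{col: schema[col](r[col]) for col in r} for r in rows]


def aggregate(rows, key, value):
    """Sum `value` per distinct `key`."""
    totals = {}
    for r in rows:
        totals[r[key]] = totals.get(r[key], 0) + r[value]
    return totals


# 1. Load the CSV as an in-memory table.
raw = read_csv(SAMPLE_CSV)
# 2. Show columns and types.
schema = discover(raw)
table = cast_rows(raw, schema)
# 3. Show a slice of the table.
first_two = table[:2]
# 4. Select some rows with a condition.
paris_rows = [r for r in table if r["city"] == "Paris"]
# 5. Aggregate data.
quantity_by_city = aggregate(table, "city", "quantity")

print(schema)
print(first_two)
print(paris_rows)
print(quantity_by_city)
```

In the actual demo each numbered comment would map to the corresponding call listed above. The sketch only fixes the order of operations and the kind of output each step is expected to show.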

On cluster (big dataset):

Show columns and types to check the connection (Discovery.discoverDataframe)
Perform a vector computation ()
Increase the number of workers on the cluster to speed up the computation (DatabricksManager.resize)
Perform a vector computation again ()
Reset the number of workers (DatabricksManager.resize)
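
The compute, resize, compute again, reset sequence can be rehearsed locally before touching a cluster. The sketch below is an assumption-laden stand-in: a thread pool plays the role of the cluster, a chunked dot product plays the role of the vector computation, and changing `workers` plays the role of `DatabricksManager.resize`, whose real signature is not given in this issue. All names are hypothetical. Because CPython threads share one interpreter lock, the timings printed here will not necessarily improve with more workers; the point is the order of steps and the check that both runs return the same result.

```python
from concurrent.futures import ThreadPoolExecutor
import time


def chunked(n_items, n_chunks):
    """Split range(n_items) into contiguous (start, stop) bounds."""
    step = max(1, -(-n_items // n_chunks))  # ceiling division, at least 1
    return [(i, min(i + step, n_items)) for i in range(0, n_items, step)]


def partial_dot(a, b, bounds):
    """Dot product restricted to the slice described by `bounds`."""
    start, stop = bounds
    return sum(x * y for x, y in zip(a[start:stop], b[start:stop]))


def dot(a, b, workers):
    """Dot product computed chunk by chunk on a pool of `workers` threads."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(lambda bounds: partial_dot(a, b, bounds),
                         chunked(len(a), workers))
        return sum(parts)


def timed_dot(a, b, workers):
    """Return the dot product and the wall-clock time it took."""
    start = time.perf_counter()
    result = dot(a, b, workers)
    return result, time.perf_counter() - start


# Made-up vectors standing in for the big dataset.
a = list(range(100_000))
b = [2] * len(a)

DEFAULT_WORKERS = 2  # hypothetical initial cluster size

# First computation with the default number of workers.
workers = DEFAULT_WORKERS
baseline, t_small = timed_dot(a, b, workers)

# "Resize" up, then run the same computation again.
workers = 8
scaled, t_large = timed_dot(a, b, workers)

# Reset the number of workers once the demo is over.
workers = DEFAULT_WORKERS

print(f"{DEFAULT_WORKERS} workers: {t_small:.4f}s, 8 workers: {t_large:.4f}s")
```

Comparing `baseline` and `scaled` before presenting timings guards against a resize that silently changes the result.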
