This repository has been archived by the owner on May 18, 2022. It is now read-only.

Demo outline #21

Open
arnaudframmery opened this issue Mar 16, 2022 · 0 comments


On premises (small dataset):

  • Load a CSV as a dataframe (CsvReader.read)
  • Show columns and types (Discovery.discoverDataframe)
  • Show a dataframe slice (ListQuery.list)
  • Select some rows with a condition (AggregateQuery.aggregate)
  • Aggregate data (AggregateQuery.aggregate)
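
For rehearsing the on-premises flow before the project's own API is wired in, the five steps can be sketched with pandas. This is only a stand-in under stated assumptions: pandas replaces CsvReader, Discovery, ListQuery and AggregateQuery, whose signatures are not given in this issue, and the sample data, column names and the `run_on_premise_demo` helper are hypothetical.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for the small on-premises CSV.
CSV_TEXT = """city,year,sales
Paris,2021,10
Paris,2022,15
Lyon,2021,7
Lyon,2022,9
"""


def run_on_premise_demo(csv_source):
    """Walk through the five demo steps with pandas as a stand-in."""
    # 1. Load a CSV as a dataframe.
    df = pd.read_csv(csv_source)
    # 2. Show columns and types.
    schema = {name: str(dtype) for name, dtype in df.dtypes.items()}
    # 3. Show a dataframe slice (here: the first two rows).
    first_rows = df.head(2)
    # 4. Select some rows with a condition.
    filtered = df[df["year"] == 2022]
    # 5. Aggregate data (total sales per city).
    totals = df.groupby("city")["sales"].sum().to_dict()
    return schema, first_rows, filtered, totals


schema, first_rows, filtered, totals = run_on_premise_demo(io.StringIO(CSV_TEXT))
print(schema)
print(first_rows)
print(filtered)
print(totals)
```

Passing a file path instead of the `io.StringIO` buffer would run the same steps on a real file.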

On cluster (big dataset):

  • Show columns and types to check the connection (Discovery.discoverDataframe)
  • Perform a vector computation ()
  • Increase the worker number on the cluster to accelerate the computation (DatabricksManager.resize)
  • Perform a vector computation again ()
  • Reset the worker number (DatabricksManager.resize)
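
The resize steps could look roughly like the sketch below if DatabricksManager.resize ends up wrapping the Databricks Clusters REST endpoint `POST /api/2.0/clusters/resize`. That mapping is an assumption, not something this issue states. The helper name `build_resize_request` and the host, token and cluster ID are hypothetical placeholders. The request is only built here, not sent.

```python
import json
import urllib.request


def build_resize_request(host, token, cluster_id, num_workers):
    """Build (but do not send) a cluster resize request.

    Assumption: the resize goes through the Databricks Clusters API 2.0.
    """
    body = json.dumps(
        {"cluster_id": cluster_id, "num_workers": num_workers}
    ).encode("utf-8")
    return urllib.request.Request(
        url=f"{host}/api/2.0/clusters/resize",
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical values: scale up before the second computation, then reset.
HOST = "https://example.cloud.databricks.com"
scale_up = build_resize_request(HOST, "my-token", "my-cluster-id", 8)
reset = build_resize_request(HOST, "my-token", "my-cluster-id", 2)
```

Sending either request with `urllib.request.urlopen` would need a real workspace URL and a valid access token.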