[Final] More examples to highlight new 0.5 features. #241

imarios · 2018-01-31T04:55:32Z

@OlivierBlanvillain, this is ready for review, but I might be adding a few more examples here (hence the WIP).

codecov-io · 2018-01-31T05:39:20Z

Codecov Report

❗ No coverage uploaded for pull request base (master@dfd224a).
The diff coverage is n/a.

@@            Coverage Diff            @@
##             master     #241   +/-   ##
=========================================
  Coverage          ?   96.56%           
=========================================
  Files             ?       51           
  Lines             ?      874           
  Branches          ?       11           
=========================================
  Hits              ?      844           
  Misses            ?       30           
  Partials          ?        0

| Impacted Files | Coverage Δ |
|---|---|
| ...ataset/src/main/scala/frameless/TypedEncoder.scala | `100% <ø> (ø)` |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update dfd224a...b8e343d.

imarios · 2018-02-07T16:57:01Z

Hi @OlivierBlanvillain this is ready for review. Thanks!

frosforever · 2018-02-10T15:30:12Z

docs/src/main/tut/FeatureOverview.md

+Only column types that can be sorted are allowed to be selected for sorting. 
+
+```tut:book
+aptTypedDs.orderBy(aptTypedDs('city).asc).show(2).run()


Default ordering as asc is also supported as of #236, so this should be the same as:

aptTypedDs.orderBy(aptTypedDs('city)).show(2).run()

imarios · Feb 10, 2018

I had it like that, but the implicit with poly didn’t work in tut. Not sure why... I will give it another shot.

frosforever · Feb 12, 2018

Hmm, you're right. It seems that explicitly calling `apply` with the column type works, but otherwise it fails to infer the sort. E.g.:

aptTypedDs.orderBy(aptTypedDs[String]('city)).show(2).run()

That's fairly annoying.
What's interesting is that orderByMany works without that.

aptTypedDs.orderByMany(aptTypedDs('city)).show(2).run()

It's not immediately obvious to me why the implicit isn't getting picked up. I'll try to look into it shortly.
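
For readers following along, the three call shapes reported as working in this thread can be put side by side. This is only a minimal sketch, assuming Frameless 0.5 (Scala 2.11/2.12, where symbol literals like `'city` are still idiomatic) and a local `SparkSession`. The `Apartment` case class, its sample rows, and the `OrderByForms` object are hypothetical stand-ins for the dataset used in `FeatureOverview.md`; the three `orderBy`/`orderByMany` calls are copied from the comments above. The bare `orderBy(aptTypedDs('city))` form is left out because the thread reports that it fails to infer the sort.

```scala
import org.apache.spark.sql.SparkSession
import frameless.TypedDataset
import frameless.syntax._ // default SparkDelay (Job), so that .run() is available

// Hypothetical schema, modeled on the apartment example from the docs.
case class Apartment(city: String, surface: Int, price: Double, bedrooms: Int)

object OrderByForms {
  def main(args: Array[String]): Unit = {
    // Assumption: a throwaway local session is good enough for a demo.
    implicit val spark: SparkSession = SparkSession.builder()
      .master("local[*]")
      .appName("orderBy-forms")
      .getOrCreate()

    // Made-up sample rows, for illustration only.
    val aptTypedDs: TypedDataset[Apartment] = TypedDataset.create(Seq(
      Apartment("Paris", 50, 300000.0, 2),
      Apartment("Lyon", 83, 200000.0, 2),
      Apartment("Nice", 74, 325000.0, 3)
    ))

    // 1. Explicit sort direction on the column.
    aptTypedDs.orderBy(aptTypedDs('city).asc).show(2).run()

    // 2. Default (ascending) ordering, with the column type spelled out.
    aptTypedDs.orderBy(aptTypedDs[String]('city)).show(2).run()

    // 3. orderByMany, which per the thread needs no explicit column type.
    aptTypedDs.orderByMany(aptTypedDs('city)).show(2).run()

    spark.stop()
  }
}
```

All three calls are expected to sort by `city` in ascending order; they differ only in how much help the compiler needs to resolve the sort.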

OlivierBlanvillain

Thanks @imarios, this is very nicely done!

OlivierBlanvillain · 2018-02-04T08:40:54Z

docs/src/main/tut/FeatureOverview.md

+
+The union of `aptTypedDs2` with `aptTypedDs` uses all the fields of the caller (`aptTypedDs2`)
+and expects the other (`aptTypedDs`) dataset to include all those fields. 
+If field names/types do not much you get a compilation error. 


do not match

OlivierBlanvillain · 2018-02-04T08:41:19Z

docs/src/main/tut/FeatureOverview.md

+```
+
+The union of `aptTypedDs2` with `aptTypedDs` uses all the fields of the caller (`aptTypedDs2`)
+and expects the other (`aptTypedDs`) dataset to include all those fields. 


the other dataset (aptTypedDs) to ...

OlivierBlanvillain · 2018-02-04T08:44:24Z

docs/src/main/tut/FeatureOverview.md

+
+Frameless supports many of Spark's functions and transformations. 
+However, whenever a Spark function does not exist in Frameless, 
+calling the `.dataset` will take you back to vanilla `Dataset` where


will expose the underlying Dataset (from org.apache.spark.sql, the original Spark APIs), where you can use anything that would be missing from the Frameless API.
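
To make the suggested wording concrete, here is a minimal sketch of the round trip through `.dataset`, assuming Frameless 0.5 and a local `SparkSession`. The `Apartment` case class, the sample rows, and the `DatasetEscapeHatch` object are hypothetical. `describe` and the string-based `filter` were picked only as examples of Spark's own `Dataset` methods; this is not a claim about what Frameless does or does not cover.

```scala
import org.apache.spark.sql.{Dataset, SparkSession}
import frameless.TypedDataset
import frameless.syntax._ // default SparkDelay (Job), so that .run() is available

// Hypothetical schema, modeled on the apartment example from the docs.
case class Apartment(city: String, surface: Int, price: Double, bedrooms: Int)

object DatasetEscapeHatch {
  def main(args: Array[String]): Unit = {
    // Assumption: a throwaway local session is good enough for a demo.
    implicit val spark: SparkSession = SparkSession.builder()
      .master("local[*]")
      .appName("dataset-escape-hatch")
      .getOrCreate()

    // Made-up sample rows, for illustration only.
    val aptTypedDs: TypedDataset[Apartment] = TypedDataset.create(Seq(
      Apartment("Paris", 50, 300000.0, 2),
      Apartment("Lyon", 83, 200000.0, 2),
      Apartment("Nice", 74, 325000.0, 3)
    ))

    // .dataset exposes the underlying org.apache.spark.sql.Dataset.
    val sparkDs: Dataset[Apartment] = aptTypedDs.dataset

    // From here on this is the plain Spark API, without Frameless' compile-time checks.
    sparkDs.describe("price").show()
    val filtered: Dataset[Apartment] = sparkDs.filter("bedrooms >= 2")

    // Wrap the result again to get back to the typed API.
    val backToTyped: TypedDataset[Apartment] = TypedDataset.create(filtered)
    println(backToTyped.count().run())

    spark.stop()
  }
}
```

The string column names and the filter expression are only checked at runtime, which is the trade-off of leaving the typed API.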

OlivierBlanvillain · 2018-02-11T11:07:02Z

docs/src/main/tut/FeatureOverview.md

+```
+
+A simple way to add a column without loosing important schema information is
+to project the entire source schema into a single column. 


We should maybe note that this is something new that is not part of the vanilla APIs.

OlivierBlanvillain · 2018-02-11T11:09:26Z

docs/src/main/tut/TypedDatasetVsSparkDataset.md


+## Aggregate vs Projected columns 
+
+Vanilla `Datasets` do not distinguish between columns created from aggregate operations, 


I would call them "Spark's Datasets" in the docs.

imarios · 2018-02-12T03:32:22Z

Thanks for all the help guys! @OlivierBlanvillain @frosforever
Olivier, can you take one last look at the edits you suggested? Feel free to squash and merge if you feel everything is good. Thanks!

... don't forget the squash, I have a ton of useless commits here :)

OlivierBlanvillain · 2018-02-12T13:10:37Z

LGTM 👍

imarios added this to the 0.5-release milestone Jan 31, 2018

imarios force-pushed the update_docs_for_0.5 branch 2 times, most recently from 9daa4d1 to 7668fbd Compare February 7, 2018 16:56

imarios changed the title ~~[WIP] More examples to highlight new 0.5 features.~~ [Final] More examples to highlight new 0.5 features. Feb 7, 2018

imarios requested a review from OlivierBlanvillain February 7, 2018 16:56

imarios force-pushed the update_docs_for_0.5 branch from 7668fbd to d037624 Compare February 9, 2018 04:18

frosforever reviewed Feb 10, 2018

OlivierBlanvillain approved these changes Feb 11, 2018

imarios added 7 commits February 11, 2018 19:15

More examples to highlight new 0.5 features.

21d0878

Minor revision. Add a when/otherwise example.

fa95cc6

Revising text and adding more examples.

e469ea1

Adding docs for asCol.

0bd8527

More examples comparing Frameless with Vanilla Spark.

0c478df

Sorting columns example

8e078a0

Addressing review comments.

b8e343d

imarios force-pushed the update_docs_for_0.5 branch from a524664 to b8e343d Compare February 12, 2018 03:30

imarios merged commit 968492a into typelevel:master Feb 13, 2018

