feat(csv): treat omitted values with no defaults as null in csv #691

wolffcm · 2019-01-08T20:10:11Z

Fixes #632.

Done checklist

docs/SPEC.md updated
Test cases written

jsternberg · 2019-01-08T20:16:15Z

execute/executetest/table.go

-				panic(fmt.Errorf("invalid value: %s", t.KeyValues[j]))
+			var v values.Value
+			if t.KeyValues[j] == nil {
+				v = values.NewNull(flux.SemanticType(t.ColMeta[idx].Type))


Is there a way to make it so values.New(nil) works instead of adding a new function?

I thought of that too, but a values.Value has to have a type.

We want null values to have a type, so I don't see a way to do it just by passing nil. If New accepted a pointer to a value, then it would be possible by passing a nil pointer... but that seems kind of hacky to me.
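To illustrate the point made in this thread, here is a minimal sketch of why an untyped nil cannot yield a typed null. The `Type` and `Value` types below are simplified, hypothetical stand-ins and not the real `semantic.Type` or `values.Value`. Only the names `New` and `NewNull` come from the diff.

```go
package main

import "fmt"

// Type is a hypothetical stand-in for semantic.Type.
type Type int

const (
	Invalid Type = iota
	Int
	String
)

// Value is a simplified, hypothetical stand-in for values.Value.
type Value struct {
	typ Type
	v   interface{} // nil means the value is null
}

func (v Value) Type() Type   { return v.typ }
func (v Value) IsNull() bool { return v.v == nil }

// New infers the type from the Go value it is given. An untyped nil
// carries no type information, so the best it can do is Invalid.
func New(x interface{}) Value {
	switch x.(type) {
	case int64:
		return Value{typ: Int, v: x}
	case string:
		return Value{typ: String, v: x}
	default:
		return Value{typ: Invalid}
	}
}

// NewNull receives the type explicitly, so the null still has a type.
func NewNull(t Type) Value {
	return Value{typ: t}
}

func main() {
	fmt.Println(New(nil).Type() == Invalid) // true
	n := NewNull(Int)
	fmt.Println(n.IsNull(), n.Type() == Int) // true true
}
```

In this sketch a separate constructor is the simplest way to keep the column type attached to a null without changing what `New` accepts.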

nathanielc

LGTM with one note about how null and default should work together.

nathanielc · 2019-01-08T20:14:40Z

csv/result.go

@@ -397,6 +397,8 @@ func readMetadata(r *csv.Reader, c ResultDecoderConfig, extraLine []string) (tab
 				return tableMetadata{}, errors.Wrapf(err, "column %q has invalid default value", label)
 			}
 			defaultValues[j] = v
+		} else {
+			defaultValues[j] = values.NewNull(flux.SemanticType(cols[j].ColMeta.Type))


Can you add a constant for the nullValue and set it to "" for now? We want to make it clear how the null value interacts with the default value. The if statement above it would become a switch statement with three cases:

switch defaults[j] {
case nullValue:
	// Set explicit null as default
	defaultValues[j] = values.NewNull(flux.SemanticType(cols[j].ColMeta.Type))
case "":
	// Do nothing there is no default
default:
	v, err := decodeValue(defaults[j], cols[j])
	// ...
}

Technically the second case will not be possible until nullValue is dynamic. But it makes it clear that it's still possible to not have a default value.

I can't write the code like you've done it here because Go complains about duplicate cases, but I will add nullValue and comments to this effect.
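For reference, one way to keep the three-way intent without a switch is a chain of comparisons. While `nullValue` is the constant `""`, a switch containing both `case nullValue:` and `case "":` is rejected by the Go compiler as a duplicate case, but ordinary `if` comparisons are not. This is only a sketch: `resolveDefault` and `cell` are hypothetical names, and integer parsing stands in for the real `decodeValue`.

```go
package main

import (
	"fmt"
	"strconv"
)

// nullValue is the annotation text that stands for null ("" for now).
const nullValue = ""

// cell is a hypothetical decoded value: either null or an int64.
type cell struct {
	null bool
	i    int64
}

// resolveDefault is a hypothetical helper mirroring the three cases
// proposed in the review. The order of the checks encodes precedence.
func resolveDefault(s string) (c cell, hasDefault bool, err error) {
	if s == nullValue {
		// Explicit null as the default.
		return cell{null: true}, true, nil
	}
	if s == "" {
		// No default at all. Unreachable while nullValue == "".
		return cell{}, false, nil
	}
	i, err := strconv.ParseInt(s, 10, 64)
	if err != nil {
		return cell{}, false, fmt.Errorf("invalid default value %q: %w", s, err)
	}
	return cell{i: i}, true, nil
}

func main() {
	c, ok, err := resolveDefault("")
	fmt.Println(c.null, ok, err == nil)

	c, ok, err = resolveDefault("42")
	fmt.Println(c.i, ok, err == nil)
}
```

If `nullValue` later becomes configurable, the second branch becomes reachable with no further changes to the structure.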

nathanielc · 2019-01-08T20:24:10Z

csv/result.go

@@ -1104,23 +1104,38 @@ func encodeValue(value values.Value, c colMeta) (string, error) {
 	}
 }

-func encodeValueFrom(i, j int, c colMeta, cr flux.ColReader) (string, error) {
+func encodeValueFrom(i, j int, c colMeta, cr flux.ArrowColReader) (string, error) {
+	var v string


We should use the nullValue constant here too
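A rough sketch of what using the same constant on the encoding side could look like follows. The `intColumn` type and `encodeIntAt` function are hypothetical and do not reflect the actual `flux.ArrowColReader` API. The idea is only that a null cell is written as `nullValue`, so the reader and the writer agree on a single representation.

```go
package main

import (
	"fmt"
	"strconv"
)

// nullValue is the text written for a null cell ("" for now).
const nullValue = ""

// intColumn is a hypothetical stand-in for one integer column of a
// column reader, with a validity flag per row.
type intColumn struct {
	vals  []int64
	valid []bool
}

// encodeIntAt returns the CSV text for row i, using nullValue for nulls.
func encodeIntAt(col intColumn, i int) string {
	if !col.valid[i] {
		return nullValue
	}
	return strconv.FormatInt(col.vals[i], 10)
}

func main() {
	col := intColumn{vals: []int64{7, 0}, valid: []bool{true, false}}
	fmt.Printf("%q %q\n", encodeIntAt(col, 0), encodeIntAt(col, 1)) // "7" ""
}
```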

aanthony1243

Looks good. I have two comments about how we might enhance this further to make the code more robust, but they shouldn't block this PR.

aanthony1243 · 2019-01-08T20:26:56Z

execute/table.go

@@ -1192,6 +1192,10 @@ func (b *ColListTableBuilder) SetValue(i, j int, v values.Value) error {
 }

 func (b *ColListTableBuilder) AppendValue(j int, v values.Value) error {
+	if v.IsNull() {


This is so useful for safety that I want to propose making AppendBool, AppendInt, etc. all private functions, and forcing us to use AppendValue(j, values.New(false)), etc.

I suspect we'll still want to expose the methods that accept raw types for performance reasons, since values.Value uses interface{} and reflection under the hood. Were it not for this I would totally agree with you.
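The trade-off discussed here can be sketched with a single-column builder. Everything below is hypothetical (`value`, `intBuilder`, and its methods are illustrative names, not the `ColListTableBuilder` API): the generic `AppendValue` path checks for null before dispatching, while the raw `AppendInt` path skips the wrapper and therefore cannot express a null.

```go
package main

import "fmt"

// value is a hypothetical boxed value that may be null.
type value struct {
	null bool
	i    int64
}

// intBuilder is a hypothetical single-column builder.
type intBuilder struct {
	vals  []int64
	valid []bool
}

// AppendInt is the raw, typed path: no boxing, but no way to pass a null.
func (b *intBuilder) AppendInt(v int64) {
	b.vals = append(b.vals, v)
	b.valid = append(b.valid, true)
}

// AppendNil records a null slot.
func (b *intBuilder) AppendNil() {
	b.vals = append(b.vals, 0)
	b.valid = append(b.valid, false)
}

// AppendValue is the generic path: it checks for null first, then
// delegates to the typed method.
func (b *intBuilder) AppendValue(v value) {
	if v.null {
		b.AppendNil()
		return
	}
	b.AppendInt(v.i)
}

func main() {
	var b intBuilder
	b.AppendInt(1)
	b.AppendValue(value{null: true})
	b.AppendValue(value{i: 3})
	fmt.Println(b.vals, b.valid) // [1 0 3] [true false true]
}
```

Keeping both paths public, as suggested in the reply, leaves null handling to callers of the typed methods. Making them private would centralize it at the cost of boxing every value.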

aanthony1243 · 2019-01-08T20:34:06Z

values/values.go

@@ -148,6 +152,13 @@ func New(v interface{}) Value {
 	}
 }

+func NewNull(t semantic.Type) Value {


This is great. I'm getting closer to voting that we only allow values.Value types to be added to table builders, to make sure we don't have any errors of omission re: nulls.

nathanielc

Looks good. When we add support for the null annotation we can revisit some of these edge cases.

wolffcm requested review from jsternberg and nathanielc January 8, 2019 20:10

jsternberg approved these changes Jan 8, 2019

nathanielc reviewed Jan 8, 2019

aanthony1243 reviewed Jan 8, 2019

feat(csv): treat omitted values with no defaults as nil in csv

9caa2bc

Fixes #632.

wolffcm force-pushed the feat/csv-nulls branch from b6db038 to 9caa2bc Compare January 8, 2019 21:36

nathanielc approved these changes Jan 8, 2019

wolffcm merged commit 9fb276b into master Jan 8, 2019

wolffcm deleted the feat/csv-nulls branch January 8, 2019 21:41
