
Proposal: Make floats non-NaN by default #11234

Open
topolarity opened this issue Mar 19, 2022 · 21 comments
Labels
proposal This issue suggests modifications. If it also has the "accepted" label then it is planned.

@topolarity (Contributor) commented Mar 19, 2022

Introduction

The idea behind this proposal comes from observing that NaN has some surprising commonalities with null pointers:

  1. It represents an invalid value using a specialized bit sequence
  2. Some functions expect to receive this invalid value, others assume they do not (for optimal performance)

In combination with the arithmetic and comparison behavior of NaN, these similarities lead to a number of footguns in real-life code.

Examples include:

  • sorting algorithms failing on NaN data
  • every NaN colliding in a hash map
  • NaNs propagating virally through streaming outputs
  • invalid image filtering when operating on NaNs
  • NaN values persisting after filtering
  • parsers failing on NaN inputs
  • formatting/display unintentionally exposing NaN to the user

Footguns abound when there is disagreement about whether NaN needs to be handled correctly.
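The comparison semantics behind these footguns can be demonstrated in a few lines of Python (whose float is an IEEE 754 double, so the same rules apply as for f32):

```python
import math

nan = float("nan")

# Every ordered comparison involving NaN is false...
print(nan > 1.0)   # False
print(nan < 1.0)   # False
print(nan == nan)  # False
# ...except !=, which is true. This breaks any code that infers
# "a <= b" from "not (a > b)", e.g. comparison-based sorting.
print(nan != nan)  # True
print(math.isnan(nan))  # True
```

This is exactly the disagreement the proposal is about: functions that never check isnan silently assume no NaN can reach these comparisons.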

Proposal

Option A: Replace f32 with error{NaN}!f32

  • All floating point operations (+-*/%) yield error{NaN}!f32
  • Arithmetic is overloaded on error{NaN}!f32
  • error{NaN}!f32 can be unwrapped with try, catch, and if like any other error union
  • Comparison of f32 yields bool. Comparison of error{NaN}!f32 yields error{NaNOperand}!bool

Other error unions, such as error{Foo}!f32, are not treated specially (no arithmetic, no special layout, etc.).

"NaN-boxing" is to be supported via getNaNPayload and setNaNPayload

Option B: Make comparisons of floats return error{NaNOperand}!bool

This is a minimal change to the language that would force users to explicitly account for NaN in floating point comparisons, which is the central oversight in the bugs mentioned above.

API Impacts

This means that "nan-safe" functions can be given a type that reflects their special handling of NaN. Meanwhile, highly-optimized routines that don't handle NaN correctly can be given a type that reflects their assumptions:

// Returns median, ignoring any NaN values
pub fn median(in: []const error{NaN}!f32) f32 { ... }
// Assumes that inputs do not include NaN
pub fn convolve(a: []const f32, b: []const f32,  out: []f32) { ... }

Example

/// Insertion sort. NaN values are sorted to the end
fn sort_inplace(vec: []error{NaN}!f32) void {
    for (vec) |maybe_key, i| {
        // If maybe_key is NaN, treat it as greater than everything (i.e. don't move it)
        if (maybe_key) |key| {
            var j = i;
            // `error{NaN}!f32` forces us to explicitly handle the NaN case here:
            // if vec[j - 1] is NaN, `catch true` treats it as greater
            while (j > 0 and ((vec[j - 1] > key) catch true)) {
                vec[j] = vec[j - 1];
                j -= 1;
            }
            vec[j] = key;
        } else |_| {}
    }
}

Meanwhile, the code for a non-NaN-safe version of this function would look exactly like it does today.

Supplemental Ideas

These related ideas can be accepted/rejected separately from the main proposal:

  1. Size Optimization for ?f32: Define ?f32 to be stored in an ordinary float, reserving a NaN payload with a special value to encode null. This is similar to R's "NA" value, except that ?f32 would not support arithmetic or comparison (except with null), so NA/NaN propagation is not an issue. It behaves like any other optional.

  2. @assertFinite/@assertNonNaN built-ins: The UB-introducing @setFloatMode(.Optimized) assumptions are that inputs/outputs are non-Inf and non-NaN. All other fast-math optimization flags make a different performance/accuracy trade-off, but do not directly introduce poison/undefined into the program. @assertFinite would allow the programmer to make these dangerous assumptions explicit in their code, where it's obvious exactly which operands are affected.

(1) can be particularly important for performance when operating on large, structured data, since it affects how many values fit into a cache line. This is why the technique is common in statistical software, including R and Pandas.
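The encoding in (1) can be sketched as follows. This is a hypothetical scheme (the specific sentinel bit pattern is an arbitrary choice for illustration), shown for 64-bit floats in Python:

```python
import struct

# One reserved quiet-NaN bit pattern acts as `null`; every other
# bit pattern, including ordinary NaNs, means "a float is present".
NULL_BITS = 0x7FF8_0000_0000_0001  # hypothetical reserved payload

def wrap(x):
    # Build a "?f64": None becomes the sentinel NaN, floats pass through.
    if x is None:
        return struct.unpack("<d", struct.pack("<Q", NULL_BITS))[0]
    return x

def unwrap(x):
    # Recover float-or-None by comparing raw bits against the sentinel.
    bits = struct.unpack("<Q", struct.pack("<d", x))[0]
    return None if bits == NULL_BITS else x

print(unwrap(wrap(None)))     # None
print(unwrap(wrap(3.5)))      # 3.5
print(struct.calcsize("<d"))  # 8: the optional costs no extra bytes
```

The size win is the last line: the optional occupies exactly one float, instead of a float plus a tag (plus padding).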

Edit: Updated 4/11 to use error{NaN}!f32 instead of ?f32 + add supplemental ideas

@andrewrk added the proposal label on Mar 19, 2022
@andrewrk added this to the 0.10.0 milestone on Mar 19, 2022
@andrewrk (Member)

Can you address the following operations of IEEE 754 floats?

  • -Infinity + Infinity produces NaN
  • Infinity - Infinity produces NaN
  • Infinity * 0 produces NaN
  • 0 / 0 produces NaN

(Thanks @rtfeldman for pointing these out to me.) How do these operations fit into your vision?
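For reference, the first three of these are easy to reproduce in Python (IEEE 754 doubles); the last differs only because Python raises ZeroDivisionError for 0.0 / 0.0 instead of returning the NaN that hardware produces:

```python
import math

inf = math.inf

print(math.isnan(-inf + inf))  # True
print(math.isnan(inf - inf))   # True
print(math.isnan(inf * 0.0))   # True
# 0.0 / 0.0 also yields NaN under IEEE 754 default exception handling,
# though Python itself raises ZeroDivisionError rather than returning it.
```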

It looks like you are saying:

Safety-checked UB to observe or produce a NaN

What might that look like in machine code for each of these operations? Can we estimate whether such safety checks in debug builds will be reasonable, as they are with overflow arithmetic, or whether they might be debilitatingly slow?

@mrakh (Contributor) commented Mar 20, 2022

Can you address the following operations of IEEE 754 floats?

  • -Infinity + Infinity produces NaN
  • Infinity - Infinity produces NaN
  • Infinity * 0 produces NaN
  • 0 / 0 produces NaN

(Thanks @rtfeldman for pointing these out to me.) How do these operations fit into your vision?

One possibility would be to always return an optional float for all floating-point operations, whether the operands themselves are optional or not. Then, a coercion from ?fN to fN could invoke safety-checked undefined behavior when the value is NaN.

It looks like you are saying:

Safety-checked UB to observe or produce a NaN

What might that look like in machine code for each of these operations? Can we estimate whether such safety checks in debug builds will be reasonable, as they are with overflow arithmetic, or whether they might be debilitatingly slow?

x64 and arm64 will appropriately set a flag register if a floating-point compare operand is NaN. From there, you should be able to conditionally branch on that flag, for a total of two instructions for a NaN check. It looks like x64's ucomiss has ~7 clock cycles of latency on modern AMD chips and ~3 on modern Intel chips, and that arm64's fcmp has 2 clock cycles of latency on the Apple M1 Firestorm. My guess is that Broadcom ARM chips will have similar latencies.

@MasonRemaley (Contributor) commented Mar 20, 2022

I like the idea of encoding NaNs in the type system!

I have a reservation about the UB: in something very dynamic like a physics engine for a game, it's very tough to be 100% certain that there's no way to push the system past its limits and get a NaN (likely by first getting an infinity). Because of how NaNs propagate, this kind of glitch tends to be contained to a single object in the game, but if unexpectedly getting a NaN is UB, it becomes a much more serious concern.

I'd propose having debug builds check fXs for NaNs, but release builds treat fX and ?fX identically.

@topolarity (Contributor, Author) commented Mar 20, 2022

Can you address the following operations of IEEE 754 floats?

  • -Infinity + Infinity produces NaN
  • Infinity - Infinity produces NaN
  • Infinity * 0 produces NaN
  • 0 / 0 produces NaN

(Thanks @rtfeldman for pointing these out to me.) How do these operations fit into your vision?

Great question.

My original thinking was that "responsible code" would check for these cases pre-emptively. However, I'm starting to think that panicking upon producing NaN is the wrong way to go.

Here's why: it can be more performant to propagate invalid values through a computation and check the result at the end, rather than to pre-emptively avoid generating them. For example:

fn sum(vec: []const f32) f32 {
    var accum = vec[0];
    for (vec[1..]) |x| {
        accum += x;
    }
    return accum;
}

According to the original proposal, this would panic if vec contained an Infinity followed by a -Infinity. Adding an if (...) inside the hot loop to avoid generating a NaN would hurt performance in the well-behaved case, but leaving it out means letting NaN leak into non-NaN f32 values. The user is also unlikely to ever hit this panic during testing.

The right choice is to make accum an ?f32 and then handle the invalid value at the end (or return the ?f32 directly), and I believe this is the pattern intended by IEEE 754.
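The pattern can be sketched in Python, with a hypothetical sum_floats standing in for the Zig sum above and the deferred NaN check made explicit (under the amendment, the return type ?f32 would force this check on the caller instead):

```python
import math

def sum_floats(vec):
    # Accumulate with no per-element checks; a NaN produced mid-loop
    # (e.g. by Inf + -Inf) simply propagates to the final result.
    accum = vec[0]
    for x in vec[1:]:
        accum += x
    # The single "?f32"-style check, done once at the end.
    return None if math.isnan(accum) else accum

print(sum_floats([1.0, 2.0, 3.0]))             # 6.0
print(sum_floats([1.0, math.inf, -math.inf]))  # None: Inf + -Inf produced NaN
```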

Amendment

I'd like to amend the proposal so that all floating point operations return ?f32, as @mrakh suggested. For ergonomics, we'd also want to consider supporting the standard comparison operators on ?f32, which would return a ?bool (null when either argument is null). Finally, conversion from ?f32 to f32 should be explicit, like any other optional.

This would achieve what the original proposal intended: it prevents invalid comparisons with NaN and unintentional storage of NaN, which are the primary footguns responsible for the unexpected behavior mentioned in the introduction.

Finally, it's still worth adding safety panics upon generating NaN or Inf in blocks with @setFloatMode(.Optimized), since this mode implies -ffinite-math-only. This can be done by checking the FPU status bits as described above, or alternatively by enabling traps for floating point exceptions (these are typically masked by default).

@topolarity (Contributor, Author)

@MasonRemaley I believe your concern is addressed by the latest amendment, since floating point optionals would have the same safety checks as other optionals in Zig (only .? panics, and only in safe builds). Let me know if you have any concerns about the new design.

@topolarity (Contributor, Author) commented Mar 21, 2022

Another thought occurred to me: .? coercions can be used to mark operations as nnan for LLVM optimizations, indicating that the operation can be assumed not to yield NaN. If we add another built-in @assertFinite(?f32) f32, this can be used to mark operations as both ninf and nnan in Release builds.

For example:

var x: f32 = 1.5;
var accum: f32 = 0;
accum = (accum + x).?; // implies `nnan`
accum = @assertFinite(accum + x); // implies `nnan` and `ninf`

could be lowered to the following LLVM IR:

%1 = load float, float* %accum, align 4, !dbg !2882
%2 = load float, float* %x, align 4, !dbg !2883
%3 = fadd nnan float %1, %2, !dbg !2884      ; optimizations enabled by nnan
...
%6 = fadd nnan ninf float %4, %5, !dbg !2887 ; optimizations enabled by nnan and ninf
store float %6, float* %accum, align 4, !dbg !2887

With these assumptions made explicit, @setFloatMode could be used just to enable/disable the other fast-math flags, which do not make assumptions about the range of operands or results or introduce LLVM poison to the program.

Existing -ffast-math-like optimizations would be achieved by liberal use of @assertFinite and @setFloatMode(.Optimized).

@SpexGuy (Contributor) commented Mar 21, 2022

I think encoding nan-ability in the type system is an interesting idea, but making it the null value of an optional feels very wrong to me. For one, there are many NaN values, and IEEE 754 defines rules for how the bits which can vary propagate through operations. This can be used in a technique called "NaN packing" or "NaN boxing" to associate extra data with NaN values, which can be used to indicate where they were first discovered or other attributes. Additionally, many CPUs designate one of these bits to indicate "signaling NaN" values, which will generate a CPU exception when used. None of that fits into this model.

So that would instead leave us with the fXX family of IEEE 754 floats, and an additional family of eXX or xXX "fast finite floats". Those would obey the following rules:

  • eXX values cannot take on NaN
  • If an eXX value is created from a representation that would give it a non-finite value, it instead becomes undefined
  • eXX coerces to fXX
  • fXX can be converted to eXX with the @assertFinite builtin

@rohlem (Contributor) commented Mar 21, 2022

I think the original post gives good arguments why NaN should be considered in function signatures.
I also think we should consider the full potential scope of this.

Maybe obvious/unnecessary, but to distinguish floating-point number types, here's everything one _could_ expect from them:
  • representing actual numbers:
    • normalized representation
    • denormalized representation around 0
    • representing 0 (optionally with sign attached)
  • representing signed infinities
  • representing non-number bit patterns in NaN (one bit meaning quiet vs signalling)

I think it's reasonable to understand a floating-point number as a union of these states (let me know if there's more still). Constructing and destructuring could be provided by a library, though since floats are language-provided, std or compiler builtins would also be reasonable places imo.

Edit: Adding this paragraph because I'm unsure whether my main point came across well enough:
I'd advocate to have distinct floating-point types that:

  • may (not) represent infinity,
  • may (not) represent NaN (with/-out payload),
  • may (not) represent signed zero,
  • (potentially?) may (not) be subnormal,
  • (potentially?) may (not) be normal.

If we expose all of these combinations of options in std.builtin.TypeInfo.Float (for querying via @typeInfo and construction via @Type), users can already make full use of them - as arguments to basic operators, and for asserting properties via @floatCast to the respective type. (-> std.meta/math.assertFinite is implementable via @floatCast.)

Using NaN as a null-encoding is an opportune size optimization. If the compiler understands floating-point types well enough, any unused bit pattern is fit for this purpose: if the value is always a number, use a NaN representation; if it is always finite but may be NaN, use an infinity; and so on.


Except for ergonomics, the first half would already make the language feature-complete in this regard.
If that is an acceptable design, the remaining questions are:

  1. Which subset should we map to the fXX-family?
  2. Which subsets should we support via primitive-type syntax? (rXX/eXX)
  3. Which subset(s) should we map to the default arithmetic operators? (+, -, *, /)
  4. Which subsets should we support via other arithmetic operators? (new families like +%, +|)
(my thoughts)
  1. If based on hardware-support, the most-developed-for systems (afaik?) by default support all float states from above, including +/- infinities, +/- 0, and NaN with payload.
    (Maybe obvious, I don't like the idea of changing this depending on target platform.)
  2. IMHO using a standard function const r32 = std.math.FiniteRealNumber(32); is also readable, so it's really about which are fundamentally important enough to reserve letter-prefix syntax for.
    Note that it does help discoverability a bit, but then again this is a rather advanced topic already.
  3. I personally don't like the idea of changing their behaviour based on input types. Again, based on hardware, most operations can yield +/- inf or NaN in some cases, so even a/b in finite numbers needs to either yield a "full" float, or imply an assertion.
  4. If we want both full-IEEE float and finite number-only floats (and maybe some other combinations), I think having new operator families for these would be a perfect fit, if we can find agreeable mnemonic syntax.
    • +| for saturating finite float addition - if we want to support that - would be perfectly intuitive imo.
    • Maybe +< would signal assert finite, as in "has a limit"?
    • Or maybe single-letter suffixes end up more readable.
      (A bit iffy with single-letter-identifiers, but at least it would be a parsing error if you forget or add a space.)

@topolarity (Contributor, Author)

I appreciate the detailed thoughts! Your alternative sounds a lot like tracking general bounds for floats (similar to what was proposed for integers in #3806), and I absolutely agree that would solve this problem and potentially bring other benefits, if the ergonomic and other challenges involved can be resolved 🙂

FWIW, Inf-able and NaN-able are the subsets critical for safety and optimization. These are the only subsets that interact with -ffast-math to introduce undefined behavior, and NaN is the only value that is generally dangerous/incorrect to allow in comparisons (<, >, ==, !=).

@gwenzek (Contributor) commented Mar 22, 2022

So type systems help prevent bugs.
What kind of bug is this proposal preventing?

Looking at an example average function:

var avg1 = (x + y) / 2;
var avg2 = 0.5 * x + 0.5 * y;

The first implementation can overflow while the second can't. I think your proposal will force me to add explicit NaN checking to both implementations, but it doesn't help me write the correct one.
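The difference between the two implementations is easy to observe in Python (doubles rather than f32, but the overflow behavior is analogous):

```python
import math

x = y = 1e308  # near the maximum finite double (~1.8e308)

avg1 = (x + y) / 2        # x + y overflows to +Inf before the divide
avg2 = 0.5 * x + 0.5 * y  # each term stays finite

print(avg1)              # inf
print(avg2)              # 1e+308
print(math.isinf(avg1))  # True
```

Note that overflow yields an infinity rather than a NaN, which is relevant to how the proposal would classify this case.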

@rohlem (Contributor) commented Mar 22, 2022

@gwenzek I think "[helping someone] write the correct implementation" is a rather general statement of rather high expectations. Does an easy way to add an assertion meet your criterion?

I think your proposition will force me to add explicit Nan checking

The original post proposes overloading the arithmetic operators for ?f32, meaning you need no checks around operations where you want to accept NaN as arguments and results, just as in the status quo.
You would only check when converting from NaN-able ?f32 to non-NaN-able f32.


Your particular example is about overflow though, which doesn't intersect with the original proposal at all.
If you use my comment above as basis, since overflow results in positive infinity, this would be your transformed example:

const finite32 = std.math.FiniteFloatNumber(32); //this type never represents NaN nor infinities
var avg1 = @floatCast(finite32, (x + y) / 2); //asserts no overflow => panics if infinity is reached
var avg2 = @floatCast(finite32, 0.5 * x + 0.5 * y); //asserts no overflow (which passes if x and y are finite)

If we added an operator family, say +<, -<, *<, /< to represent "float arithmetic that asserts results to be finite", then you could write it as:

var avg1 = (x +< y) /< 2; //asserts no overflow => panics if infinity is reached
var avg2 = 0.5 *< x +< 0.5 *< y; //asserts no overflow (which passes if x and y are finite)

Edit: Also note that while avg1 may overflow, avg2 is more likely to involve subnormal numbers (a performance penalty), and truncate precision of smallest-possible epsilon numbers towards 0. Neither approach is perfect for all scenarios.

@InKryption (Contributor)

(Tried stitching together some thoughts into a coherent set of paragraphs, apologies if this comes across like rambling, feel free to dismiss it if you believe it lacks substance).

I think this proposal follows a trend I've seen in a couple of proposals that tend to run counter to the actual momentum of zig's design, in particular #7512 which, while not rejected, is pretty barren in terms of discussion, and has a comment under it by Andrew stating that it is unlikely to be accepted.

The actual direction Zig seems to be going in doesn't necessarily reject the introduction of new types for particular use cases, but it appears to prefer adding new operations on existing types to achieve particular semantics (wrapping and saturating arithmetic operators; @divFloor vs @divTrunc vs @divExact). This thought process even extends to it apparently being reasonable to propose removing volatile from the type system and replacing it with special operations.

This is all to say, I think that it would make more sense to add operators/builtins that allow one to produce the desired behavior, as is suggested by @rohlem, rather than overloading the usage of ?fXX or adding new type variants of fXX with specific behaviors.

I would loosely draw a comparison between this, and the usages of RAII (Rust, C++, ...) vs defer (Zig, Go, ...): where the former requires a program's state to operate and evolve in terms of its types (and the operations that are associated with them), the latter allows a program's state to operate almost exclusively in terms of the desired behaviors. Not saying that either is better or worse, but I would argue that the latter is quite a bit more aligned with the philosophy of Zig.

@topolarity (Contributor, Author) commented Mar 22, 2022

So type systems help prevent bugs. What kind of bug is this proposal preventing?

Comparison with NaN is the bug I'm trying to avoid:

// insertion sort -- can you spot the bug?
fn sort_inplace(vec: []f32) void {
    for (vec) |key, i| {
        var j = i;
        while (j > 0 and vec[j - 1] > key) {
            vec[j] = vec[j - 1];
            j = j - 1;
        }
        vec[j] = key;
    }
}

When given "{ 1.5, 0.5, 3.5, nan, 0.25, 6.0, 1.5 }" this returns "{ 0.5, 1.5, 3.5, nan, 0.25, 1.5, 6 }"
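The same algorithm, transliterated to Python (where comparisons follow the same IEEE 754 rules), reproduces the bug exactly:

```python
import math

def sort_inplace(vec):
    # Same insertion sort: every comparison against NaN is False,
    # so the NaN never moves and later elements stop shifting at it.
    for i, key in enumerate(vec):
        j = i
        while j > 0 and vec[j - 1] > key:
            vec[j] = vec[j - 1]
            j -= 1
        vec[j] = key

v = [1.5, 0.5, 3.5, float("nan"), 0.25, 6.0, 1.5]
sort_inplace(v)
print(v)  # [0.5, 1.5, 3.5, nan, 0.25, 1.5, 6.0]: not sorted past the NaN
```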

Under this proposal, the function signature of sort_inplace already makes it clear that this function is not written to handle NaN correctly. If you wanted it to handle NaN, you'd write it the way the proposal's error{NaN}!f32 example does.

This also intends to improve support for generic implementations.

A function that accepts ?T will generally correctly handle ?f32 too. This extends to hash maps: AutoHashMap currently refuses to accept f32 as a key, because it might be NaN. But with this change, it can support f32 and even ?f32 correctly.
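Python's dict, which does accept float keys, shows the hazard AutoHashMap is guarding against (hashing works, but NaN's self-inequality breaks key lookup):

```python
d = {}
d[float("nan")] = "first"
d[float("nan")] = "second"  # a *different* NaN object: nan != nan, so no match

print(len(d))              # 2: two "equal-looking" keys coexist
print(float("nan") in d)   # False: a fresh NaN never matches a stored key
```

Under the proposal, a non-NaN f32 key type would rule this out statically, which is what lets a generic hash map accept it.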

@topolarity (Contributor, Author) commented Mar 22, 2022

Perhaps in line with @InKryption's thinking, there's a pared-down version of this proposal where we make comparisons between floats return ?bool or error{NaNOperand}!bool and leave the rest as the status quo.

The downsides are that (1) if a function asserts a non-NaN comparison internally, that won't be clear from its signature, and (2) generic functions will have to special-case this unusual comparison result type.

That might be worth it to avoid complicating the types, though.

@MichaelByrneAU (Contributor)

Whilst I like the idea of controlling the handling of NaNs better, I feel this proposal overloads the concept of the option type in a distinctly unintuitive way from a language design standpoint.

There has been a fair amount of discussion on the representation of a NaN and some of the optimisations this proposal can facilitate, but fundamentally I feel that an option represents nullability whilst a NaN is something completely orthogonal to this semantically.

Phrased another way, I expect ?f32 to represent a float that may simply not exist, whilst a NaN has a distinctly different semantic meaning (e.g. the result of an ambiguous calculation). Further, I would want to preserve the ability to represent an ?f32 that itself could contain a NaN.

@topolarity (Contributor, Author) commented Mar 23, 2022

That's a very compelling argument

If ?f32 implies the wrong semantics, we could potentially use error{NaN}!f32 instead.

FWIW, the R statistical software uses NaN for missing values as an important optimization, but it also distinguishes between NA (missing value) and NaN (invalid result) in the way you describe. NA gets a special NaN payload, with the caveat that R cannot guarantee any consistent answer for NA * NaN, since payload propagation is under-specified by IEEE 754.

I'd also be curious to know what you think of the reduced change mentioned above: to simply make comparison between floats yield error{NaNOperand}!bool

@ominitay (Contributor)

Uhh, doesn't changing floats from pretending to be an optional to pretending to be an error union undermine your key advantage here of improved compatibility with generics?

@topolarity (Contributor, Author)

Uhh, doesn't changing floats from pretending to be an optional to pretending to be an error union undermine your key advantage here of improved compatibility with generics?

Hashmap would work just fine, for example. Can you think of an example that's broken?

@ominitay (Contributor)

Oh... I think I see what you mean now.

@topolarity (Contributor, Author)

Updated proposal to use error{NaN}!fX instead of ?fX.

Also factored out two ideas that don't depend on the main proposal:

  • @assertFinite/@assertNonNaN built-ins (for explicit fastmath assumptions)
  • ?f32 encoded using NaN payloads, as a size/performance optimization

@andrewrk modified the milestones: 0.10.0 to 0.11.0 on Apr 16, 2022
@dvmason commented Aug 30, 2022

I'm using NaN boxing in a (dynamic) language runtime I'm building, but I wouldn't let the floating-point unit get hold of any NaN values that I use to encode other things, so I don't think this proposal has any impact on NaN boxing, or at least I can't see how it would.
