[WIP] Lazy attribute names #4154

infinisil · 2020-10-16T20:22:24Z

This is an initial prototype of lazy attribute names, allowing things like:

(throw "" // { x = 0; }).x
-> 0

See #4090 for more info.

This is a work-in-progress. I opened this PR to keep track of what needs to be done still and how far I've gotten, and maybe get some help with it too. In particular, the main problem right now is that this currently makes Nix about 40% slower.

Ping @edolstra

Todo

This code should also be generic enough that it should allow lazy lists and strings in the future (e.g. being able to do builtins.elemAt ([ 0 ] ++ throw "") 0). This could be done in a future PR once this is merged.

And for future reference, previous (and later) worse attempts of implementing this are in my branches lazy-attr-names and lazy-attr-names-v2 lazy-attr-names-v4

This is a version of Expr::eval that doesn't necessarily have to evaluate the value into a non-thunk, while returning a specific attribute The evalAttr of expressions that evaluate to a subexpression should call evalAttr on that subexpression, therefore bubbling up any potential thunks

infinisil · 2020-10-20T19:40:24Z

Cleaned up the commits into smallish changes, undoing some smarts along the way, which unexpectedly fixed a bug I was encountering with this, nice!

I also did some proper performance measurement, and the results are not as bad as I thought (presumably the smarts I undid were also causing the worse performance). I wrote a little measurement and plotting script to do this:

measure:

#!/usr/bin/env bash
set -euo pipefail

nixCommand() {
  #nix-instantiate '<nixpkgs/nixos>' \
  #  --arg configuration '<nixpkgs/nixos/modules/profiles/demo.nix>' \
  #  -A vm
  nix-instantiate ~/src/nixpkgs -A firefox
}

run() {
  local stats=$(mktemp)

  PATH=$PWD/bin:$PATH NIX_SHOW_STATS=1 NIX_SHOW_STATS_PATH="$stats" nixCommand >/dev/null 2>/dev/null

  jq -r '.cpuTime' "$stats"
  rm "$stats"
}

measure() {
  local name=$1
  local duration=$2
  local dest="times/$name"

  echo "Making sure binary is up-to-date"
  nix-shell --run "make install -j8"

  echo "Clearing any previous results in $dest"
  mkdir -p "$(dirname "$dest")"
  > "$dest"

  echo "Warming up with a single run"
  run >/dev/null

  epochStart=$(date +%s)
  epochEnd=$(( epochStart + duration ))

  echo "Measuring for at least $duration seconds"
  while now=$(date +%s) && [[ "$now" -le "$epochEnd" ]]; do
    result=$(run)
    echo "Measured $result seconds, writing to file. $(( epochEnd - now )) seconds left"
    echo "$result" >> "$dest"
  done
}

collectdata() {
  for f in times/*; do
    jq --arg name "$(basename "$f")" '[ ., inputs ] | map({ "CPU Time" : ., "Version" : $name })' -R "$f"
  done | jq -s '. | map(.[])'
}

plot() {
  collectdata > "data.json"
  # Last nixpkgs version where vega_lite wasn't broken
  vegaLite=$(nix-build --no-out-link https://github.com/NixOS/nixpkgs/archive/e1773ee0bb99e6785e2e06c0931cc8ffa9341d2a.tar.gz -A nodePackages.vega-lite)
  "$vegaLite/bin/vl2svg" plot.json plot.svg
  xdg-open plot.svg
}

case "$1" in
  plot)
    plot
    ;;
  *)
    measure "$1" "$2"
esac

plot.json:

{
  "$schema": "https://vega.github.io/schema/vega-lite/v4.json",
  "description": "Nix performance on `nix-instantiate '<nixpkgs/nixos>' --arg configuration '<nixpkgs/nixos/modules/profiles/demo.nix>' -A vm`",
  "data": {"url": "data.json"},
  "mark": {
    "type": "boxplot",
    "extent": "min-max"
  },
  "encoding": {
    "x": {"field": "Version", "type": "nominal"},
    "color": {"field": "Version", "type": "nominal", "legend": null},
    "y": {
      "field": "CPU Time",
      "type": "quantitative",
      "scale": {"zero": false}
    }
  }
}

To use:

Check out the code version you'd like to test
Run ./measure some-id 300, which runs the nixCommand in the script for 300 seconds, storing the results in times/some-id
Repeat with all the other commits you want to compare against, changing the some-id for each of them
Run ./measure plot, which uses Vega Lite to render all the measurements as a box plot

With 0 representing the base commit of this PR (master), 1 the first commit of this PR, 2 the second commit, etc., the plot looks as follows (with ~275 samples):

Note how this is only a tiny bit slower, from ~0.62 to ~0.64! And I do believe some things can be optimized still, so I hope to be able to improve performance with this PR in the end :)

This introduces an expression base class that can be used for lazy binary operations, along with a value type for storing partial results of such expressions

This makes the // operator lazy in attribute names, only evaluates what is necessary to get a specific attribute.

So as to not increase the sizeof(Value) from 24 bytes to 40 bytes

I think this is needed so that any variables on the sides get updated properly Without this, bin/nix-instantiate ~/src/nixpkgs/nixos --arg configuration ~/src/nixpkgs/nixos/modules/profiles/demo.nix -A vm fails with a segfault Not sure why this doesn't happen with the commit that makes // lazy though

The previous implementation relied on uninitialized memory not being a certain value for it to work

Since it throws inf rec when it doesn't have to sometimes

infinisil · 2020-10-27T23:45:04Z

Performance testing with the latest changes reveals that there's pretty much no measurable decrease in performance, yay! I heard you like plots? I give you plot (0 is the base of this PR, each number is an additional commit)

I'll consider the performance problem solved, though some fine tuning may still be done.

Infinite recursion detection

The main problem now is that infinite recursion detection is going to be much trickier, and I haven't figured that out. Previously if you had an unevaluated thunk (a Value with type = tThunk) which you want to evaluate, you'd set type = tBlackhole, then evaluate the expression the thunk points to, and throw an inf rec error if the evaluation tries to evaluate a tBlackhole (therefore trying to evaluate something while you're evaluating it already). Once it's evaluated, you set type = tAttrs or whatever the result is. This worked previously because a Value could either be unevaluated (tThunk, tApp, ...) or it could be evaluated (tAttrs, tInt, ...).

But with lazy attribute names, there's the new tLazyBinOp Value type, which is a value that can be partially evaluated, and that multiple times. E.g. if you have a = { x = 0; } // { y = 1; }, and you evaluate first a.y then a itself, you'll transform the Value multiple times:

Initially: type = tThunk
After a.y: type = tLazyBinOp; left.type = tThunk; right.type = tAttrs; by calling evalLazyBinOpAttrs
After a: type = tAttrs by calling evalLazyBinOp

With this, you can't just set type = tBlackhole the first time you evaluate a, because you may encountered a many more times after that, without having to encounter infinite recursion.

Here's some tricky examples of when inf rec should be triggered and when it shouldn't (currently the ones that should just give a stack overflow without position information):

{
  # Should throw inf rec
  a = let x = {} // x; in x.y;

  # Should not throw inf rec
  b = let x = x // { y = 0; }; in x.y;

  # Should throw inf rec
  c = let x = x // { y = 0; }; in x.z;

  # Should throw inf rec
  d = let x = { y = x.z; } // { z = x.y; }; in x.y;

  # Should not throw inf rec
  e = let x = ({ y = x.z; } // { z = x.y; }) // { y = 0; }; in x.z;

  # Should not throw inf rec, even with --strict
  f = let x = x.y // { y = {}; }; in x;
}

If it's possible to implement this well, it would probably involve tracking which sides of a tLazyBinOp are currently being evaluated. Note that a tLazyBinOp can have an arbitrary Value on either side, including another tLazyBinOp.

nixos-discourse · 2020-11-05T16:51:13Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/tweag-nix-dev-update-4/9862/1

This time it also works with lazy binops The previous optimizations for prevention of allocation had to be undone Code still needs cleanup, but it should be sound

With the previous commit, passing left/right is now unnecessary

Every evaluation can now pass a handler, which is called once the resulting value is either a tAttrs or a tLazyAttrs There are two handlers: - One for weak head normal form (changes tLazyAttrs into tAttrs) - One for getting an attribute (lazily gets attributes from tLazyAttrs, strictly from tAttrs)

infinisil · 2020-12-04T23:23:22Z

I made some good progress this week! I had to pretty much redesign this feature again, in order to deduplicate some function definitions i previously duplicated. I also fixed the infinite recursion detection, so that works again now. I think I have reached the final design of this, it's looking very promising now.

Unfortunately I'm now pretty sure that this does cause an evaluation slow-down of about 5% in the end. So I think it's best to make this an opt-in feature instead. So the next thing I'll work on for this PR is introducing a new primop, builtins.lazyUpdate (or builtins.lazyAttrsUpdate), which implements this lazy attribute name behavior. This is good enough for most purposes anyways.

While it would be possible to create a nix.conf option to opt-into the lazy attribute name behavior for //, this is probably a bad idea, since enabling it leads to expressions that require that option to be turned on to evaluate without error.

We could also consider making this the default again in the future in case the ~5% overhead can be removed.

By using bitfields to encode the left/right blackhole for infinite recursion detection

…me allocs" This reverts commit 4a2c47a.

nixos-discourse · 2021-07-08T05:39:48Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/tweag-nix-dev-update-15/13975/1

nixos-discourse · 2021-12-28T16:26:06Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/specifics-of-set-laziness/16837/2

nixos-discourse · 2021-12-28T22:43:39Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/specifics-of-set-laziness/16837/4

nixos-discourse · 2022-03-29T12:41:05Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/nixlang-how-do-you-find-all-uses-of-a-declaration/18369/13

axelkar · 2023-02-22T20:43:11Z

Does this fix the following too?

nix-repl> (let name = abort "b"; in { ${name} = 1; } // { a = 2; }).a
error: evaluation aborted with the following error message: 'b'

infinisil · 2023-07-14T18:45:09Z

@axelkar Yes that would work in the current state of this PR (however, see below)

Regarding the state of this PR, it should be changed to only expose this functionality under a builtins.lazyUpdate primitive. This will simplify this PR considerably. If I get to it I intend to pick this back up, because I'm constantly running into use cases where this would be beneficial.

ggPeti · 2023-10-26T09:22:42Z

@infinisil currently inherit is failing eagerly. does your change set fix that too?

but even

infinisil · 2023-10-26T10:03:43Z

@ggPeti It does not

nixos-discourse · 2024-11-10T21:00:41Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/is-it-possible-for-us-to-remove-builtins-functionargs/51960/4

infinisil mentioned this pull request Oct 16, 2020

Lazy attribute names #4090

Open

infinisil added 2 commits October 19, 2020 20:57

Move attribute selection code to Values

5ddfd1a

infinisil force-pushed the lazy-attr-names-v3 branch from 0a7ccf2 to 6d7d037 Compare October 19, 2020 19:06

infinisil force-pushed the lazy-attr-names-v3 branch from 6d7d037 to fd71421 Compare October 21, 2020 14:19

infinisil added 4 commits October 21, 2020 16:35

Unroll first iteration of ExprSelect loop to avoid thunk allocation

48511ea

Introduce ExprLazyBinOp and tLazyBinOp

35d5be4

This introduces an expression base class that can be used for lazy binary operations, along with a value type for storing partial results of such expressions

Make ExprOpUpdate be an ExprLazyBinOp

5b1ef7b

This makes the // operator lazy in attribute names, only evaluates what is necessary to get a specific attribute.

Allocate LazyBinOp values separately from Value

152048d

So as to not increase the sizeof(Value) from 24 bytes to 40 bytes

infinisil force-pushed the lazy-attr-names-v3 branch from fd71421 to 152048d Compare October 21, 2020 14:36

infinisil added 4 commits October 21, 2020 20:03

Pass left/right lazyBinOp to update functions directly

7b947a7

The previous implementation relied on uninitialized memory not being a certain value for it to work

Implement noAllocationValue for Expr's and use it to avoid some allocs

4a2c47a

Disable all inf rec checking for now

e8a7a5b

Since it throws inf rec when it doesn't have to sometimes

Implement infinite recursion detection again

f99249d

This time it also works with lazy binops The previous optimizations for prevention of allocation had to be undone Code still needs cleanup, but it should be sound

infinisil force-pushed the lazy-attr-names-v3 branch from cfd86dc to f99249d Compare December 1, 2020 21:39

infinisil added 2 commits December 2, 2020 14:07

Simplify lazyBinOp handling a bit

0edad24

With the previous commit, passing left/right is now unnecessary

Use bitfields for LazyBinOp blackholes

d1d9f1e

infinisil force-pushed the lazy-attr-names-v3 branch from 5d50971 to d1d9f1e Compare December 4, 2020 13:04

infinisil added 6 commits December 4, 2020 16:45

Split fromValue into handleAttrs and handleLazyBinOp

1642440

Better EvalHandler design

272c728

Simplify eval handler calling and rename to EvalStrategy

b86bd8a

Reimplement ExprSelect loop unroll

3584b83

Small lazyBinOp optimizations

ca1201f

Fix __curPos test

42a8ff1

infinisil added 9 commits December 5, 2020 02:40

More position info

e291fc6

Rename tLazyBinOp to tLazyUpdate

b97295e

Remove Env and ExprLazy from LazyBinOp

a5add93

Embed LazyBinOp values into Value

acecc9d

By using bitfields to encode the left/right blackhole for infinite recursion detection

bitfield union magic

e5a3b04

Change strategy function returns to void, blackhole optimizations

5af84e6

builtins.lazyAttrUpdate and make // strict by default

ba195b7

Revert "Implement noAllocationValue for Expr's and use it to avoid so…

cff6f8f

…me allocs" This reverts commit 4a2c47a.

Remove Expr::getPos again

c3bbb4d

infinisil force-pushed the lazy-attr-names-v3 branch from ddfe9a1 to c3bbb4d Compare December 5, 2020 22:47

fricklerhandwerk added the language The Nix expression language; parser, interpreter, primops, evaluation, etc label Sep 9, 2022

tomberek mentioned this pull request Oct 5, 2022

Search any attribute recursively if recurseForDerivation is set #6936

Closed

roberth mentioned this pull request Apr 9, 2023

Proxy like abstraction for an attrset #8187

Open

jtojnar mentioned this pull request Apr 12, 2023

rustPlatform.buildRustPackage: support finalAttrs style NixOS/nixpkgs#194475

Open

13 tasks

This was referenced May 7, 2023

Got infinite recursion encountered when accessing self.outPath in a flake #8300

Closed

Cannot access inputs.self.outPath in top level imports hercules-ci/flake-parts#148

Open

Atry mentioned this pull request May 9, 2023

Add moduleLocation to mkFlake argument hercules-ci/flake-parts#158

Merged

infinisil mentioned this pull request Jan 31, 2024

Implicit attribute defaults/overrides inside package definition NixOS/nixpkgs#273534

Open

infinisil mentioned this pull request Sep 22, 2024

Enforce that package argument defaults are applied, cleans up optional dependency convention NixOS/nixpkgs#131271

Draft

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Lazy attribute names #4154

[WIP] Lazy attribute names #4154

infinisil commented Oct 16, 2020 •

edited

Loading

infinisil commented Oct 20, 2020

infinisil commented Oct 27, 2020 •

edited

Loading

nixos-discourse commented Nov 5, 2020

infinisil commented Dec 4, 2020 •

edited

Loading

nixos-discourse commented Jul 8, 2021

nixos-discourse commented Dec 28, 2021

nixos-discourse commented Dec 28, 2021

nixos-discourse commented Mar 29, 2022

axelkar commented Feb 22, 2023

infinisil commented Jul 14, 2023

ggPeti commented Oct 26, 2023

infinisil commented Oct 26, 2023

nixos-discourse commented Nov 10, 2024

[WIP] Lazy attribute names #4154

Are you sure you want to change the base?

[WIP] Lazy attribute names #4154

Conversation

infinisil commented Oct 16, 2020 • edited Loading

Todo

infinisil commented Oct 20, 2020

infinisil commented Oct 27, 2020 • edited Loading

Infinite recursion detection

nixos-discourse commented Nov 5, 2020

infinisil commented Dec 4, 2020 • edited Loading

nixos-discourse commented Jul 8, 2021

nixos-discourse commented Dec 28, 2021

nixos-discourse commented Dec 28, 2021

nixos-discourse commented Mar 29, 2022

axelkar commented Feb 22, 2023

infinisil commented Jul 14, 2023

ggPeti commented Oct 26, 2023

infinisil commented Oct 26, 2023

nixos-discourse commented Nov 10, 2024

infinisil commented Oct 16, 2020 •

edited

Loading

infinisil commented Oct 27, 2020 •

edited

Loading

infinisil commented Dec 4, 2020 •

edited

Loading