Reduce the number of calls to simplify #537

irevoire · 2024-08-15T09:11:03Z

Reduce the number of calls to simplify

Fixes #535

We try to call FullSimplify as rarely as possible:
basically only right before outputting something to the end user (calling print, returning a value to the user, or composing strings).

Performance

On my computer:

Startup time improved by ~16% (from 38ms to 32ms)
Large computation range(0, 1_000_000) |> map(sqrt) |> sum improved by ~12% (from 1.5s to 1.3s)

Original issue

Hey, after your answer, I started some work on #535

My initial idea was to call full_simplify only at the end of statements.

The problem is that we don't want to run it unconditionally.

To solve this part, I added a boolean in every value that tells us if the immediate last operation was a ConvertTo.

As expected, I got a 10% performance improvement:

% hyperfine './numbat-master -e "range(0, 1_000_000) |> map(sqrt) |> sum"' './numbat-no-simplify -e "range(0, 1_000_000) |> map(sqrt) |> sum"'
Benchmark 1: ./numbat-master -e "range(0, 1_000_000) |> map(sqrt) |> sum"
  Time (mean ± σ):      1.606 s ±  0.006 s    [User: 1.525 s, System: 0.072 s]
  Range (min … max):    1.596 s …  1.617 s    10 runs
 
Benchmark 2: ./numbat-no-simplify -e "range(0, 1_000_000) |> map(sqrt) |> sum"
  Time (mean ± σ):      1.458 s ±  0.009 s    [User: 1.378 s, System: 0.072 s]
  Range (min … max):    1.440 s …  1.469 s    10 runs
 
Summary
  ./numbat-no-simplify -e "range(0, 1_000_000) |> map(sqrt) |> sum" ran
    1.10 ± 0.01 times faster than ./numbat-master -e "range(0, 1_000_000) |> map(sqrt) |> sum"

The issue

With this code, only two tests fail.

The issue is that when a function with a type annotation returns something, we should convert the returned value to the expected type:

fn _human_num_days(time: Time) -> Scalar = floor(time / day)

Here, for example, floor(time / day) returns a Time / Time, which we want to convert to a simple scalar, I guess?
I don't understand how to do that. The return_type_annotation of a function is a TypeAnnotation, and I need to give a Unit to ConvertTo. I didn't find any way to do that in the code and am struggling to make the conversion myself.

Let me know if you think this makes sense and know how to help me.

sharkdp · 2024-08-15T19:31:21Z

Thank you for looking into this!

The issue is that when a function with a type annotation returns something, we should convert the returned value to the expected type:

Hm, no. Whether or not a quantity is simplified should definitely not depend on the presence of a type annotation. It shouldn't depend on the type at all. Types in Numbat always mean: physical dimensions (Scalar, Length, Velocity) and not units (—, meter, meter/second).

Here, for example, floor(time / day) returns a Time / Time, which we want to convert to a simple scalar, I guess?

The type Time / Time is exactly the same type as Scalar.
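To illustrate why Time / Time and Scalar are the same type, here is a minimal sketch that models a dimension as a vector of exponents over base dimensions. The `Dimension` struct and its two fields are hypothetical and only serve this example; they are not Numbat's actual type representation.

```rust
use std::ops::Div;

/// Hypothetical model of a physical dimension as exponents of two base
/// dimensions. This is an illustration, not Numbat's real type system.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Dimension {
    length: i32,
    time: i32,
}

const SCALAR: Dimension = Dimension { length: 0, time: 0 };
const TIME: Dimension = Dimension { length: 0, time: 1 };
const LENGTH: Dimension = Dimension { length: 1, time: 0 };

impl Div for Dimension {
    type Output = Dimension;

    /// Dividing dimensions subtracts their exponents.
    fn div(self, rhs: Dimension) -> Dimension {
        Dimension {
            length: self.length - rhs.length,
            time: self.time - rhs.time,
        }
    }
}

fn main() {
    // Time / Time has all exponents equal to zero, so it *is* Scalar.
    println!("Time / Time == Scalar: {}", TIME / TIME == SCALAR);
    // Length / Time (a velocity) is a genuinely different dimension.
    println!("Length / Time == Scalar: {}", LENGTH / TIME == SCALAR);
}
```

Note that nothing in this model mentions seconds or days: units are a property of the value, not of its type.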

With this code, only two tests fail.

I think I would need a bit more information to see what exactly went wrong in those two tests.

irevoire · 2024-08-18T10:07:26Z

Hm, no. Whether or not a quantity is simplified should definitely not depend on the presence of a type annotation. It shouldn't depend on the type at all. Types in Numbat always mean: physical dimensions (Scalar, Length, Velocity) and not units (—, meter, meter/second).

Thanks, I was completely wrong about my approach and was able to fix some stuff because of your comment.

I think the PR is almost ready to merge, but there is still an issue I don't really understand:

% cargo run -- -e "floor(1.5 seconds / 1 days)"
0.0000115741

The floor function doesn't work, and probably neither do ceil, round, etc.

Also, on main is this behavior expected? https://numbat.dev/?q=floor%281.5+second+%2F+day%29%E2%8F%8Efloor%281.5+second+%2F+day+%E2%9E%9E+second+%2F+day%29%E2%8F%8E

That seems strange to me 🤔

sharkdp · 2024-08-18T12:37:43Z

I think the PR is almost ready to merge, but there is still an issue I don't really understand:
% cargo run -- -e "floor(1.5 seconds / 1 days)"
0.0000115741

That happens because it gets floored to 1.0 seconds / days and then converted to a scalar (0.0000115741).
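The arithmetic behind that output can be checked with a small sketch. The function names are made up for this example; the only assumption is that one day is 86,400 seconds.

```rust
/// Seconds per day, used to turn a number expressed in `second / day`
/// into a plain scalar.
const SECONDS_PER_DAY: f64 = 86_400.0;

/// Floors the stored number first (still in `second / day`), then
/// converts to a scalar. This reproduces the surprising result.
fn floor_then_convert(value_in_s_per_day: f64) -> f64 {
    value_in_s_per_day.floor() / SECONDS_PER_DAY
}

/// Converts to a scalar first, then floors.
fn convert_then_floor(value_in_s_per_day: f64) -> f64 {
    (value_in_s_per_day / SECONDS_PER_DAY).floor()
}

fn main() {
    // 1.5 s/day -> floor -> 1.0 s/day -> 1 / 86400
    println!("{:.10}", floor_then_convert(1.5)); // prints 0.0000115741
    // 1.5 s/day -> 0.0000173611 -> floor
    println!("{:.10}", convert_then_floor(1.5)); // prints 0.0000000000
}
```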

Also, on main is this behavior expected? https://numbat.dev/?q=floor%281.5+second+%2F+day%29%E2%8F%8Efloor%281.5+second+%2F+day+%E2%9E%9E+second+%2F+day%29%E2%8F%8E

Sort of.

The problem is not with your PR. The problem is the very existence of those functions. Please see my explanation in #534 (comment) and the ticket #336.

I'll try to give a more helpful response (or maybe a fix) in a few days.

sharkdp · 2024-08-18T12:39:01Z

Actually. Maybe your PR even solves the problem that I mentioned in that description?! In that case, we should probably provide floor_in, round_in etc. and remove floor, round etc entirely.

irevoire · 2024-08-18T14:46:00Z

The problem is not with your PR. The problem is the very existence of those functions. Please see my explanation #534 (comment) and the ticket #336.

Oh yes, you're right; these functions don't really make sense when playing with different units; it's actually a way more complex subject than I expected 😅

In that case, we should probably provide floor_in, round_in etc.

That seems smart, and the right side of the function call could be any value instead of a type right?
So round_in(time, days) would work, but round_in(143, 13) would work as well right?

and remove floor, round etc entirely.

💯
Now that I understand the issue, I don't see how it could make sense to round stuff randomly on units we don't have control over; the "in" unit should always be specified.

On a side note, it should also always be the second argument so we can easily write 80 km / hours |> round(m/s).

And last question

What should be the exact syntax in our case for simple scalars?
I guess in my case I should write: round_in(time / day, 1) and internally round_in should call convertTo?
The 1 is ugly though 😔

sharkdp · 2024-08-18T18:18:29Z

Let's try to fix the notation, so we don't get confused. I would propose we change the type annotation for round to make it more restrictive (instead of deleting it entirely) and introduce round_in like so:

fn round(value: Scalar) -> Scalar
fn round_in<D: Dim>(base: D, value: D) -> D = round(value / base) × base

I think this solves all problems:

assert_eq(round(1.234), 1)

assert_eq(1.234 m |> round_in(m), 1 m)
assert_eq(1.234 m |> round_in(cm), 123 cm)
assert_eq(1.234 m |> round_in(mm), 1234 mm)

assert_eq(1.234 m |> round_in(10 m), 0)
assert_eq(1.234 m |> round_in(1 m), 1 m)
assert_eq(1.234 m |> round_in(0.1 m), 1.2 m, 1e-9 m)
assert_eq(1.234 m |> round_in(0.01 m), 1.23 m, 1e-9 m)

assert_eq(1234 |> round_in(1000), 1000)
assert_eq(1234 |> round_in(100), 1200)
assert_eq(1234 |> round_in(10), 1230)
assert_eq(1234 |> round_in(1), 1234)
assert_eq(1234 |> round_in(0.1), 1234)

and the right side of the function call could be any value instead of a type right?

base can be any value, yes.

So round_in(time, days) would work, but round_in(143, 13) would work as well right?

So with the notation above: round_in(days, time) would work. It's the same as time |> round_in(days). And round_in(13, 143) would also work, but it would be a bit weird 😄. But things like round_in(10, 143) make sense and would yield 140.
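For plain scalars, the proposed definition amounts to rounding to the nearest multiple of base. A minimal sketch on `f64` follows; it mirrors the Numbat one-liner above but ignores units entirely, which is an assumption of this example.

```rust
/// Rounds `value` to the nearest multiple of `base`, following the
/// proposed definition `round(value / base) × base`. Units are ignored
/// here; this only models the scalar case.
fn round_in(base: f64, value: f64) -> f64 {
    (value / base).round() * base
}

fn main() {
    println!("{}", round_in(10.0, 143.0)); // 143 / 10 = 14.3 -> 14 -> 140
    println!("{}", round_in(13.0, 143.0)); // 143 / 13 = 11 -> 11 -> 143
    println!("{}", round_in(1000.0, 1234.0)); // 1.234 -> 1 -> 1000
    println!("{}", round_in(100.0, 1234.0)); // 12.34 -> 12 -> 1200
}
```

Keeping base as the first parameter is what makes the pipe form `value |> round_in(base)` read naturally.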

On a side note, it should also always be the second argument so we can easily write 80 km / hours |> round(m/s).

Yes.

What should be the exact syntax in our case for simple scalars?

I think we should keep round for that after all. It's by far the most common use case to call round on scalar values, so we should still provide it, but restrict it to only work on Scalar values. If someone attempts to do round(1.234 m), that wouldn't work. Ideally, we could show a helpful error message mentioning round_in.

irevoire · 2024-08-18T22:08:46Z

Oops, I swapped the order of the parameters with |> in my previous message. My bad 🤦

I’ll look into it later but:

If someone attempts to do round(1.234 m),

Don't we have an issue here?
Above you also wrote:

The type Time / Time is exactly the same type as Scalar.

Which was causing the issue initially with a floor(time / day)

irevoire · 2024-08-19T09:32:32Z

I updated the code with a round_in/floor_in function, but it doesn't work because:

fn round_in<D: Dim>(base: D, value: D) -> D = round(value / base) × base

In my case, floor(value / base) returns something in s / day, which is badly rounded, and the function then returns something non-floored.

% cargo run -- -e "floor_in(day, 12420s)"
   Compiling numbat-cli v1.13.0 (/Users/irevoire/numbat/numbat-cli)
    Finished `dev` profile [unoptimized + debuginfo] target(s) in 1.93s
     Running `target/debug/numbat -e 'floor_in(day, 12420s)'`
0.14375 day

I guess one solution could be to call fullSimplify before calling floor?
That seems a bit hacky to me and hard to maintain in the VM...

Another solution could be to provide a new function, simplify, that can only be called on scalar and then move the complexity to the stdlib of numbat instead of the VM. Still hacky IMO

And the last solution I can think of would be to provide a new special internal type like « number » or something that is a fully simplified scalar. It could only be generated by converting a scalar to it and the VM could automatically generate the fullsimplify instruction.

sharkdp · 2024-08-19T20:27:40Z

In my case, floor(value / base) returns something in s / day, which is badly rounded, and the function then returns something non-floored.

I think you need to change simple_polymorphic_math_function!(round, round); to simple_scalar_math_function!(round, round) in numbat/src/ffi/math.rs (and similarly for floor and all the others). It will use Quantity::as_scalar which is guaranteed to succeed by the type checker (if we change the signature to Scalar -> Scalar).
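The difference between the two behaviours can be sketched with a toy quantity type. Everything below is hypothetical: this `Quantity` struct, `unit_factor`, `ratio` and the two `floor_in_*` functions are illustration only, and the `as_scalar` method here is a simplified stand-in that shares only its name with Numbat's Quantity::as_scalar.

```rust
/// Hypothetical stand-in for a quantity: a number plus the factor that
/// converts its unit to the base unit (seconds in this example).
#[derive(Debug, Clone, Copy)]
struct Quantity {
    value: f64,
    /// How many base units one of this quantity's unit is worth.
    unit_factor: f64,
}

impl Quantity {
    /// Reduces a dimensionless quantity (e.g. in `s / day`) to a plain number.
    fn as_scalar(self) -> f64 {
        self.value * self.unit_factor
    }
}

/// Computes `value / base`, kept in the compound unit `value_unit / base_unit`.
fn ratio(value: Quantity, base: Quantity) -> Quantity {
    Quantity {
        value: value.value / base.value,
        unit_factor: value.unit_factor / base.unit_factor,
    }
}

/// Floors the stored number without looking at its unit, then multiplies
/// by `base` again. The result is expressed in units of `base`.
fn floor_in_polymorphic(base: Quantity, value: Quantity) -> f64 {
    let r = ratio(value, base);
    let floored = Quantity { value: r.value.floor(), ..r };
    floored.as_scalar() * base.value
}

/// Converts the ratio to a true scalar before flooring.
fn floor_in_scalar(base: Quantity, value: Quantity) -> f64 {
    ratio(value, base).as_scalar().floor() * base.value
}

fn main() {
    let day = Quantity { value: 1.0, unit_factor: 86_400.0 };
    let t = Quantity { value: 12_420.0, unit_factor: 1.0 }; // 12420 s

    // floor(12420 s/day) = 12420 s/day, which is 0.14375 day after conversion.
    println!("polymorphic: {:.5} day", floor_in_polymorphic(day, t));
    // 12420 s/day is 0.14375 as a scalar, and its floor is 0.
    println!("scalar:      {:.5} day", floor_in_scalar(day, t));
}
```

The first line reproduces the 0.14375 day output reported above, and the second shows what happens once the ratio is reduced to a scalar before flooring.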

sharkdp · 2024-08-19T21:28:34Z

See #546

irevoire · 2024-08-19T23:57:22Z

Oooh nice I didn't know we had that!
I’ll rebase once your PR lands on main 👌

sharkdp · 2024-08-20T07:11:43Z

I’ll rebase once your PR lands on main 👌

This has been merged now. (I really like how this example looks now with the new |> operator and the new round_in function).

irevoire · 2024-08-20T16:49:17Z

(I really like how this example looks now with the new |> operator and the new round_in function).

Ahah this one is awesome, I didn't know about it! 😂

irevoire · 2024-08-20T22:08:33Z

I rebased my PR, edited the original message and updated the performance improvements

sharkdp · 2024-08-21T15:19:01Z

This is great. What I don't fully understand at the moment: using your can_simplify trick, could we even take this further and remove the FullSimplify instruction altogether?

sharkdp · 2024-08-21T15:22:13Z

numbat/src/quantity.rs

@@ -24,20 +24,31 @@ pub type Result<T> = std::result::Result<T, QuantityError>;
 pub struct Quantity {
    value: Number,
    unit: Unit,
+    can_simplify: bool,


The name confused me a bit when first reading this code. Maybe call it simplification_allowed? And rename no_simplify to prevent_simplification?

irevoire · 2024-08-21T15:38:39Z

I think we can, but I didn't try it because it's hard to know if we can simplify something.

The context

So, as stated in the description, the idea is to not simplify when we don't need to.
So now, the only times we need to simplify an expression are:

When crafting a string: {expr}
When a statement only contains an expression (it'll be shown to the end-user)

The issue

The issue is that if someone wrote an expression that's not simplified, like expr -> km/m, even if it doesn't make sense, we should output the expression in the unit they specified.

The trick

This boolean is only false after processing a ConvertTo expression, and any operation applied to it will make the boolean true again.

Avoiding the trick

could we even take this further and remove the FullSimplify instruction altogether?

In the end, I believe this is possible, but it requires us to keep track of the usage of each variable while compiling the stmts to asm and, although it doesn't seem hard, that's not something we already do in numbat.

let a = 3m/m -> km/m
let stuff = 47
let bidule = 34 + stuff + a

print(a) # We need to find back the definition of `a` and see if it contained a `ConvertTo` thing

Just thinking out loud now that I wrote this example, maybe we currently have only two cases where we should not simplify an expression:
Either we're printing an identifier whose definition ends with a ConvertTo, or the expression we're currently printing ends with a ConvertTo.

In the first case it means we could store the boolean next to the identifier name instead of making every value heavier.
In the second case, it's easy and quick to check that the top node of an expression is a ConvertTo
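A rough sketch of those two checks, under the assumption of a heavily reduced expression tree: the `Expr` enum, the `converted` map and `should_skip_simplification` are all hypothetical and do not correspond to Numbat's actual AST or compiler state.

```rust
use std::collections::HashMap;

/// Hypothetical, heavily reduced expression tree (not Numbat's real AST).
#[allow(dead_code)] // the payloads are not inspected in this sketch
#[derive(Debug)]
enum Expr {
    Number(f64),
    Identifier(String),
    Add(Box<Expr>, Box<Expr>),
    /// `expr -> unit`
    ConvertTo(Box<Expr>, String),
}

/// Decides whether the printed expression must keep its unit as written.
/// `converted` records, per identifier, whether its definition ended
/// with a conversion (the boolean stored "next to the identifier name").
fn should_skip_simplification(expr: &Expr, converted: &HashMap<String, bool>) -> bool {
    match expr {
        // Case 2: the top node of the expression itself is a conversion.
        Expr::ConvertTo(_, _) => true,
        // Case 1: an identifier whose definition ended with a conversion.
        Expr::Identifier(name) => converted.get(name).copied().unwrap_or(false),
        // Any other operation makes the result simplifiable again.
        _ => false,
    }
}

fn main() {
    let mut converted = HashMap::new();
    converted.insert("a".to_string(), true); // let a = 3m/m -> km/m
    converted.insert("bidule".to_string(), false); // let bidule = 34 + stuff + a

    let print_a = Expr::Identifier("a".to_string());
    let print_bidule = Expr::Identifier("bidule".to_string());
    println!("print(a) skips simplification: {}",
        should_skip_simplification(&print_a, &converted));
    println!("print(bidule) skips simplification: {}",
        should_skip_simplification(&print_bidule, &converted));
}
```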

Sorry I’m not sure that's super clear but let me know if it isn't 😅

sharkdp · 2024-08-21T15:48:50Z

I was thinking of the following:

we never perform any simplification as an instruction (remove FullSimplify)
we set the boolean flag that you introduced. This "flags" values that were created by a convert-to operation.
the flag can disappear again if further operations are applied to the value (e.g. (x -> m) + 2 cm)
whenever we print values or create strings out of them, we call full_simplify() on the value first. values that were previously flagged will not get simplified (because full_simplify checks the flag).

Wouldn't this be enough already?
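The four steps above can be sketched with a toy quantity type. This is a simplified model, not the code in numbat/src/quantity.rs: the `factor` field, the string unit names and the method bodies are assumptions, and the flag uses the name `simplification_allowed` suggested in the review comment.

```rust
/// Toy model of a dimensionless quantity with a (possibly compound) unit.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Quantity {
    value: f64,
    /// Unit name; an empty string stands for a plain scalar.
    unit: &'static str,
    /// Size of the unit relative to a plain scalar (e.g. 1000 for km/m).
    factor: f64,
    /// Cleared by an explicit conversion, set again by any other operation.
    simplification_allowed: bool,
}

impl Quantity {
    fn new(value: f64, unit: &'static str, factor: f64) -> Self {
        Quantity { value, unit, factor, simplification_allowed: true }
    }

    /// `self -> unit`: the user asked for this unit, so flag the value.
    fn convert_to(self, unit: &'static str, factor: f64) -> Self {
        Quantity {
            value: self.value * self.factor / factor,
            unit,
            factor,
            simplification_allowed: false,
        }
    }

    /// Any further operation makes the flag disappear again.
    fn add(self, rhs: Quantity) -> Self {
        Quantity {
            value: self.value + rhs.value * rhs.factor / self.factor,
            simplification_allowed: true,
            ..self
        }
    }

    /// Called only when printing or building strings; respects the flag.
    fn full_simplify(self) -> Self {
        if !self.simplification_allowed {
            return self;
        }
        Quantity::new(self.value * self.factor, "", 1.0)
    }
}

fn main() {
    // let a = 3 m/m -> km/m
    let a = Quantity::new(3.0, "m/m", 1.0).convert_to("km/m", 1000.0);
    let shown = a.full_simplify();
    println!("{} {}", shown.value, shown.unit); // stays in km/m

    // a + 34: the flag is gone, so printing simplifies to a scalar.
    let sum = a.add(Quantity::new(34.0, "", 1.0)).full_simplify();
    println!("{:.1} (unit: {:?})", sum.value, sum.unit);
}
```

In this model no simplification happens during evaluation at all; the only cost at print time is one flag check.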

irevoire · 2024-08-23T01:33:42Z

Yes, I need to try a few things, but it should work and be even faster 🔥

sharkdp · 2024-08-29T18:49:52Z

So it turns out this was really easy to implement on top of what you did (I hope I didn't miss something you had thought about). I basically just had to remove some code and add a few .full_simplify() calls.

It's very nice conceptually that FullSimplify is now gone.

This PR also fixes two issues we discovered recently. I added regression tests for them.

For the cargo bench prelude benchmark, I see a 20% improvement (48 ms => 37 ms). And similarly when measuring with hyperfine:

Command	Mean [ms]	Min [ms]	Max [ms]	Relative
`./numbat-master -e "1+1"`	56.1 ± 1.8	53.7	62.1	1.22 ± 0.05
`./numbat-no-simplify -e "1+1"`	45.9 ± 1.2	43.8	50.2	1.00

For the benchmark you suggested in your original post, I still see the 10% improvement. And I see that you have a much faster machine than I have 😄

Command	Mean [s]	Min [s]	Max [s]	Relative
`./numbat-master -e "…"`	2.291 ± 0.030	2.258	2.352	1.09 ± 0.03
`./numbat-no-simplify -e "…"`	2.098 ± 0.046	2.029	2.168	1.00

Thank you very much for your work. This is great. Let me know if I missed something.

irevoire · 2024-08-29T20:31:18Z

Hey, it's awesome! I'm sorry I didn't finish this myself. I started working again and didn't find the time to work on it again, but it's awesome that you finished it!

Thank you

xmbhasin · 2024-08-31T16:13:39Z

numbat/tests/interpreter.rs

@@ -943,5 +950,14 @@ mod tests {
              212121001.1 cm
            "###);
        }
+
+        #[test]
+        fn issue505_angles() {


Nice. This looks a lot better now.

sharkdp mentioned this pull request Aug 18, 2024

Unexpected behavior of unit conversions inside functions #534

Closed

irevoire added 3 commits August 20, 2024 23:18

reduce the number of calls to simplify

eaa57e5

Fix most of the simplify calls

064d564

add a floor_in and round_in function

6ccb113

irevoire force-pushed the reduce-call-to-simplify branch from 96194c9 to 6ccb113 Compare August 20, 2024 21:19

fix warning

7e449b8

irevoire marked this pull request as ready for review August 20, 2024 22:03

remove useless simplify call

68870fe

sharkdp reviewed Aug 21, 2024

sharkdp mentioned this pull request Aug 29, 2024

assert_eq: Better error messages #505

Closed

sharkdp added 2 commits August 29, 2024 20:30

Remove FullSimplify instruction

ffd1fa0

Add regression test for angle example in sharkdp#505

9782a50

Add regression test for sharkdp#534

94bbbcb

sharkdp merged commit 29a99e9 into sharkdp:master Aug 29, 2024
15 checks passed

sharkdp mentioned this pull request Aug 29, 2024

Improve startup time (regression) #525

Closed

xmbhasin reviewed Aug 31, 2024

BrewTestBot mentioned this pull request Oct 11, 2024

numbat 1.14.0 Homebrew/homebrew-core#193820

Merged
