
Split anomalous dimensions into a Rust crate #185

Closed
alecandido opened this issue Dec 24, 2022 · 12 comments · Fixed by #189

@alecandido
Member

The motivations for this issue are mainly:

  1. we are using inconsistent tooling in the theory pipeline to achieve performant computations: here we use Numba, while PineAPPL is implemented in Rust for the same reason
  2. Numba has some limitations in terms of supported features; from this point of view Rust is much more flexible (it is a full language with its own ecosystem, while Numba is bound to remain a "subset" of Python forever)
  3. Numba seems to be especially slow at compiling N3LO terms
  4. it is much more difficult to provide bindings to the anomalous dimensions for other languages, which might be relevant since we have some original pieces

I am currently trying to outline a suitable layout for the new Rust module; I am making some attempts in this repo:
https://github.com/AleCandido/atuin

I already asked the maturin authors for support, but a minimal skeleton (with no further crates for bindings) is almost working:
PyO3/maturin#1372

@alecandido alecandido self-assigned this Dec 24, 2022
@alecandido alecandido added enhancement New feature or request rust Rust extension related labels Dec 24, 2022
@alecandido
Member Author

The template works. As soon as we close the current main refactoring (i.e. #172), I will provide a PR with a working layout, though with almost nothing in it.

The proposal is to move the Numba-decorated content to a standalone crate (let's say ekspressions), while providing bindings to it in a separate crate.

In principle, we can also move the integration to the Rust side (such that all the heavy lifting happens there, and we do not cross the Rust-Python boundary during intensive operations). I would do this in the bindings crate, or in a separate one, keeping ekspressions (a silly name, feel free to suggest a better one) for the analytical expressions alone.
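The two-crate split could be sketched as a Cargo workspace. All names here are placeholders from this discussion (ekspressions, and a hypothetical ekspressions-py bindings crate); versions are illustrative:

```toml
# Cargo.toml at the repository root: one workspace, two crates.
[workspace]
members = ["ekspressions", "ekspressions-py"]

# ekspressions-py/Cargo.toml (hypothetical bindings crate):
# [lib]
# crate-type = ["cdylib"]
#
# [dependencies]
# ekspressions = { path = "../ekspressions" }
# pyo3 = { version = "0.19", features = ["extension-module"] }
```

The pure crate stays free of Python dependencies, so it can also be reused from other languages later (point 4. above).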

To integrate in Rust we can use bindings to the GSL: https://docs.rs/GSL/latest/rgsl/integration/index.html (i.e. essentially the same as SciPy, but we have to install the GSL ourselves).
I am still looking for better options, but this discourages me from attempting it in the short term...
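Just to illustrate that plain Rust quadrature is also an option, here is a toy composite Simpson rule with no external dependencies. This is a sketch, not the rgsl API, and `simpson` is a hypothetical helper:

```rust
// Toy composite Simpson's rule: a dependency-free stand-in for a real
// adaptive quadrature routine (e.g. GSL's QAG via the rgsl crate).
fn simpson<F: Fn(f64) -> f64>(f: F, a: f64, b: f64, n: usize) -> f64 {
    assert!(n % 2 == 0, "n must be even");
    let h = (b - a) / n as f64;
    // Endpoints weight 1; odd interior points weight 4; even ones weight 2.
    let mut sum = f(a) + f(b);
    for i in 1..n {
        let x = a + i as f64 * h;
        sum += if i % 2 == 1 { 4.0 * f(x) } else { 2.0 * f(x) };
    }
    sum * h / 3.0
}

fn main() {
    // Integrate x^2 on [0, 1]; the exact result is 1/3.
    let result = simpson(|x| x * x, 0.0, 1.0, 100);
    assert!((result - 1.0 / 3.0).abs() < 1e-9);
    println!("{:.6}", result);
}
```

A real Mellin inversion would of course need an adaptive method with error control, which is exactly what GSL provides.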

@felixhekhorn
Contributor

Compiling just took me about half an hour (!)

Matching: computing operators - 4/60 took: 1644.154465 s

I think we need to do something ...

@alecandido
Member Author

I think we need to do something ...

Agreed, and the first step is already in #189. But it is lower priority than other business, in particular FONLL.

However, if @giacomomagni wants to start having a look, we can discuss it. But for me and you it is off limits until everything else is working smoothly.

@alecandido alecandido added this to the post milestone Jan 27, 2023
@alecandido
Member Author

Rust supports incremental compilation (enabled by default only for debug builds, and reasonably left opt-in for release builds):
https://nnethercote.github.io/perf-book/compile-times.html
https://doc.rust-lang.org/cargo/reference/profiles.html#incremental
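Per the Cargo profiles documentation linked above, opting in for release builds is a one-line setting in Cargo.toml:

```toml
# Opt in to incremental compilation for release builds as well
# (it is already the default for dev builds).
[profile.release]
incremental = true
```

Whether this pays off depends on how often the expressions themselves change versus their callers.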

@felixhekhorn
Contributor

@scarlehoff said in NNPDF/pineko#105 :

my computer crashed after eko took every possible resource,

and also, discussing with @giacomomagni last week, we reached the same conclusion: we should do something on the compilation front ...

@scarlehoff
Member

To be more precise, my problem is that I have many cores but not so much memory so my scratch got completely filled (quite rapidly).

@alecandido
Member Author

To be more precise, my problem is that I have many cores but not so much memory so my scratch got completely filled (quite rapidly).

You're not alone, the same happened to me...

and also discussing with @giacomomagni last week, we found the same conclusion: we should do something on the compilation front ...

Yes, we should do something, but it won't be quick. The current workaround is to limit the number of processors Numba uses for compilation, but it still takes a long time...

@scarlehoff
Member

You're not alone, it happened the same to me...

And it happened again! This time while computing an eko for the evolution of a PDF. Not sure if it has gotten way worse lately (i.e., in one of the latest point releases) or whether I just didn't notice it.

@felixhekhorn
Contributor

I think you should set the default to 1 (instead of 8)

Not sure if it has gotten way worse lately (i.e., in one of the latest point releases) or whether I just didn't notice it.

the strategy and settings should not have changed ... (not since 0.13)

@scarlehoff
Member

scarlehoff commented Jul 11, 2023

It could've been that I just happened not to use the relevant computers while they were in swap, so I didn't notice.

8 seems to be safe (at least for now for me). I was going higher :P

@alecandido
Member Author

And it happened again! This time while computing an eko for the evolution of a PDF. Not sure if it has gotten way worse lately (i.e., in one of the latest point releases) or whether I just didn't notice it.

This instead should be quick to solve.

An EKO has many dimensions, in particular Q2 and x (the one to Mellin invert), and we can parallelize:

  1. by dataset
  2. top-level within a dataset, on Q2 (available after the jets rework)
  3. bottom-level, in the quadrature integration (won't be available for Python, but coming soon in Rust)
  4. mid-level, doing more integrals at the same time

At the beginning, only 1. and 4. were available, and 4. was needed by Giacomo (who implemented it). Unfortunately, this parallelization is handled by Python, with an enormous memory overhead (essentially, the involved objects are copied once per spawned thread, since threads are bound to new interpreter instances). Now that 2. is available, I would deprecate 4., setting the default number of threads to one, and eventually drop it to avoid similar issues in the future.
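To illustrate why 2. becomes cheap on the Rust side: threads share or move data instead of copying it per interpreter. A minimal std::thread sketch, with `compute_operator` as a hypothetical stand-in for the real per-Q2 work:

```rust
use std::thread;

// Placeholder for the expensive per-Q2 operator computation.
fn compute_operator(q2: f64) -> f64 {
    q2.ln() // stand-in for the real work
}

fn main() {
    let q2_grid = vec![10.0, 100.0, 1000.0];
    // One thread per Q2 point; each closure moves only the f64 it needs,
    // with no per-interpreter object copies as in Python multiprocessing.
    let handles: Vec<_> = q2_grid
        .into_iter()
        .map(|q2| thread::spawn(move || compute_operator(q2)))
        .collect();
    let results: Vec<f64> = handles.into_iter().map(|h| h.join().unwrap()).collect();
    assert_eq!(results.len(), 3);
    println!("{:?}", results);
}
```

In practice a thread pool bounded by the core count would replace the one-thread-per-point spawn.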

@giacomomagni
Collaborator

I think you should set the default to 1 (instead of 8)

I agree, let's set it to 1. In the end, you now want to parallelize only if you really need it.
