
add edm4eic::Tensor #96

Merged: 1 commit into main, Nov 4, 2024

Conversation

veprbl (Member) commented Sep 19, 2024

This adds a new type that facilitates exchange of tensor data for ML applications.

simonge (Contributor) commented Oct 2, 2024

I had some comments on this, but answered them myself while writing this out. I've added them here anyway in case they're useful, but otherwise the type looks fine.

  • If we want to make use of larger-batch inference, we could use the folder/unfolder to stitch together a bunch of events. This would add another variable dimension on top of the event-size batch. Admittedly, this could be flattened out for running the inference and handled by the folder.
  • Any batch treatment of a graph neural network, or of other variable-size inputs, requires a non-static dimension. I remembered that this is passed to the inference as a separate tensor that keeps track of how the data tensor is divided up.
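The fold/unfold idea in the first point can be sketched roughly as follows. This is a hypothetical illustration using NumPy as a stand-in for the EDM tensors; the names `fold`/`unfold` and the shapes are assumptions, not part of edm4eic:

```python
import numpy as np

# Hypothetical sketch: each event yields a fixed-shape feature tensor (here
# shape (4,)). The "folder" stacks events into one batch tensor with a
# leading event dimension; the "unfolder" splits the inference output back
# into per-event results.
events = [np.arange(4, dtype=np.float32) + i for i in range(3)]

# fold: add a batch dimension on top of the per-event shape
batch = np.stack(events)          # shape (3, 4)

# stand-in for the model call on the whole batch
output = batch * 2.0

# unfold: split the output back into per-event tensors
per_event = [output[i] for i in range(len(events))]
```

This flattening keeps every tensor at a concrete shape, with only the leading batch dimension varying per call.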

veprbl (Member, Author) commented Oct 2, 2024

This is a good topic. My comment is that the current implementation in the EICrecon PR does batch inference. As far as the EDM is concerned, there is no difference between batch and single-item evaluations; all tensors are supposed to have a concrete shape. The ONNX model and the from/to-tensor converters are the ones that have to know. The converters obviously need to do the batching, and, like you say, folding/unfolding is indeed a possibility. As for the second point, this is new to me; it sounds like a jagged tensor implementation. I'm not sure that ONNX models are coded like that. ONNX actually supports "sequences" of tensors, which are more likely to be used, according to my intuition. Support for those may require another type in the EDM with a vector member of edm4eic::Tensors.

simonge (Contributor) commented Oct 3, 2024

I have a demonstration model which (currently poorly) predicts the position and time of a hit from a varying-size collection of pixel hits. Here the model takes a TensorFlow ragged tensor input and can be exported using the tf2onnx package:
https://github.com/simonge/AllpixFastSim/blob/GNN/MakeModel/train_predictor_gnn.py#L80

The model actually ends up taking two input tensors of different types. The first is a float32 tensor containing the input values, and the second is an int64 tensor giving the positions in the value array where the different pseudo-clusters are separated.
https://github.com/simonge/AllpixFastSim/blob/GNN/TestModel/plot_input_resolutions_gnn.py#L127

From what I see, I believe this would be completely compatible with the data type and onnx inference algorithm you currently have.
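The two-tensor encoding described above can be sketched as follows. This is a hypothetical illustration with made-up data; `row_splits` follows the convention used by TensorFlow's RaggedTensor, where cluster `i` spans `values[row_splits[i]:row_splits[i+1]]`:

```python
import numpy as np

# Hypothetical sketch of the ragged encoding: a flat float32 tensor of
# values plus an int64 tensor marking where each pseudo-cluster begins
# and ends (row splits).
clusters = [np.array([1.0, 2.0], dtype=np.float32),
            np.array([3.0], dtype=np.float32),
            np.array([4.0, 5.0, 6.0], dtype=np.float32)]

# first input tensor: all values concatenated, shape (6,)
values = np.concatenate(clusters)

# second input tensor: cumulative cluster boundaries, [0, 2, 3, 6]
row_splits = np.cumsum([0] + [len(c) for c in clusters]).astype(np.int64)

# cluster i can be recovered as values[row_splits[i]:row_splits[i + 1]]
recovered = [values[row_splits[i]:row_splits[i + 1]]
             for i in range(len(clusters))]
```

Both tensors have concrete shapes per event, which is why this representation stays compatible with a plain fixed-shape tensor type.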

ruse-traveler left a comment

LGTM! I'm happy with this as is, but I wonder if we might want to add additional element types down the line... For example, I can imagine that having a bool (elementType==9) might be useful for some ML classifiers.

veprbl (Member, Author) commented Nov 3, 2024

> LGTM! I'm happy with this as is, but I wonder if we might want to add additional element types down the line... For example, I can imagine that having a bool (elementType==9) might be useful for some ML classifiers.

I'd like to avoid the bool. It's probably useless (no existing exporter will use it) and might present an edge case when we add any kind of support for zero-copy.
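For reference, the elementType==9 suggested for bool above matches ONNX's `TensorProto.DataType` enum, where `BOOL = 9`. A small sketch of decoding such codes, assuming (hypothetically) that the elementType numbering follows that ONNX enum:

```python
# Subset of ONNX's TensorProto.DataType numbering; the assumption that
# edm4eic::Tensor's elementType follows this enum is hypothetical, inferred
# from the elementType==9 (bool) suggestion in the review above.
ONNX_ELEMENT_TYPES = {
    1: "float32",   # TensorProto.FLOAT
    7: "int64",     # TensorProto.INT64
    9: "bool",      # TensorProto.BOOL
    11: "float64",  # TensorProto.DOUBLE
}

def element_type_name(code: int) -> str:
    """Return a readable name for an elementType code, if known."""
    return ONNX_ELEMENT_TYPES.get(code, "unknown")
```

Keeping to the codes that exporters actually emit (float32, int64) avoids the zero-copy edge cases mentioned above.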

veprbl merged commit 9f0067b into main Nov 4, 2024
5 checks passed
veprbl deleted the pr/add_tensor_type branch November 4, 2024 21:09
3 participants