Use byte_count to determine size of CoDICE data arrays #1171

bourque · 2024-11-20T21:34:22Z

This PR fixes a bug in how the CoDICE data arrays were being processed. Some packet data contains a buffer at the end of the data array that needed to be stripped out before the decompression algorithm is called. The pipeline now properly uses the byte_count field to determine the which bytes are included in the data arrays. With this fix, more packets can now be processed, which changed some of the unit test values.

…sing byte_count to determine length of data array

tech3371 · 2024-11-21T17:01:05Z

imap_processing/codice/codice_l1a.py

-                )
-                decompressed_values = []
+        for packet_data, byte_count in zip(
+            science_values, self.dataset.byte_count.data


I was looking at what this byte_count data does/mean and I saw this description from the XTCE

NUMBER OF BYTES IN THE DATA ARRAY. IF COMPRESSED, THIS VALUE REPRESENTS THE LENGTH OF THE COMPRESSED DATA.

Looks like this byte_count is used by many CoDICE Hi and Lo packets. It says 'if compressed'. Then I looked at Lo and Hi's compression id lookup dictionary. It seems like all id maps to some compression form. Will there be a case when the data will not be compressed?

Also, I just noticed that in this line, you were using parameters from first packet for remaining all science packets. I think that may be remains from your previous implementation. Will that need to change now since packet_data from science_values may be using different view_id, plan_id, and so on.

Sorry for long comments. These comments are not for this PR but something I wanted to point out in case we missed it before.

Will there be a case when the data will not be compressed?

I haven't come across any data product yet that is expected to have no compression (algorithm=0) (see screenshot from the SCI_LUT spreadsheet). I think this is one of those knobs that CoDICE wants to be able to turn to theoretically, but in practice I am not sure it would ever happen. I will ask Joey about this though.

Also, I just noticed that in this line, you were using parameters from first packet for remaining all science packets. I think that may be remains from your previous implementation. Will that need to change now since packet_data from science_values may be using different view_id, plan_id, and so on.

I am a little confused here. Were you referring to this part of the code?

imap_processing/imap_processing/codice/codice_l1a.py

Line 545 in 1208208

table_id = int(dataset.table_id.data[0])

, or event data processing?

If it is the former, I think it is safe to use the first packet because we expect each packet for a particular APID to have the same view_id, plan_step, etc., but I will also double check that with Joey.

cool. Since this is outside of this PR scope, I will approve it. We can address this in future PR if needed.

Fixed bug in which data arrays were including unessary padding, now u…

1208208

…sing byte_count to determine length of data array

bourque added Ins: CoDICE Related to the CoDICE instrument Level: L1 Level 1 processing labels Nov 20, 2024

bourque added this to the Nov 2024 milestone Nov 20, 2024

bourque self-assigned this Nov 20, 2024

bourque requested review from tech3371, sdhoyt and joeymukherjee November 20, 2024 21:34

tech3371 reviewed Nov 21, 2024

View reviewed changes

tech3371 approved these changes Nov 25, 2024

View reviewed changes

bourque merged commit 8be9182 into IMAP-Science-Operations-Center:dev Nov 25, 2024
17 checks passed

bourque deleted the codice-byte-count-bug branch November 25, 2024 17:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use byte_count to determine size of CoDICE data arrays #1171

Use byte_count to determine size of CoDICE data arrays #1171

bourque commented Nov 20, 2024

tech3371 Nov 21, 2024

bourque Nov 21, 2024

tech3371 Nov 25, 2024

Use byte_count to determine size of CoDICE data arrays #1171

Use byte_count to determine size of CoDICE data arrays #1171

Conversation

bourque commented Nov 20, 2024

tech3371 Nov 21, 2024

Choose a reason for hiding this comment

bourque Nov 21, 2024

Choose a reason for hiding this comment

tech3371 Nov 25, 2024

Choose a reason for hiding this comment