Fix clock drift rounding error #82

carloshorn · 2020-08-27T14:07:28Z

This PR closes #80.

carloshorn · 2020-08-27T14:11:03Z

Ohh... I did not create the branch from master.... so it also contains #81.

mraspaud · 2020-08-27T14:11:42Z

ok, we'll just have to be careful and merge them in the right order

mraspaud · 2020-08-27T14:14:37Z

pygac/pod_reader.py


            min_idx = min(line_indices)
-            max_idx = max(max(line_indices),
-                          max(self.scans["scan_line_number"] - min_idx)) + 1
+            max_idx = max(line_indices.max(), self.scans["scan_line_number"].max()) + 1


why remove min_idx here?

because this leads to the index error. The max argument compares line numbers and is therefore in the line number domain. the - min_idx is the transformation to the complete_lons array index domain. In the case of the crashing orbits, we had exactly the situation, where the right side wins the max comparison.

Maybe better variable names could help to avoid this confusion. Something like min_line, max_line, line_range = max_line - min_line. Then we can use index or idx for the indices of complete_lons, complete_lats which are numbers in 0:line_range+1, where the interesting ones are self.scans["scan_line_number"] - min_line.

because this leads to the index error. The max argument compares line numbers and is therefore in the line number domain. the - min_idx is the transformation to the complete_lons array index domain. In the case of the crashing orbits, we had exactly the situation, where the right side wins the max comparison.

From reading again the code, I see, that the right hand side should win the max comparison whenever the clock drift is positive, because the array line_indices results from

pygac/pygac/pod_reader.py

Lines 438 to 444 in 5c59f2b

offsets *= -2

int_offsets = np.floor(offsets).astype(np.int)

# filling out missing geolocations with computation from pyorbital.

line_indices = (self.scans["scan_line_number"]

+ int_offsets)

where offsets is a floating point number in seconds. The multiplication with -2 to convert the time offset into a line number offset is not well documented, but we know that the sign of the offset determines who wins the max comparison. I think the reason why it sill worked in most of the cases is the +2 in the following line:

pygac/pygac/pod_reader.py

Line 453 in 5c59f2b

idx_len = max_idx - min_idx + 2

the line I tested that crashed had an int_offset of -3, which in combination with the unexplained +2 appeared suspicious to me. Since the max index has already the +1, the range of valid indices should not require any +2. My first guess was that the +2 is required for the later interpolation, but slerp is linear and therefore only requires the next neighbours.

Now, I assume that it also requires this change to cover all possible indexes

min_idx = min(line_indices.min(), self.scans["scan_line_number"].min())

And without understanding the transformation, but arguing with symmetry, I would assume that

pygac/pygac/pod_reader.py

Line 440 in 5c59f2b

int_offsets = np.floor(offsets).astype(np.int)

should be changed to

int_offsets = np.where(offsets<0, np.floor(offsets), np.ceil(offsets)).astype(np.int)

to cover the right index range, but here I am not yet 100% sure.

carloshorn · 2020-08-31T11:45:24Z

I need to further understand the underlying code...
It appears like the same offset adjustment is done again in Reader.compute_lonlat

pygac/pygac/reader.py

Lines 485 to 497 in 5c59f2b

    
           if clock_drift_adjust: 
        
               from pygac.clock_offsets_converter import get_offsets 
        
               try: 
        
                   offset_times, clock_error = get_offsets(self.spacecraft_name) 
        
               except KeyError: 
        
                   LOG.info("No clock drift info available for %s", 
        
                            self.spacecraft_name) 
        
               else: 
        
                   offset_times = np.array(offset_times, dtype='datetime64[ms]') 
        
                   offsets = np.interp(utcs.astype(np.uint64), 
        
                                       offset_times.astype(np.uint64), 
        
                                       clock_error) 
        
                   utcs = utcs - (offsets * 1000).astype('timedelta64[ms]')

where in this case there is no rounding error.
So the first get_offsets call is only used to get the index range. Then it is called again to do the actual shifting by the correct offset...

I will prepare a refactoring of the clock drift adjustment, so maybe #81 and #82 become obsolete.
Could you tell me the difference between reader.time and reader.utcs?
It appears to me like it is the same information, but stored in different data types.
Storing it twice has the disadvantage of synchronization. Maybe one of them should become a property which does the type conversion.

horn added 2 commits August 27, 2020 13:55

fix IndexError in POD reader adjust clock drift

09a4fc5

Consider milliseconds of clock offsets

27f3a85

mraspaud reviewed Aug 27, 2020

View reviewed changes

carloshorn mentioned this pull request Sep 2, 2020

Refactor clock drift #84

Merged

mraspaud closed this in #84 Sep 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix clock drift rounding error #82

Fix clock drift rounding error #82

carloshorn commented Aug 27, 2020

carloshorn commented Aug 27, 2020

mraspaud commented Aug 27, 2020

mraspaud Aug 27, 2020

carloshorn Aug 27, 2020

carloshorn Aug 27, 2020

carloshorn Aug 28, 2020

carloshorn Aug 28, 2020

carloshorn Aug 28, 2020

carloshorn commented Aug 31, 2020

	offsets *= -2

	int_offsets = np.floor(offsets).astype(np.int)

	# filling out missing geolocations with computation from pyorbital.
	line_indices = (self.scans["scan_line_number"]
	+ int_offsets)

Fix clock drift rounding error #82

Fix clock drift rounding error #82

Conversation

carloshorn commented Aug 27, 2020

carloshorn commented Aug 27, 2020

mraspaud commented Aug 27, 2020

mraspaud Aug 27, 2020

Choose a reason for hiding this comment

carloshorn Aug 27, 2020

Choose a reason for hiding this comment

carloshorn Aug 27, 2020

Choose a reason for hiding this comment

carloshorn Aug 28, 2020

Choose a reason for hiding this comment

carloshorn Aug 28, 2020

Choose a reason for hiding this comment

carloshorn Aug 28, 2020

Choose a reason for hiding this comment

carloshorn commented Aug 31, 2020