Detect clearsky modifications #510

benbenboben · 2018-07-26T21:55:38Z

Brief description of the problem and proposed solution (if not already fully described in the issue linked to above):

The fifth condition (c5) of the clearsky.detect_clearsky function wasn't calculated as defined in the original reference (equations 12 - 14). This pull request also addresses part of #507 where there was an implicit assumption of minutely data. All derivatives are now calculated by explicitly dividing by the sample_interval. This does not address non-uniform time intervals.

Closes issue Correct condition 5 in clearsky.detect_clearsky #506 (addresses part of Handle different time intervals in clearsky.detect_clearsky #507)
I am familiar with the contributing guidelines.
Fully tested. Added and/or modified tests to ensure correct behavior for all reasonable inputs. Tests (usually) must pass on the TravisCI and Appveyor testing services.
Updates entries to docs/sphinx/source/api.rst for API changes.
Adds description and name entries in the appropriate docs/sphinx/source/whatsnew file for all changes.
Code quality and style is sufficient. Passes git diff upstream/master -u -- "*.py" | flake8 --diff
New code is fully documented. Includes sphinx/numpydoc compliant docstrings and comments in the code where necessary.
Pull request is nearly complete and ready for detailed review.

…iginal reference.

cwhanse

I feel strongly (perhaps to the point of insisting) that this function provide a warning if passed data with timesteps different enough from 1 minute (say outside of 30s to 2 min) and the default thresholds values are used. The validation of this algorithm only used ~1 min timesteps, and the default threshold values derive from that validation.

cwhanse · 2018-07-27T14:23:54Z

pvlib/clearsky.py

@@ -687,7 +687,7 @@ def detect_clearsky(measured, clearsky, times, window_length,
        raise NotImplementedError('algorithm does not yet support unequal ' \
                                  'times. consider resampling your data.')

-    samples_per_window = int(window_length / sample_interval)
+    samples_per_window = int(window_length / sample_interval) + 1


Revert this change please. samples_per_window is counting intervals, not endpoints. Because it's an intermediate it could be renamed to be more clear.

I believe that not adding 1 gives the incorrect number of samples per window. For example, if my data is 30-minute frequency (sample_interval=30) and I want 60 minute windows (window_length=60), the current implementation would only give 2 points per window (when the Hankel matrix is constructed in the following lines). In this case, 2 points per window would only span a 30 minute window, not the intended 60.

Your example makes my point. The algorithm operates on intervals not on points in time. The value at a timestamp is considered as the value for the following (?) interval - I'd have to look carefully at the Hankel matrix and the diff to see if we adopted a left- or right- endpoint convention.

cwhanse · 2018-07-27T14:25:48Z

pvlib/clearsky.py

@@ -697,25 +697,27 @@ def detect_clearsky(measured, clearsky, times, window_length,
    # calculate measurement statistics
    meas_mean = np.mean(measured[H], axis=0)
    meas_max = np.max(measured[H], axis=0)
-    meas_slope = np.diff(measured[H], n=1, axis=0)
+    meas_ghi_diff = np.diff(measured[H], n=1, axis=0)
+    meas_slope = np.diff(measured[H], n=1, axis=0) / sample_interval


The non-uniform time steps could be handled here by

converting the time to UNIX timestamps

diffing the time to get time intervals

element-wise division in the calculation of meas_slope

using the diffed time in the calculation of meas_line_length , 'clear_slope`, etc.

This could be the subject of a subsequent pull request. In hindsight, I could have separated #507 into two issues (different from 1 minute, and non-uniform time steps.)

wholmgren · 2018-10-10T16:21:47Z

completed in #596

benbenboben added 2 commits July 16, 2018 09:33

Fixed inconsistencies between the detect_clearsky function and the or…

f4e4ad3

…iginal reference.

Minor cleanup.

3f9dc9d

cwhanse reviewed Jul 27, 2018

View reviewed changes

cwhanse mentioned this pull request Oct 3, 2018

Correct condition 5 in detect_clearksy #596

Merged

8 tasks

wholmgren closed this Oct 10, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Detect clearsky modifications #510

Detect clearsky modifications #510

Uh oh!

benbenboben commented Jul 26, 2018

Uh oh!

cwhanse left a comment

Uh oh!

cwhanse Jul 27, 2018

Uh oh!

benbenboben Jul 27, 2018

Uh oh!

cwhanse Jul 27, 2018

Uh oh!

cwhanse Jul 27, 2018

Uh oh!

wholmgren commented Oct 10, 2018

Uh oh!

Uh oh!

Detect clearsky modifications #510

Detect clearsky modifications #510

Uh oh!

Conversation

benbenboben commented Jul 26, 2018

Uh oh!

cwhanse left a comment

Choose a reason for hiding this comment

Uh oh!

cwhanse Jul 27, 2018

Choose a reason for hiding this comment

Uh oh!

benbenboben Jul 27, 2018

Choose a reason for hiding this comment

Uh oh!

cwhanse Jul 27, 2018

Choose a reason for hiding this comment

Uh oh!

cwhanse Jul 27, 2018

Choose a reason for hiding this comment

Uh oh!

wholmgren commented Oct 10, 2018

Uh oh!

Uh oh!