Computing residuals when scaling is logarithmic #1087

IndrajeetPatil · 2022-08-16T11:09:29Z

log(x) is ln(x) in R, right?
Seems wrong to me, I would expect log10(predicted)-log10(observed) and not ln(predicted)-ln(observed)???

Originally posted by @Yuri05 in #1085 (comment)

Referring to the code here:

OSPSuite-R/R/utilities-data-combined.R

Lines 234 to 238 in 1252a50

    
           if (scaling %in% c(tlf::Scaling$lin, "identity")) { 
        
             pairedData <- dplyr::mutate(pairedData, residualValues = predictedValues - yValues) 
        
           } else { 
        
             pairedData <- dplyr::mutate(pairedData, residualValues = log(predictedValues) - log(yValues)) 
        
           }

The text was updated successfully, but these errors were encountered:

IndrajeetPatil · 2022-08-16T11:11:44Z

This was done to stay faithful to what is done in the reporting engine:

https://github.com/Open-Systems-Pharmacology/OSPSuite.ReportingEngine/blob/3a3aa9ac469643a0a50db34ca4b4aafdeae71250/R/utilities-goodness-of-fit.R#L189-L194

But, yes, you are correct that log(x) in R is log(x, exp(1)), i.e. ln(x). Therefore, the correct way to do this would be using log10().

Yuri05 · 2022-08-16T11:15:18Z

@sfrechen @PavelBal Is calculating of logarithmic residuals using ln (and not log10) in RE correct?

sfrechen · 2022-08-16T12:51:16Z

Natural logarithm is correct. The residuals should follow then a lognormal distribution.

PavelBal · 2022-08-16T13:20:27Z

Then the PK-Sim implementation is not correct?

sfrechen · 2022-08-16T14:16:04Z

Let's say: uncommon. And not directly comparable to a lognormal distribution.

Yuri05 · 2022-08-16T23:53:43Z

Let's say: uncommon. And not directly comparable to a lognormal distribution.

Then I wonder, why in the "Histogram of residuals" in RE we plot normal distribution all the time??? @sfrechen

sfrechen · 2022-08-17T07:39:53Z

Then I wonder, why in the "Histogram of residuals" in RE we plot normal distribution all the time???

Well, if the assumed error model is logarithmic, i.e. epsilon follows log normal distribution (which is usually the case), then plotting the logarithmized residuals against a normal distribution is correct (because log(residual) follows then a normal distribution while residual follows a log normal distribution)

If the assumed error model is linear, i.e. epsilon follows normal distribution, then plotting the "raw" residuals against normal distribution is also correct.

Whether the residuals are logarithmized or not should depend on the underlying assumption of the error model!

Yuri05 · 2022-08-17T08:03:42Z

a) We don't plot log(residual) in the histogram. We plot residual.
b) Actually, the residuals can be negative. How can they be lognormal distributed then???

sfrechen · 2022-08-17T10:16:41Z

a) We don't plot log(residual) in the histogram. We plot residual.

Hm. As said: Whether the residuals are logarithmized or not should depend on the underlying assumption of the error model, i.e. if the parameter identification is done in log scale for an output, then the residuals should be calculated as res = log(obs)-log(sim).

b) Actually, the residuals can be negative. How can they be lognormal distributed then???

Sorry, I was not accurate:

If the assumed error model is logarithmic, i.e. log(y)=log(f)+eps and epsilon follows a normal distribution, thus exp(epsilon) follows log normal distribution, then plotting the logarithmized residuals against a normal distribution is correct.

PavelBal · 2022-10-20T15:32:13Z

Do we have an agreement?

Yuri05 assigned sfrechen and PavelBal Aug 16, 2022

IndrajeetPatil closed this as completed Aug 16, 2022

Yuri05 mentioned this issue Aug 16, 2022

Improvements to residual calculation function #1090

Merged

sfrechen reopened this Aug 17, 2022

Yuri05 mentioned this issue Aug 18, 2022

Log residuals calculation Open-Systems-Pharmacology/PK-Sim#2312

Open

PavelBal mentioned this issue Mar 7, 2023

- Use logSafe from utils package #1218

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Computing residuals when scaling is logarithmic #1087

Computing residuals when scaling is logarithmic #1087

IndrajeetPatil commented Aug 16, 2022

IndrajeetPatil commented Aug 16, 2022

Yuri05 commented Aug 16, 2022

sfrechen commented Aug 16, 2022

PavelBal commented Aug 16, 2022

sfrechen commented Aug 16, 2022

Yuri05 commented Aug 16, 2022

sfrechen commented Aug 17, 2022 •

edited

Loading

Yuri05 commented Aug 17, 2022 •

edited

Loading

sfrechen commented Aug 17, 2022

PavelBal commented Oct 20, 2022

Computing residuals when scaling is logarithmic #1087

Computing residuals when scaling is logarithmic #1087

Comments

IndrajeetPatil commented Aug 16, 2022

IndrajeetPatil commented Aug 16, 2022

Yuri05 commented Aug 16, 2022

sfrechen commented Aug 16, 2022

PavelBal commented Aug 16, 2022

sfrechen commented Aug 16, 2022

Yuri05 commented Aug 16, 2022

sfrechen commented Aug 17, 2022 • edited Loading

Yuri05 commented Aug 17, 2022 • edited Loading

sfrechen commented Aug 17, 2022

PavelBal commented Oct 20, 2022

sfrechen commented Aug 17, 2022 •

edited

Loading

Yuri05 commented Aug 17, 2022 •

edited

Loading