Skip to content

Preprocessing and spectral standardization of ring trial MIR instruments

License

Notifications You must be signed in to change notification settings

soilspectroscopy/ringtrial-preprocessing

Repository files navigation

Soil spectroscopy ring trial

Overview

Inter-laboratory comparison of soil spectral measurements as part of the SoilSpec4GG project.

This repository is used for preparing the MIR returns with preprocessing and spectral standardizaton for further analysis. Spectral harmonization is ommited to avoid publicly disclosing laboratories information.

The workspace development is defined by:

Original returns

The csv files were downloaded from Google Drive using the googledrive package. Files with .csv extension were listed inside MIR returns. Results, documentation and extra files were discarded. In addition, some raw files (.SPA, .0) were converted to .csv and uploaded to Google Drive before formatting to the OSSL specifications.

Formatting

In order to standardize the spectra to the same unit, scale, and spectral range, the original returns were formatted to the OSSL standard:

  • Transformed all spectra to apparent absorbance (A = log10(1/R)).
  • Trimmed all spectra between 650 and 4000 cm-1 range.
  • Resampled all spectra to 2 cm-1 resolution.

Preprocessing

In addition, all the separated datasets were row-binded and preprocessed by:

  • raw: Raw spectra (without preprocessing).
  • BOC: Baseline correction (Savitzky–Golay Smoothing [SGS] [s=0, p=2, w=11, delta.wav=2] + Baseline Offset Correction [BOC, i.e., subtracting spectrum minimum]).
  • SG1stDer: SG First Derivative (SG1stDer [s=1, p=2, w=11, delta.wav=2]).
  • SNV: Standard Normal Variate (SGS + SNV).
  • SNVplusSG1stDer: SNV followed by SG1stDer.
  • wavelet: Wavelet.

Spectral Subspace Transformation

Spectral space transformation (SST) was applied to correct all RT instruments’ spectra to a reference instrument (in this case the KSSL). The RT dataset is relatively small and a previous study demonstrated that around 50 samples are required to properly use SST, therefore all the subsequent analysis are employed for a constant test set size of n = 20. The SNV spectra were used for SST.

For selecting the 50 samples, Kennard-Stone algorithm was applied on raw KSSL spectra, compressed by PCA with components retaining 99% of the cumulative variance (cumvar = 99.99%), and distance metric set as Mahalanobis.

kenStone(X = spec.data.kssl, k = 50, metric = "mahal", pc = 0.9999, .center = T, .scale = T)

Subsets

Probability density distribution of soil properties from KSSL library, RT SST, and RT hold-out test sets.

About

Preprocessing and spectral standardization of ring trial MIR instruments

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages