Category: Regression

Genetic algorithm for wavelength selection using Numpy

Regression, Partial Least Squares Regression, Variable Selection 06/22/2024 Daniel Pelliccia

A implementation of a genetic algorithm for wavelength selection using basic Numpy functions.

Wavelength selection with a genetic algorithm

Data Operations and Plotting, Partial Least Squares Regression, Regression, Variable Selection 07/28/2023 Daniel Pelliccia

Wavelength selection methods aim at choosing the spectral bands that produce the best regression or classification model. Here we introduce a genetic algorithm for wavelength selection.

[Continue Reading...]

Updates and additions to the PLS Regression code

Regression, Partial Least Squares Regression, Plots and Charts 04/29/2023 Daniel Pelliccia

Updated code and additional utility scripts for PLS regression. Will keep it updated as we go.

[Continue Reading...]

Multivariate curve resolution: an introduction

Regression, Data Operations and Plotting, Multivariate Curve Resolution 03/11/2023 Daniel Pelliccia

Multivariate Curve Resolution deals with spectra, or other signals, from samples containing multiple components, and aims at recovering the pure components.

[Continue Reading...]

Understanding neural network parameters with TensorFlow in Python: the optimiser

Neural Networks, Regression 11/20/2022 Daniel Pelliccia

An introductory tutorial on optimisers for deep learning, including Python code for a regression training for NIR spectroscopy.

[Continue Reading...]

Regression optimisation with a Pipeline

Regression, Partial Least Squares Regression, Principal Components Regression 08/13/2022 Daniel Pelliccia

The process of developing and optimising a regression model, almost invariably requires a sequence of steps. These steps can be combined in a single predictor using the Pipeline function …

[Continue Reading...]

Parallel computation of loops for cross-validation analysis

Regression, Data Operations and Plotting, Partial Least Squares Regression 05/21/2022 Daniel Pelliccia

Using parallel computation to speed up cross-validation analysis for large data sets.

[Continue Reading...]

Understanding neural network parameters with TensorFlow in Python: the activation function

Neural Networks, Regression 04/09/2022 Daniel Pelliccia

Where we discuss the meaning of an activation function in neural networks, discuss a few examples, and show a comparison of neural network training with different activation functions.

[Continue Reading...]

Deep neural networks for spectral data regression with TensorFlow

Neural Networks, Regression 03/12/2022 Daniel Pelliccia

This post introduces basic Python code to build fully-connected deep neural networks with TensorFlow for regression analysis of spectral data.

[Continue Reading...]

The Akaike Information Criterion for model selection

Regression, Principal Components Regression, Regression metrics, Regression Model Validation 09/18/2021 Daniel Pelliccia

The Akaike Information Criterion (AIC) is another tool to compare prediction models. AIC combines model accuracy and parsimony in a single metric and can be used to evaluate data …

[Continue Reading...]

Minimal prediction models for linear regression

Regression 04/10/2021 Daniel Pelliccia

What is the minimum amount of information required to export and re-use a linear regression model? The answer is surprisingly simple. Here's a step by step example using PLS …

[Continue Reading...]

Backward Variable Selection for PLS regression

Regression, Partial Least Squares Regression 03/13/2021 Daniel Pelliccia

Backward Variable Selection for PLS regression is a method to discard variables that contribute poorly to the regression model. Here's a Python implementation of the method.

[Continue Reading...]

The Concordance Correlation Coefficient

Regression, Regression metrics, Regression Model Validation 01/09/2021 Daniel Pelliccia

The Concordance Correlation Coefficient (CCC) can be useful to quantify the quality of a linear regression model. In this tutorial we explain the CCC and describe its relation with …

[Continue Reading...]

Bias-Variance trade-off in PLS regression

Regression, Partial Least Squares Regression, Regression Model Validation 09/20/2020 Daniel Pelliccia

Bias-Variance trade-off refers to the optimal choice of parameters in a model in order to avoid both overfitting and underfitting. Let's look at a worked example using PLS regression.

[Continue Reading...]

Wavelength band selection with simulated annealing

Regression, Partial Least Squares Regression 08/15/2020 Daniel Pelliccia

Improve the performance of a PLS method by wavelength band selection using Simulated Annealing optimisation.

[Continue Reading...]

Principal component selection with simulated annealing

Principal Components Regression, Regression 02/09/2020 Daniel Pelliccia

Simulated annealing helps overcome some of the shortcomings of greedy algorithms. Here's a tutorial on simulated annealing for principal components selection in regression.

[Continue Reading...]

Principal component selection with a greedy algorithm

Principal Components Regression, Regression 01/28/2020 Daniel Pelliccia

Greedy algorithms are commonly used to optimise a function over a parameter space. Here's an implementation of a greedy algorithm for principal components selection in regression.

[Continue Reading...]

Moving window PLS regression

Regression, Partial Least Squares Regression 12/07/2019 Daniel Pelliccia

Not all wavelengths are created equals. A moving window PLS algorithm optimises the regression by discarding bands that are not useful for prediction.

[Continue Reading...]

K-fold and Montecarlo cross-validation vs Bootstrap: a primer

Regression, Partial Least Squares Regression, Regression Model Validation 11/15/2019 Daniel Pelliccia

Cross-validation is a standard procedure to quantify the robustness of a regression model. Compare K-Fold, Montecarlo and Bootstrap methods and learn some neat trick in the process.

[Continue Reading...]

Principal Component Regression in Python revisited

Principal Components Regression, Regression 09/10/2019 Daniel Pelliccia

Want to get more out of your principal components regression? Here's a simple hack that will give you a stunning improvement on the performance of PCR.

[Continue Reading...]

Principal Components Regression vs Ridge Regression on NIR data in Python

Principal Components Regression, Regression, Ridge Regression 10/19/2018 Daniel Pelliccia

Principal components regression is a staple of NIR analysis. Ridge regression is much used of machine learning. How do they relate? Find out in this post.

[Continue Reading...]

Outliers detection with PLS regression for NIR spectroscopy in Python

Data Operations and Plotting, Outliers Detection, Partial Least Squares Regression, Regression 09/22/2018 Daniel Pelliccia

Not every data point is created equal. In this post we'll show how to perform outliers detection with PLS regression for NIR spectroscopy in Python.

[Continue Reading...]

A variable selection method for PLS in Python

Data Operations and Plotting, Partial Least Squares Regression, Regression, Variable Selection 07/04/2018 Daniel Pelliccia

Improve the quality of your PLS regression using variable selection. This tutorial will work through a variable selection method for PLS in Python.

[Continue Reading...]

Partial Least Squares Regression in Python

Partial Least Squares Regression, Regression 06/14/2018 Daniel Pelliccia

Step by step tutorial on how to build a NIR calibration model using Partial Least Squares Regression in Python.

[Continue Reading...]

Principal Component Regression in Python

Principal Components Regression, Regression 05/12/2018 Daniel Pelliccia

An in-depth introduction to Principal Component Regression in Python using NIR data. PCR is the combination of PCA with linear regression. Check it out.

[Continue Reading...]

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.

Necessary

Always Enabled

Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Functional

Performance

Analytics

Others