Determination of the Geographical Origin of Hazelnuts (Corylus avellana L.) by Near-Infrared Spectroscopy (NIR) and a Low-Level Fusion with Nuclear Magnetic Resonance (NMR)

24 September 2021, Version 1
This content is a preprint and has not undergone peer review at the time of posting.


Fourier-transform near-infrared (FT-NIR) spectroscopy was used to determine the geographical origin of 233 hazelnut samples of various varieties from five different countries (Germany, France, Georgia, Italy, Turkey). The experimental determination of the geographical origin of hazelnuts is important, because there are usually large price differences between the producer countries and thus a risk of food fraud that should not be underestimated. The present work is a feasibility study using a low-cost method, as high-field NMR and UPLC-QTOF-MS have already been used for this question. Sample sets were split with repeated nested cross validation and an ensemble of discriminant classifiers with random subspaces was used to build the classification models. By using a preprocessing strategy consisting of multiplicative scatter correction, bucketing and the mean averaging of five measured spectra per sample, a test accuracy of 90.6 ± 3.9% was achieved, which rivals results obtained with much more expensive infrastructure. The application of the feature selection approach surrogate minimal depth showed that the successful classification is mainly caused by protein signals. In addition, a low-level data fusion of the NIR and NMR data was performed to assess how well the two methods complement each other. The data fusion was compared to a complementary approach, where the classification results based on the individual NIR and NMR models were jointly examined. The data fusion performed better than the individual methods with a test accuracy of 96.6 ± 2.8%. A comparison of the outliers in all classification models shows conspicuities in always the same samples, indicating that robust classification models are obtained.


Geographical Origin
Data Fusion
Feature selection

Supplementary materials

Determination of the Geographical Origin of Hazelnuts (Corylus avellana L.) by Near-Infrared Spectroscopy (NIR) and a Mid-Level Fusion with Nuclear Magnetic Resonance (NMR)
List of samples, NIR spectrum, first and second derivative, confusion matrices of classification models, list of missclassified samples


Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.