open access publication

Article, 2024

Strategies for multivariate characterization and classification of pulps and papers by near-infrared spectroscopy

Analytica Chimica Acta, ISSN 1873-4324, 0003-2670, Volume 1318, Page 342895, 10.1016/j.aca.2024.342895

Contributors

Khaliliyan, Hajar 0000-0002-3247-2139 [1] Rinnan, Åsmund 0000-0002-7754-7720 [2] Völkel, Laura 0000-0002-5106-4288 [1] Gasteiger, Franziska [3] Mahler, Kai [4] Röder, Thomas [5] Rosenau, Thomas 0000-0002-6636-9260 [1] Potthast, Antje 0000-0003-1981-2271 [1] Böhmdorfer, Stefan 0000-0003-1400-3395 (Corresponding author) [1]

Affiliations

  1. [1] University of Natural Resources and Life Sciences
  2. [NORA names: Austria; Europe, EU; OECD];
  3. [2] University of Copenhagen
  4. [NORA names: KU University of Copenhagen; University; Denmark; Europe, EU; Nordic; OECD];
  5. [3] AustroCel Hallein GmbH Salzachtalstraße 88, 5400, Hallein, Austria
  6. [NORA names: Austria; Europe, EU; OECD];
  7. [4] Sappi Europe, Sappi Papier Holding GmbH, Brucker Strasse 21, 8101, Gratkorn, Austria
  8. [NORA names: Austria; Europe, EU; OECD];
  9. [5] Lenzing (Austria)
  10. [NORA names: Austria; Europe, EU; OECD]

Abstract

Background Multivariate calibration by Partial Least Squares (PLS) on near-infrared data has been applied successfully in several industrial sectors, including pulp and paper. The creation of multivariate calibration models relies on a set of well-characterised samples that cover the range of the intended application. However, sample sets that originate from an industrial process often show an uneven distribution of reference values. This can be addressed by curation of the reference data and the methodology for multivariate calibration. It needs to be better understood, how these approaches affect the quality and scope of the final model. Results We describe the effect of log10 transformation of the reference values, regular PLS, robust PLS, the newly introduced bin PLS, and their combinations to select more evenly distributed reference values for the quantification of five pulp characteristics (kappa number, R18, R10, cuen viscosity, and brightness; 200 samples) by near-infrared spectroscopy. The quality of the models was assessed by root mean squared error of prediction, calibration range, and coverage of sample types. The best models yielded uncertainty levels equivalent to that of the reference measurement. The optimal approach depended on the investigated reference value. Significance Robust PLS commonly gives the model with the lowest error, but this usually comes at the cost of a notably reduced calibration range. The other approaches rarely impacted the calibration range. None of them stood out as superior; their performance depended on the calibrated parameter. It is therefore worthwhile to investigate various calibration options to obtain a model that matches the requirements of the application without compromising calibration range and sample coverage.

Keywords

applications, approach, calibration, calibration model, calibration options, calibration parameters, calibration range, characteristics, characterization, classification, combination, cost, coverage, creation, curation, data, distribution of reference values, effect, error, error of prediction, industrial processes, industrial sectors, investigated reference values, least squares, levels, low error, mean squared error of prediction, measurements, methodology, model, multivariate calibration, multivariate calibration models, multivariate characterization, near-infrared data, near-infrared spectroscopy, optimization approach, options, paper, parameters, partial least squares, partially, performance, prediction, process, pulp, pulp characteristics, quality, quantification, range, reference, reference data, reference measurements, reference values, requirements, robust partial least squares, root, root mean square error, root mean square error of prediction, sample coverage, sample types, samples, scope, sector, significance, spectroscopy, square, strategies, transformation, type, uncertainty, uncertainty level, uneven distribution, values, well-characterised samples

Data Provider: Digital Science