This study is an important contribution to the field of machine learning-enabled NIR spectroscopy, offering researchers a systematic method for selecting representative subsamples from existing data with quality measures, diagnostic tools, and visualization techniques.
Researchers from Graz University of Technology and Christ University have presented a systematic method for choosing representative subsamples from existing research with an extensive set of quality measures and a visualization strategy. In their article, published in AAPS PharmSciTech, Amrit Paudel and Gobi Ramasamy describe the systematic and structured procedure for selecting subsamples from the historical data (1). They offer a wide range of in-depth quality measures, diagnostic tools, and visualization techniques.
The study used an open-source tablet data set that consists of different doses in milligrams, different shapes, and sizes of dosage forms, slots in tablets, three different manufacturing scales (laboratory, pilot, production), coating differences (coated vs. uncoated), and more. The model was developed on one scale, and the researchers investigated how well the top models are transferable when tested on new data like pilot-scale or production (full) scale.
The researchers demonstrated the selection of appropriate hyperparameters and their impact on the artificial neural network-multilayer perceptron (ANN-MLP) model performance. The choice of hyperparameter tuning approaches and performance with available references are discussed for the data under investigation. The model extension from laboratory-scale to pilot-scale was successfully demonstrated.
ANN-MLP is a type of artificial neural network that is widely used for supervised learning. It is a feedforward neural network with multiple layers of neurons, including an input layer, one or more hidden layers, and an output layer. Each neuron in the network receives input from the previous layer, performs a mathematical operation on the input, and then passes the result to the next layer. ANN-MLP is used for a variety of applications such as database exploration, calibration modeling, image recognition, speech recognition, and natural language processing.
Near-infrared (NIR) spectroscopy is non-destructive and non-intrusive, requires little to no sample preparation, and its overall analysis time may be considerably reduced, making it an ideal real-time analytical tool. This technique is used primarily in the pharmaceutical, agriculture, food and dairy, cosmetics, pulp and paper, and precision medicine industries.
Derivatization, normalization, scatter correction, and advanced approaches are a few of the data pre-processing techniques used to conceal physical information and retrieve chemically related information from NIR data. Modelling is employed after physical/chemical information has been segmented. Principal component regression (PCR) and partial least squares regression (PLS) are used in multivariate linear models.
This study is an important contribution to the field of machine learning-enabled NIR spectroscopy, offering researchers a systematic method for selecting representative subsamples from existing data with quality measures, diagnostic tools, and visualization techniques. The research provides a framework for choosing appropriate hyperparameters and demonstrates the extension of models from laboratory-scale to pilot-scale.
(1) Ali, H.; Muthudoss, P.; Ramalingam, M.; Kanakaraj, L.; Paudel, A.; Ramasamy, G. Machine Learning–Enabled NIR Spectroscopy. Part 2: Workflow for Selecting a Subset of Samples from Publicly Accessible Data. AAPS PharmSciTech. 2023, 24, 34. https://link.springer.com/article/10.1208/s12249-022-02493-5
New Study on Edible Oil Analysis Integrates FT-NIR and Machine Learning
January 14th 2025A new study published in Food Control introduces an approach for assessing antioxidant levels in edible oils using artificial intelligence and spectroscopy, offering significant potential for improving food quality control.
Best of the Week: Seed Vigor, Flower Classification, Emerging Leader in Atomic Spectroscopy
January 10th 2025Top articles published this week include two peer-reviewed articles that explore optical detection technology for seed vigor and classifying flowers, as well as a profile on Benjamin Manard, who was recognized as the winner of the 2025 Emerging Leader in Atomic Spectroscopy.
Evaluation and Development Trends of Optical Detection Technology for Seed Vigor
In this article, the basic principles, advantages, and limitations of different optical techniques for obtaining seed vigor estimates are introduced and reviewed, and the key technology of non-destructive optical detection of single seeds will be discussed.
Measuring Soil Potassium with Near-Infrared Spectroscopy
January 8th 2025Researchers have developed a novel three-step hybrid variable selection strategy, SiPLS-RF(VIM)-IMIV, to enhance the accuracy and efficiency of soil potassium measurement using near-infrared spectroscopy, offering significant advancements for precision agriculture and real-time soil monitoring.