A researcher from Lomonosov Moscow State University has developed a convolutional neural network (CNN) model for Fourier transform infrared (FT-IR) spectra recognition. The model classifies 17 functional groups and 72 coupling oscillations, reaching weighted F1 scores of 93% and 88%, respectively, and could speed up material analysis in fields such as organic chemistry, materials science, and biology.
In a recent study, a scientist from Lomonosov Moscow State University employed deep learning to streamline the analysis of Fourier transform infrared (FT-IR) spectra. This technique, which is crucial for identifying chemical compounds and assessing their structures, is traditionally labor-intensive and requires a high level of experience and expertise. Using convolutional neural networks (CNNs), Daniil S. Koshelev developed a model that simplifies this process, allowing for faster and more accurate analysis. The research was published in the journal Applied Spectroscopy (1).
Fourier transform infrared (FT-IR) spectroscopy is a common method for analyzing substances and compounds in a wide range of scientific disciplines. It measures the absorption of infrared light by chemical bonds, providing information about the functional groups and coupling oscillations within a molecule. However, interpreting FT-IR spectra can be challenging, requiring significant time and expertise. To address this, Koshelev developed a CNN-based model that automatically classifies 17 classes of functional groups and 72 classes of coupling oscillations with high precision (1).
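As a rough illustration of how a convolutional model can turn a spectrum into per-class predictions, the sketch below runs a single 1D convolution, a ReLU, global max pooling, and a sigmoid output in plain Python. It is not the architecture from the study: the kernels, weights, class descriptions, and toy spectrum are all hypothetical, and it assumes a multi-label setup in which each class receives its own independent probability.

```python
import math

def conv1d(signal, kernel):
    """Valid-mode 1D cross-correlation, the core operation of a CNN layer."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]

def relu(values):
    """Zero out negative responses."""
    return [max(0.0, v) for v in values]

def sigmoid(x):
    """Map a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def predict_classes(spectrum, kernels, weights, biases):
    """Forward pass: conv -> ReLU -> global max pool -> per-class sigmoid."""
    # One feature per kernel: its strongest response anywhere in the spectrum.
    features = [max(relu(conv1d(spectrum, kernel))) for kernel in kernels]
    # Each class has its own sigmoid, so several classes can be "on" at once.
    return [sigmoid(sum(w * f for w, f in zip(row, features)) + b)
            for row, b in zip(weights, biases)]

# Toy absorbance trace with one sharp band (hypothetical values).
spectrum = [0.0, 0.1, 0.0, 0.1, 0.9, 0.1, 0.0, 0.1, 0.0]

# Hand-set kernels standing in for learned feature detectors.
kernels = [
    [-0.5, 1.0, -0.5],        # responds to narrow peaks
    [1 / 3, 1 / 3, 1 / 3],    # responds to broad absorption
]

# Hypothetical output layer: class 0 = "sharp band", class 1 = "broad band".
weights = [[6.0, 0.0], [0.0, 6.0]]
biases = [-3.0, -3.0]

probs = predict_classes(spectrum, kernels, weights, biases)
print([round(p, 2) for p in probs])
```

In a trained network the kernels and output weights are learned from labeled spectra rather than set by hand, and many more kernels and layers are stacked.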
For this research, 14,361 FT-IR spectra of organic molecules were obtained by web scanning to build the dataset used to train the CNN model. Various CNN architectures were tested with different sizes of feature maps to optimize the model's accuracy. The resulting model achieved a weighted F1 score of 93% for functional groups and 88% for coupling oscillations (1).
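For readers unfamiliar with the metric, a weighted F1 score averages the per-class F1 scores, weighting each class by the number of true examples it has (its support). The sketch below shows the arithmetic in plain Python. The label matrices are made-up illustration data, not results from the study, and the multi-label layout (one 0/1 column per class) is an assumption about how such labels might be encoded.

```python
def f1_and_support(y_true, y_pred, c):
    """Return (F1, support) for class column c of 0/1 label matrices."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t[c] and p[c])        # true positives
    fp = sum(1 for t, p in pairs if not t[c] and p[c])    # false positives
    fn = sum(1 for t, p in pairs if t[c] and not p[c])    # false negatives
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return f1, tp + fn

def weighted_f1(y_true, y_pred):
    """Support-weighted mean of the per-class F1 scores."""
    n_classes = len(y_true[0])
    per_class = [f1_and_support(y_true, y_pred, c) for c in range(n_classes)]
    total_support = sum(support for _, support in per_class)
    if total_support == 0:
        return 0.0
    return sum(f1 * support for f1, support in per_class) / total_support

# Four hypothetical spectra, three hypothetical classes.
y_true = [[1, 0, 1], [1, 1, 0], [0, 1, 0], [1, 0, 0]]
y_pred = [[1, 0, 1], [1, 0, 0], [0, 1, 0], [1, 1, 0]]

score = weighted_f1(y_true, y_pred)
print(round(score, 4))
```

Here classes 0 and 2 are predicted perfectly (F1 = 1.0, supports 3 and 1) while class 1 has one hit, one miss, and one false alarm (F1 = 0.5, support 2), giving (3 + 1 + 1) / 6 ≈ 0.8333.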
To check that the model's predictions rest on chemically meaningful features, the study used visualization methods such as Shapley additive explanations (SHAP) and gradient-weighted class activation mapping (Grad-CAM). These tools highlight the absorption bands associated with specific functional groups or bonds, providing a deeper understanding of the model's decision-making process. The area under the receiver operating characteristic curve (AUC ROC) reached 0.98 and above for most classes, further validating the model's effectiveness (1).
AUC ROC is a metric used to evaluate the performance of a binary classification model. The ROC curve plots the true positive rate (sensitivity) against the false positive rate (1 minus specificity) at various thresholds, showing the trade-off between these rates. The AUC is the area under the ROC curve, providing a single value that summarizes the model's ability to discriminate between positive and negative instances. An AUC of 0.5 suggests the model is no better than random guessing, while an AUC of 1.0 indicates a perfect classifier. A higher AUC signifies better model performance.
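The definition above can be made concrete with a short sketch that sweeps the decision threshold, collects the (false positive rate, true positive rate) points, and integrates them with the trapezoidal rule. The labels and scores are invented for illustration. For a model with many classes, as in the study, one would presumably compute such a curve per class, treating that class as positive and everything else as negative.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) points as the threshold sweeps from high to low."""
    pos = sum(y_true)
    neg = len(y_true) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative label")
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Samples with tied scores cross the threshold together.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / neg, tp / pos))
    return points

def auc(points):
    """Area under a piecewise-linear curve (trapezoidal rule)."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Hypothetical labels (1 = group present) and model scores.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

area = auc(roc_points(y_true, scores))
print(round(area, 4))
```

In this toy case, 8 of the 9 possible positive-negative pairs are ranked correctly, so the area is 8/9 ≈ 0.8889. A classifier that gives every sample the same score yields 0.5, and one that ranks all positives above all negatives yields 1.0.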
The work represents a significant improvement over classical machine learning methods such as k-nearest neighbors, random forest, support vector machine, or multilayer perceptron classifiers, which typically achieved an overall class accuracy of only 23% (1). The newly developed CNN model not only outperforms these traditional methods but also brings a new level of automation and efficiency to FT-IR analysis.
This research has the potential to revolutionize the use of FT-IR in organic chemistry, materials science, and biology. By automating the analysis of FT-IR spectra, the model could save valuable time for scientists and enhance the reliability of results. The authors suggest that the model can be used to facilitate the preparation of experimental data for publication, thereby streamlining the research process (1).
The study opens the door to creating software tools based on this AI-driven model, allowing for more efficient and accurate FT-IR analysis (1–3). These advancements could lead to new applications in environmental science, quality control, and other fields where chemical analysis is critical (2,3).
References
(1) Koshelev, D. S. Expert System for Fourier Transform Infrared Spectra Recognition Based on a Convolutional Neural Network With Multiclass Classification. Appl. Spectrosc. 2024, 78 (4), 387–397. https://doi.org/10.1177/00037028241226732
(2) Workman, J., Jr.; Mark, H. Artificial Intelligence in Analytical Spectroscopy, Part I: Basic Concepts and Discussion. Spectroscopy 2023, 38 (2), 13–22. DOI: 10.56530/spectroscopy.og4284z8
(3) Workman, J., Jr.; Mark, H. Artificial Intelligence in Analytical Spectroscopy, Part II: Examples in Spectroscopy. Spectroscopy 2023, 38 (6), 10–15. DOI: 10.56530/spectroscopy.js8781e3