New AI Model Accelerates Infrared Imaging for Protein Analysis, Study Reveals

News
Article

A new study led by Ayanjeet Ghosh of the University of Alabama presents a transformative approach to analyzing infrared (IR) imaging data, with promising implications for neurodegenerative disease research.

Discrete frequency infrared (IR) imaging is a technique that allows scientists to visualize the chemical makeup of biological samples by focusing on specific frequencies (1,2). It has gained significant attention for its potential in biomedical science. In a recent study published in Applied Spectroscopy, lead investigator Ayanjeet Ghosh of the University of Alabama collaborated with researchers from his university as well as the University of Illinois Urbana-Champaign to develop and propose a new two-step regressive neural network (NN) model to improve the analysis of protein structures in tissue samples (1).

Discrete frequency IR imaging has been limited in being applied broadly because it is time-consuming (1). Another challenge is that using it for computational spectral analysis is expensive, especially when it is used to analyze tissue samples from patients with serious illnesses, such as Alzheimer’s and Parkinson’s disease (1). Traditionally, analyzing protein structures in tissue samples involves what’s known as Gaussian band fitting. This technique breaks down IR spectra to determine the relative presence of protein secondary structures (1).

Microscopic Protein Structure 3D Render, Molecular Biology, Science. Generated by AI. | Image Credit: © dominic - stock.adobe.com

Microscopic Protein Structure 3D Render, Molecular Biology, Science. Generated by AI. | Image Credit: © dominic - stock.adobe.com

However, performing this analysis on images that contain millions of pixels quickly becomes infeasible. The volume of data and the complexity of curve fitting demand significant processing time, making it impractical for widespread clinical or research use (1).

The new model proposed by Ghosh and his colleagues attempts to rectify these two major issues. According to Ghosh, the two-step regressive neural network model he developed with his team is designed to achieve the same insights as traditional spectral fitting at a fraction of the time and computational cost.

“Our two-step neural network is designed to perform two key steps necessary for the quantification of protein secondary structures from discrete frequency data,” Ghosh said to Spectroscopy. “It reconstructs the full spectra from seven wavenumbers, and it then predicts areas under the curve (AUCs) of underlying spectral components for structural quantification, which is typically done using band fitting.”

The researchers’ approach required data from just seven discrete wavenumbers rather than densely sampled spectral data. The model then performs an interpolation step to reconstruct the full spectral profile and predict the AUCs for protein components. Over the course of their study, they found that this process was found to be over 3,000 times faster than traditional spectral fitting (1).

Ghosh also discussed that their model does not rely on high-density spectral sampling. As a result, the data sets are smaller, more manageable, and less demanding in terms of storage and processing power. Importantly, the predictive accuracy of the model closely matched the results of traditional methods, meaning researchers can trust the insights generated by the neural network (1).

“To assess robustness under varying signal-to-noise (S/N) ratio conditions, the model’s performance was compared against traditional Gaussian fitting,” Ghosh said to Spectroscopy. “Across different S/N ratio levels, the model consistently outperformed Gaussian fitting, showing lower relative mean absolute error (MAE) and less degradation in predictive accuracy as noise increased.”

By significantly streamlining both data acquisition and interpretation, the work of Ghosh and his collaborators at The University of Alabama and the University of Illinois Urbana-Champaign indicates a potential way for biomedical researchers to approach protein structural analysis in complex biological samples.

Spectroscopy sat down with Ayanjeet Ghosh, as well as his coauthor Rohit Bhargava, to discuss their study more in-depth. Over the next week, we will release the three-part interview. In Part I, Ghosh and Bhargava will discuss their interest in using discrete frequency infrared (IR) imaging to analyze protein secondary structure. Part II will discuss how machine learning (ML) is used in data analysis and go into more detail about the model they developed in their study. Part III will see Ghosh and Bhargava discuss the key performance metrics they used to evaluate their model, and what the future of neurodegenerative disease research looks like.

We are looking forward to sharing their insights with all of you soon.

References

  1. Edmonds, H.; Mukherjee, S. S.; Holcombe, B.; et al. Quantification of Protein Secondary Structures from Discrete Frequency Infrared Images Using Machine Learning. Appl. Spectrosc. 2025, ASAP. DOI: 10.1177/00037028251325553
  2. Hughes, C.; Clemens, G., Bird, B.; et al. Introducing Discrete Frequency Infrared Technology for High-Throughput Biofluid Screening. Sci. Rep. 2016, 6, 20173. DOI: 10.1038/srep20173
Recent Videos
North Lake Tahoe Sunset | Image Credit: © adonis_abril - stock.adobe.com
Beautiful Day in Lake Tahoe, California | Image Credit: Jeremy Janus - stock.adobe.com
Sand Harbor Lake Tahoe Nevada | Image Credit: © Stephen - stock.adobe.com.
Baltimore Downtown Skyline Panorama | Image Credit: © Stefan - stock.adobe.com
Hand scooping up a mixture of sand and microplastics from the shore, theme of pollution. Generated using AI. | Image Credit: © nabila - stock.adobe.com.
Jeanette Grasselli Brown 
Jeanette Grasselli Brown 
Jeanette Grasselli Brown 
Related Content