Investigating the effect of different pre-treatment methods on Raman spectra recorded with different excitation wavelengths
in: Spectrochimica Acta Part A-Molecular and Biomolecular Spectroscopy (2023)
Raman reference libraries can be used for identification of components in unknown samples as Raman spectroscopy offers fingerprint information of the measured samples. Since Raman libraries often contain many different and/or highly similar spectra, it is important that the spectra are a reliable fingerprint for each compound. However, Raman spectra are highly sensitive to the experimental conditions, and the Raman spectra will change in different conditions even though the same sample is measured. Raman data pre-treatment minimizes the differences between Raman spectra arising from different experimental conditions. In this study, different combinations of pre-treatment methods are used to quantify the effect of each pre-treatment step in minimizing the differences between Raman spectra of the same sample in different experimental conditions, e.g., different excitation wavelengths. These different pre-treatment processes are evaluated for six solvents. The spectra differences between spectra recorded with three excitation wavelengths (532 nm, 633 nm, and 830 nm) are evaluated by angular difference index and the influence on a classification model is tested. The angular difference index of each spectrum after every data pre-treatment step shows a decreasing behavior. It could be demonstrated that wavenumber calibration has the largest effect on the differences between the Raman spectra. However, ω4 correction doesn’t have a significate effect in this dataset. The classification results show that the prediction accuracy is improving by doing data pre-treatment. In the dataset obtained in 633 nm a lower amount of pre-treatment steps is needed but in the dataset 830 nm more pre-treatment steps are needed for a high accuracy. The result shows that the choice of an optimal pre-treatment method or combination of methods strongly influences the analysis results, but is far from straightforward, since it depends on the characteristics of the data set and the goal of data analysis.