Next Article in Journal
Breathing Right… or Left! The Effects of Unilateral Nostril Breathing on Psychological and Cognitive Wellbeing: A Pilot Study
Previous Article in Journal
A Pilot Study on a Possible Mechanism behind Olfactory Dysfunction in Parkinson’s Disease: The Association of TAAR1 Downregulation with Neuronal Loss and Inflammation along Olfactory Pathway
Previous Article in Special Issue
Sex Differences in Conversion Risk from Mild Cognitive Impairment to Alzheimer’s Disease: An Explainable Machine Learning Study with Random Survival Forests and SHAP
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning-Assisted Classification of Paraffin-Embedded Brain Tumors with Raman Spectroscopy

by
Gilbert Georg Klamminger
1,2,3,*,†,
Laurent Mombaerts
4,†,
Françoise Kemp
4,
Finn Jelke
5,6,
Karoline Klein
5,7,
Rédouane Slimani
6,8,
Giulia Mirizzi
5,7,
Andreas Husch
5,9,
Frank Hertel
5,7,
Michel Mittelbronn
4,8,9,10,11 and
Felix B. Kleine Borgmann
3,5,8,12,*
1
Department of General and Special Pathology, Saarland University (USAAR), 66424 Homburg, Germany
2
Department of General and Special Pathology, Saarland University Medical Center (UKS), 66424 Homburg, Germany
3
National Center of Pathology (NCP), Laboratoire National de Santé (LNS), 3555 Dudelange, Luxembourg
4
Luxembourg Center of Neuropathology (LCNP), 3555 Dudelange, Luxembourg
5
National Center of Neurosurgery, Centre Hospitalier de Luxembourg (CHL), 1210 Luxembourg, Luxembourg
6
Doctoral School in Science and Engineering (DSSE), University of Luxembourg (UL), 4362 Esch-sur-Alzette, Luxembourg
7
Faculty of Medicine, Saarland University (USAAR), 66424 Homburg, Germany
8
Department of Cancer Research (DoCR), Luxembourg Institute of Health (LIH), 1210 Luxembourg, Luxembourg
9
Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg (UL), 4362 Esch-sur-Alzette, Luxembourg
10
Department of Life Sciences and Medicine (DLSM), University of Luxembourg, 4365 Esch-sur-Alzette, Luxembourg
11
Faculty of Science, Technology and Medicine (FSTM), University of Luxembourg, 4365 Esch-sur-Alzette, Luxembourg
12
Hôpitaux Robert Schuman, 1130 Luxembourg, Luxembourg
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Submission received: 26 February 2024 / Revised: 15 March 2024 / Accepted: 22 March 2024 / Published: 23 March 2024
(This article belongs to the Special Issue Neuroscience Meets Artificial Intelligence)

Abstract

:
Raman spectroscopy (RS) has demonstrated its utility in neurooncological diagnostics, spanning from intraoperative tumor detection to the analysis of tissue samples peri- and postoperatively. In this study, we employed Raman spectroscopy (RS) to monitor alterations in the molecular vibrational characteristics of a broad range of formalin-fixed, paraffin-embedded (FFPE) intracranial neoplasms (including primary brain tumors and meningiomas, as well as brain metastases) and considered specific challenges when employing RS on FFPE tissue during the routine neuropathological workflow. We spectroscopically measured 82 intracranial neoplasms on CaF2 slides (in total, 679 individual measurements) and set up a machine learning framework to classify spectral characteristics by splitting our data into training cohorts and external validation cohorts. The effectiveness of our machine learning algorithms was assessed by using common performance metrics such as AUROC and AUPR values. With our trained random forest algorithms, we distinguished among various types of gliomas and identified the primary origin in cases of brain metastases. Moreover, we spectroscopically diagnosed tumor types by using biopsy fragments of pure necrotic tissue, a task unattainable through conventional light microscopy. In order to address misclassifications and enhance the assessment of our models, we sought out significant Raman bands suitable for tumor identification. Through the validation phase, we affirmed a considerable complexity within the spectroscopic data, potentially arising not only from the biological tissue subjected to a rigorous chemical procedure but also from residual components of the fixation and paraffin-embedding process. The present study demonstrates not only the potential applications but also the constraints of RS as a diagnostic tool in neuropathology, considering the challenges associated with conducting vibrational spectroscopic analysis on formalin-fixed, paraffin-embedded (FFPE) tissue.

1. Introduction

Approaches to more precise and observer-independent tissue diagnostics have been largely developed in the last decades—evolving from the first use of immunohistochemistry (1942) to the implementation of genetic parameters in tumor diagnostic to methylome-based tumor class prediction in the current WHO brain tumor classification [1,2]. To date, histopathological tissue analysis in combination with epigenetic tumor classification is the defined “gold-standard” of brain tumor diagnosis, enabling not only solid consecutive treatment decisions but also evidence-based prognoses. As an alternative attempt for reliable tumor diagnostics, the vibrational spectroscopic technique named Raman spectroscopy (RS) has gained increased research interest in the neurooncological field. This optical method detects changes in the molecular vibrational state of tissue samples and thus uses spectral tissue properties to, e.g., determine tumor origin or to gain insights into tumor biology [3]. So far, the practical potential of this method in combination with supervised and unsupervised machine learning algorithms has been shown not only for the classification of tumor types and distinct histomorphological tumor areas but also for the differentiation of the tumor grade or the detection of tumor margins/infiltration zone [4,5,6,7]. Most recently, Zhang et al. yielded >80% accuracy with support vector machine-based intraoperative differentiation of glioma tissue and healthy control, and Romanishkin et al. reported a support vector machine-based accuracy in detecting glioblastoma tissue (regardless of the histologically proven sampling area, namely, central tumor core or tumor edge) of 83% [8,9]; by using unprocessed samples of pediatric brain tumors, even an accuracy of 86.2% was achieved when classifying between low-grade gliomas and normal brain by employing a logistic regression model [10]. Although, commonly, fresh tissue specimens are spectroscopically examined intraoperatively in vivo [11] by using a handheld Raman probe or ex vivo [4,12] by using advanced Raman imaging techniques, e.g., Stimulated Raman Histology, other approaches aim to establish RS on formalin-fixed or formalin-fixed and paraffin-embedded (FFPE) tissue. The latter is accompanied by challenges such as degradation/fragmentation of nucleic acids and protein cross-linking within FFPE tissue [13,14,15,16], strong spectral signal of paraffin wax adulterating the biological Raman bands [17], modification of biological lipids related to the dewaxing process [18], and occurrence of fluorescence due to silicon-based glass slides [13], which should be addressed pre-experimentally and also considered during data processing and analysis. Nevertheless, RS has been successfully applied on FFPE tissue to classify distinct tumor types [19,20], to predict the genetic status of IDH-mutant/wildtype astrocytomas [21], and to distinguish certain histological areas in the morphological heterogenous glioblastoma based on individual spectral properties [22]. In order to further exploit the potential of RS as an additional method in neuropathology, we acquired the Raman spectra of distinct histomorphological areas (vital tumor and necrosis) of a broad range of intracranial neoplasms and classified them using an in-house-built machine learning pipeline. Within our approach, we did not solely aim at the discrimination of different tumor groups; we further evaluated the question of whether distinct histomorphological areas remain detectable due to their spectroscopic behavior even when compared with areas of another tumor type.

2. Materials and Methods

2.1. Patient Data

Sample specimens of 82 primary brain tumor cases (astrocytoma IDH mutant, oligodendroglioma 1p/19q co-deleted, ependymoma, glioblastoma IDH wildtype; see Supplementary Table S1 for clinical data), meningeal tumors (transitional/meningothelial meningiomas), and brain metastases of different primary origin (breast, colorectal, and lung (non-small cell lung cancer (NSCLC)) carcinomas) were examined and measured by means of RS. See Table 1 for an overview of the included tumor groups/types and the corresponding Raman measurements carried out. The samples included are part of the INSITU® study (approved by the “Comité National d’Ethique de Recherche’”, No. 201804/08) in Luxembourg, and all patients involved gave their written formal consent for tissue usage. The spectroscopic examination of tissue samples was exclusively carried out at Laboratoire national de santé (LNS; Luxembourg) after routine neuropathological diagnostic workup (light microscopy, IHC, and genetic and epigenetic testing) by a board-certified neuropathologist (M.M.).

2.2. Sample Preparation

FFPE tissue blocks were selected according to the neuropathological diagnosis and histomorphological description; thereupon, the tissue blocks were sequentially cut two times (each section of 7 μm), using a standard microtome. While the first section was stained (H&E—hematoxylin and eosin) and thus amenable to light microscopic examination, the second section was not stained to avoid additional negative impact on the biological spectral signal. Instead of using a silicon-based glass slide, which would have affected the Raman measurements due to its strong fluorescence background, the second section was placed on a CaF2 slide (calcium fluoride; Crystran, Poole, UK) [13]. With its beneficial property of only one single Raman band at 321 cm−1, the CaF2 slide allows for the spectroscopic assessment of biological tissue [23,24]. In a next step, the tumor sample of the second section was subjected to a deparaffinization process, in which residual paraffin wax was physically (by briefly heating it) and chemically (by means of repetitive xylene baths) reduced. This step is crucial due to the strong Raman signal (distinct bands at 1063, 1133, 1296, and 1441 cm−1) of paraffin wax, which would potentially overlap with the genuine Raman spectrum of the tumor sample [17,25]. See Supplementary Table S2 for the detailed deparaffination protocol. Finally, the H&E-stained section was light-microscopically analyzed, and the histomorphological areas of interest (vital tissue and necrosis) were marked with a fine marker pen. This marking on the H&E slide allows for an accessible transfer of the localization of the desired tumor area to the subsequent unstained slide, where the corresponding area could also be marked and afterwards submitted to spectroscopic examination. Post-experimentally, the reusable CaF2 slides were hand-washed with a rinsing agent (containing, among others, surface-active agents and ethanol) and warm water until remaining tumor tissue was no longer visible; this was performed in between measurements for quality assurance by means of spectroscopy exclusion.
In this study, two distinct histomorphological areas of interest were defined. The first area was a vital tumor zone, while the second one consisted only of necroses. Due to its occurrence in only a few tumor types, it was spectroscopically assessed only whenever feasible (i.e., glioblastoma and brain metastases). The area of necrosis was defined as an eosinophilic area lacking vital tumor cell nuclei.

2.3. Data Acquisition and Raman Spectroscopy

After pre-experimental preparations were completed, the CaF2 slides with the unstained tissue were submitted to spectroscopic measurements. For the accurate selection of the measurement point, the laser spot of the Raman spectrometer was placed within the marked area in a visually controlled manner. The actual RS measurement was performed in an in-house-built dark chamber with the aim to reduce external sources of light interference, thereby preserving the biological RS signal [26,27]. With a laser spot size of 100 μm, a lens including a working distance of 7 mm and the chosen acquisition parameters of 10 s, 30 averages, and 90 mW (laser power) per measurement, we were able to achieve a fast readout. Therefore, in our approach, a marked area was measured repeatedly, but at different spots within one area. All Raman measurements were conducted using the portable ProRaman-L high-performance Raman spectrometer (TSI, Shoreview, MN, USA) including a CCD detector system; see Supplementary Figure S1. For data acquisition, the default ProRaman Reader software, version 8.3.6 (TSI, Shoreview, MN), was employed, and data smoothing was applied directly after each measurement by using a default Savitzky–Golay filter. All data were stored as .spc files and made accessible to data analysis in a next step. See Figure 1 for a general overview of our study protocol.

2.4. Machine Learning

Starting with data preprocessing, all measurements were visualized (Supplementary Figure S2 displays all raw Raman Spectra sorted by tumor type) and internally checked for outliers due to non-biological confounding factors, e.g., cosmic ray artifacts and hot pixels [28]. In a next step, input data were standardized and trends removed using a Savitzky–Golay filter; afterwards, the Raman spectra were cropped to the range between 500 cm−1 and 2000 cm−1. This spectral range, also known as the “biological area”, is widely used in spectroscopic studies due to a great contribution of Raman bands resulting from underlying biological tissue. See Figure 2 for mean Raman spectra of primary brain tumors, meningiomas, and brain metastases. For deeper machine learning-based analysis, we used an internally written Python script in which we set up a machine learning pipeline to enable iterative classification with divergent input data based on each emerging classification task. In this manner, different research questions can be approached with consistent, a priori-selected parameters and types of validation. To avoid patient-dependent clustering, the input data were split in a training cohort (75% of data) and an external validation cohort (25% of data) containing only Raman measurements of different patients. In the course of this, patient identity did not serve as prediction output but rather for confounder variable control; within the scope of the study, strictly the identity of the area measured by one single measurement was determined as output. After the training of random forest (RF) classification, containing internal random cross-validation, as the selected model within our pipeline, the resulting algorithm’s performance was assessed. To do so, the accuracy, the ROC (receiver operating characteristic) curve, the corresponding AUROC (area under receiver operating curve) value, and the PR (precision–recall) curve, including the corresponding AUPR value, were plotted for each classification decision threshold; furthermore, the misclassification ratio (misclassified counts relative to total measurements per class) and the scores of the commonly used metrics precision (positive predictive value), recall (sensitivity), and F1-score (mean of precision and recall) were reported. To gain a deeper insight into model performance, the macro average (average based on all individual classes equally) and the weighted average (average considering the quantities of measurements in each class) were calculated for each of the three metrics. See Supplementary Material for a more detailed description of our machine learning workflow.

3. Results

3.1. Multi-Class Classification for Discrimination of Tumor Origin

The optimized RF model was first trained to address an initial multi-class classification task. The pathologically determined origin of each tumor served as class and thus as histological ground truth for the model; in our case, this resulted in the following three classes: primary brain tumors (here: gliomas), carcinoma brain metastases, and meningeal tumors (here: meningiomas). At this point, only RS measurements of vital tumor tissue were considered. The splitting numbers for the training and the external validation cohorts, as well as the total number of patients and measurements, are displayed in Supplementary Table S3. The trained model was capable of detecting differences in the spectral properties of tumors of varying origin with a macro-average AUROC value of 0.76 (AUROC value for each class: glioma, 0.84; carcinoma brain metastases, 0.76; meningioma, 0.66) and an overall accuracy of 63%. See Figure 3A for the resulting ROC and PR curves, as well as the AUROC and AUPR values. A confusion matrix with predicted observations in the test set and the corresponding metrics of precision, recall, and F1-score are shown in Figure 3B,C. The primary brain tumor group and the carcinoma brain metastasis group can be clearly distinguished as individual classes based on their spectral properties (misclassification ratios of 0.11 and 0.52, respectively), while the meningioma class was frequently misclassified, with a ratio 0.85. To investigate this distinction in more detail, we repeated the model training process with the same measurements of primary brain tumors and carcinoma brain metastases as input data; this time, each tumor subclass (astrocytoma, oligodendroglioma, ependymoma, glioblastoma, NSCLC metastasis, colorectal carcinoma metastasis, and breast carcinoma metastasis) served as the histological ground truth and thus as class (output) in our learning model. Even in this multi-class (seven classes) learning approach, the primary side of primary brain tumors and carcinoma brain metastases is still reliably detectable, since misclassifications between the tumor types occur most likely within their biological tumor origin (Figure 3D and Supplementary Figures S3 and S4).

3.2. A Practical Approach: Carcinoma Metastases and Glioma Classifier

Upon microscopical identification of a brain tumor as likely metastatic in origin, the question of the primary origin arises. This question is not easily addressed by means of basic light microscopy and frequently requires immunohistochemistry and further molecular pathology. Similarly, in case of a glial tumor, the identification of different tumor types is sometimes difficult by means of light microscopy alone. Fast and label-free identification based on spectral properties could be very helpful. We, therefore, used all the acquired measurements of vital tumor areas of carcinoma brain metastases (Supplementary Table S4) to establish a classifier within our machine learning pipeline to determine the primary tumor origin (lung/breast/colorectal) only by means of spectroscopy. Our trained RF algorithm was able to distinguish between different carcinoma metastases with a macro-average AUROC value of 0.87 (AUROC value for each class: breast carcinoma metastases, 1.00; colorectal cancer metastases, 0.80; and NSCLC, 0.76) and an overall accuracy of 68%, predicting the class of brain metastases of breast tumors with highest scores (precision 1.00, recall 0.83, misclassification ratio 0.17); see Figure 4A for the corresponding ROC/PR curve and confusion matrix. Focusing on the detection of the glioma tumor type and thus also on tumor genetics (IDH mutation in the case of IDH mutant astrocytoma, lack of IDH mutation in the case of IDH wildtype glioblastoma, and 1p/19q co-deletion in the case of oligodendroglioma [29]; see Table 1 and Supplementary Table S1), we applied RS to different grades and types of glioma specimens (input data; Supplementary Table S5) and were able to predict IDH mutant astrocytomas (AUROC of 0.92, recall of 0.72, precision of 0.57, and misclassification ratio of 0.28) and 1p/19q co-deleted oligodendrogliomas (AUROC of 0.86, recall of 0.50, precision of 0.58, and misclassification ratio of 0.50) in a multi-class classification approach. IDH wildtype glioblastoma and ependymomas were detected with AUROC values of 0.79 and 0.91, respectively; the misclassification ratio was 0.81 for the glioblastoma class and 0.00 for the ependymoma class; see Figure 4B and Supplementary Figures S5 and S6.

3.3. A Practical Approach: Classification of Tumor Necrosis

In case the resected tumor samples available for pathological analysis turn out to be purely necrotic during light microscopical examination (e.g., biopsy from central region of a glioblastoma), repeated resectioning or biopsy is necessary for adequate pathological diagnostics. Hence, we evaluated the question of whether RS would be suitable for determining the underlying tumor origin in case of purely necrotic tissue; therefore, we spectroscopically examined areas of necrosis in glioblastoma and in carcinoma brain metastases. By using the established computational learning pipeline, a binary RF classification was able to determine the tumor origin solely based on the spectroscopic properties of fragments of tumor necrosis with an overall accuracy of 92% (macro-average AUROC vale of 0.98) at the best classification threshold. Figure 5 provides a visualization of this practical classification approach and shows the performance evaluation in more detail (see additionally Supplementary Figure S7). Since both tumor groups have shown distinct spectral properties usable for tumor detection on vital tissue in recent analyses (see above), we investigated whether not only the tumor origin (glioma/brain metastasis) but also the histomorphological appearance (vital tumor/necrosis) of each tumor type remains predictable due to an inherent spectroscopic behavior. Indeed, we could show that all four classes (vital brain metastases, necrosis of brain metastases, vital glioma, and necrosis of glioblastoma) were separable based only on their spectroscopic features (macro-average AUROC vale of 89%; see especially Supplementary Figure S8 for more detailed performance metrics). For a general data overview, refer to Supplementary Table S6.
In summary, our results prove the general feasibility of utilizing FFPE tumor samples for vibrational spectroscopic examinations. Moreover, we show that rapid spectroscopic analysis and subsequent identification of the biological origin of tumors remain feasible though dependent on the specific classification task.

4. Discussion

In this study, we demonstrated that the employment of RS on FFPE tissue in conjunction with machine learning algorithms allows for a rough discrimination of tumor origin and tumor types depending on the use case. We showed that spectroscopic classification proved feasible even based on necrotic tissue fragments alone. A further development of this technique could prove highly practical in those cases where advanced and time-consuming methods of histopathology would need to be employed and possibilities of initial H&E staining are already exploited. Notably, the fact that RS can be used on purely necrotic samples opens avenues of diagnostics where molecular approaches fail. In our study, necrosis stood out as a separate class both between and within tumor classes; it was possible to distinguish both the neoplastic origin of necrotic tissue and the difference between necrotic and vital tissue within one tumor class. Our findings are in line with previous studies examining tumor necroses by means of optical spectroscopy; already in 2007, Amharref et al. described the presence of spectroscopic features of necrotic tumor areas [30]. Subsequently, Kalkanis et al. determined the good separability of normal brain tissue, glioblastoma, and necrosis with an overall accuracy of 97.8% (accuracy of 77.5% in the case of artifact inclusion) based on a supervised discriminant function analysis in frozen tissue sections; Kast et al. established Raman-based images of different brain regions (including tumor center and necrosis), and our group recently described the solid separability of morphological glioblastoma areas (peritumoral area, tumor core, and necrosis) in FFPE tissue samples with an overall accuracy of 70.5% employing support vector machine based-classification [7,22,31]. Additional analyses are required to prove if our attempt also holds true for the detection of necrosis in a multi-class classification approach containing various types of necrosis (e.g., necrosis of primary CNS lymphoma, necrosis post radiation treatment, and necrosis in non-cancerous diseases). Prospectively, the spectroscopic behavior of peritumoral infiltration zones and non-tumorous brain tissue, which is not greatly available since brain tumor surgery aims for maximal tumor resection with corresponding minimal loss of healthy brain function, also needs to be profoundly integrated in such an approach. On that note, a study conducted by the team of Jermyn et al. already evaluated the spectroscopic properties of unprocessed peritumoral brain tissue and the minimal resolution of vibrational spectroscopy for tumor cell identification, showing an overall accuracy of 92% in detecting glioma cells spectroscopically, which resulted in the spectroscopic recognition of tumor cell margins up to ~3.7 cm or ~2.4 cm (depending on the MRI sequence) more precise than MRI T1- and T2-weighted tumor margin detection, respectively [32].
Our results demonstrate that RS can potentially serve as a label-free and easy-to-apply technique for the determination of the tumor type or primary tumor origin even when using FFPE tissue; this fast approach for tissue assessment can be applied without awaiting further molecular analysis. Our presented performance metrics are in alignment with the limited evidence known so far. Fullwood et al. published a principal component-based linear discriminant analysis when applying RS on FFPE tissue and differentiated among different primary sites of brain tumor metastasis (breast/colon/melanoma/lung/esophagus and stomach) with 63% correctly classified spectra overall and 72% correctly classified spectra when differentiating among glioblastoma, brain metastasis, and normal brain tissue; the latter could even be improved when averaging spectroscopic measurements [20]. Within our classification models, an internal classification tendency towards the glioma group can be detected, and a more accurate classification of IDH mutant astrocytomas and ependymomas than glioblastoma can be determined when solely classifying different types of gliomas. The latter is in line with the heterogenous nature of glioblastoma, which makes classification as an individual class difficult, as we previously described in unprocessed glioblastoma specimens as well as FFPE tissue [22,33]. In comparison to our reported glioma classifier aiming at differentiating among a broad range of primary brain tumors, Quesnel et al. used RS and support vector machine-based analysis to differentiate among different grades of gliomas and between tumors with the IDH1 mutation or IDH wildtype, with reported accuracies between 75% and 85% [34].
Since we deal with strongly chemically affected and degraded FFPE tissue, several circumstances and limitations need to be considered when evaluating classification performance and addressing occurring misclassifications.
One possibility is that the subsequent loss of biological substance classes useful for tumor determination in native tissue [5], such as lipids and small molecules, reduces tumor-specific spectroscopic properties and accordingly limits accurate class assignment. Additionally, residues of paraffin wax, as well as tissue effects related to the fixation process, may potentially affect each spectrum in a uniform manner and consecutively hamper spectral individuality. Besides promising results for the detection of gliomas and brain metastases, the group of meningeal tumors (meningothelial and transitional meningiomas) does not allow for comparable spectroscopic tissue identification in our study. Nevertheless, other studies on native/frozen tissue samples do describe spectroscopic properties of a meningioma tumor class not only in comparison to dura mater [35,36] but also among different grades [37,38] or in comparison to smear diagnostics [26]. Within our initial three-class classification approach, the inherent susceptibility of the collagen-rich matrix of meningiomas to technical artifacts arising, for example, from the fixation/deparaffination process may have contributed to the reported misclassification rate. Besides classification, RS can provide valuable insights into tumor biology. The spectroscopic elements (i.e., Raman bands at a distinct wavenumber) representing important features of a classification model can hint at a molecule of origin. As an example, the spectral footprint of the protein/lipid ratio in glioma was used as a predictor of malignancy [5]. We analyzed our trained models for feature importance and found that our models used a wide range of features throughout the whole spectral range (see Supplementary Figure S9 for the listed Raman bands and corresponding importance for classification), indicating high complexity within the acquired Raman data. Due to the use of chemically treated tissue samples and the spectral complexity observed, we refrain from interpreting Raman bands and defining a spectrum as a diagnostic “fingerprint” for classification (diagnostic approach). In order to further analyze the underlying biochemistry of a sample (exploratory approach), the employment of RS on native, unprocessed, or cryofixed tissue proves favorable due to absence of preceding sample modification [13,39].

5. Conclusions

In this study, we did not only aim to assess the application of RS during the routine neuropathological workflow but also determined distinct special use cases in which RS may potentially play an advantageous future role in pathological diagnostics. Since all measurements were conducted on biologically highly altered FFPE tumor tissue, our results indicate, as a proof of concept, not only that FFPE tissue samples are generally suitable for vibrational spectroscopic examinations but also that rapid spectroscopical examination and successive determination of biological tumor origin is still feasible—though in accordance with the classification task. A fast insight into tumor origin directly after H&E microscopy potentially helps to communicate preliminary findings to the treating oncologist, focusing the search for primary tumor origin more efficiently without having to wait for more advanced and precise techniques. Using RS, this can be done even on fully necrotic resection material that is otherwise not usable for molecular and histopathological approaches.

Supplementary Materials

The following supporting information can be downloaded at: https://0-www-mdpi-com.brum.beds.ac.uk/article/10.3390/brainsci14040301/s1, Figure S1: Raman spectrometer, Figure S2: Raw Raman spectra, Figure S3: AUROC and AUPR values of the 7-class learning approach, Figure S4: Precision, recall, and F1-score of the 7-class learning approach, Figure S5: Accuracy, precision, recall, and F1-score of the carcinoma brain metastases classifier, Figure S6: Accuracy, precision, recall, and F1-score of glioma classifier, Figure S7: Accuracy, precision, recall, and F1-score of binary necrosis classifier, Figure S8: Multi-class vital area vs. necrosis classifier, Figure S9: Raman bands and corresponding feature importance, Table S1: Clinical information of measured glioma samples, Table S2: Deparaffinization protocol, Table S3: Data details for initial 3-class analysis, Table S4: Carcinoma classifier data, Table S5: Glioma classifier data, Table S6: Multi-class necrosis vs. vital tissue data.

Author Contributions

Conceptualization, G.G.K. and F.B.K.B.; Formal analysis, L.M.; Funding acquisition, A.H., M.M., and F.B.K.B.; Investigation, G.G.K., F.J., K.K., R.S., G.M., F.H., and M.M.; Methodology, G.G.K. and F.B.K.B.; Project administration, F.B.K.B.; Software, L.M., F.K., and A.H.; Supervision, A.H., F.H., M.M., and F.B.K.B.; Validation, L.M. and F.K.; Visualization, G.G.K.; Writing—original draft, G.G.K. and L.M.; Writing—review and editing, G.G.K., L.M., F.K., F.J., K.K., R.S., G.M., A.H., F.H., M.M., and F.B.K.B.; All authors contributed to the final draft. All authors have read and agreed to the published version of the manuscript.

Funding

Luxembourg National Research Fund, FNR (FNR PEARL P16/BM/11192868 grant given to M.M.). Fondation Cancer, Luxembourg (grant given to F.H., M.M., A.H., and F.B.K.).

Institutional Review Board Statement

This study was conducted in accordance with the Declaration of Helsinki and was approved by the Institutional Review Board. The INSITU study, CNER No. 201804/08, was approved by “Comité National d’Ethique de Recherche” (CNER) on 13 July 2018.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available upon request from the corresponding authors. The data are not publicly available due to the ethical approval, which does not permit the public sharing of data.

Acknowledgments

Figure 1 and Figure 5 were created with BioRender.com, last accessed on 18 January 2024.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Coons, A.H.; Creech, H.J.; Jones, R.N.; Berliner, E. The Demonstration of Pneumococcal Antigen in Tissues by the Use of Fluorescent Antibody. J. Immunol. 1942, 45, 159–170. [Google Scholar] [CrossRef]
  2. WHO Classification of Tumours Editorial Board. World Health Organization Classification of Tumours of the Central Nervous System, 5th ed.; International Agency for Research on Cancer: Lyon, France, 2021; ISBN 978-92-832-4508-7.
  3. Auner, G.W.; Koya, S.K.; Huang, C.; Broadbent, B.; Trexler, M.; Auner, Z.; Elias, A.; Mehne, K.C.; Brusatori, M.A. Applications of Raman Spectroscopy in Cancer Diagnosis. Cancer Metastasis Rev. 2018, 37, 691–717. [Google Scholar] [CrossRef]
  4. Hollon, T.C.; Pandian, B.; Adapa, A.R.; Urias, E.; Save, A.V.; Khalsa, S.S.S.; Eichberg, D.G.; D’Amico, R.S.; Farooq, Z.U.; Lewis, S.; et al. Near Real-Time Intraoperative Brain Tumor Diagnosis Using Stimulated Raman Histology and Deep Neural Networks. Nat. Med. 2020, 26, 52–58. [Google Scholar] [CrossRef]
  5. Zhou, Y.; Liu, C.-H.; Wu, B.; Yu, X.; Cheng, G.; Zhu, K.; Wang, K.; Zhang, C.; Zhao, M.; Zong, R.; et al. Optical Biopsy Identification and Grading of Gliomas Using Label-Free Visible Resonance Raman Spectroscopy. J. Biomed. Opt. 2019, 24, 095001. [Google Scholar] [CrossRef]
  6. Pekmezci, M.; Morshed, R.A.; Chunduru, P.; Pandian, B.; Young, J.; Villanueva-Meyer, J.E.; Tihan, T.; Sloan, E.A.; Aghi, M.K.; Molinaro, A.M.; et al. Detection of Glioma Infiltration at the Tumor Margin Using Quantitative Stimulated Raman Scattering Histology. Sci. Rep. 2021, 11, 12162. [Google Scholar] [CrossRef] [PubMed]
  7. Kalkanis, S.N.; Kast, R.E.; Rosenblum, M.L.; Mikkelsen, T.; Yurgelevic, S.M.; Nelson, K.M.; Raghunathan, A.; Poisson, L.M.; Auner, G.W. Raman Spectroscopy to Distinguish Grey Matter, Necrosis, and Glioblastoma Multiforme in Frozen Tissue Sections. J. Neurooncol. 2014, 116, 477–485. [Google Scholar] [CrossRef] [PubMed]
  8. Romanishkin, I.; Savelieva, T.; Kosyrkova, A.; Okhlopkov, V.; Shugai, S.; Orlov, A.; Kravchuk, A.; Goryaynov, S.; Golbin, D.; Pavlova, G.; et al. Differentiation of Glioblastoma Tissues Using Spontaneous Raman Scattering with Dimensionality Reduction and Data Classification. Front. Oncol. 2022, 12, 944210. [Google Scholar] [CrossRef] [PubMed]
  9. Zhang, L.; Zhou, Y.; Wu, B.; Zhang, S.; Zhu, K.; Liu, C.-H.; Yu, X.; Alfano, R.R. A Handheld Visible Resonance Raman Analyzer Used in Intraoperative Detection of Human Glioma. Cancers 2023, 15, 1752. [Google Scholar] [CrossRef] [PubMed]
  10. Jabarkheel, R.; Ho, C.-S.; Rodrigues, A.J.; Jin, M.C.; Parker, J.J.; Mensah-Brown, K.; Yecies, D.; Grant, G.A. Rapid Intraoperative Diagnosis of Pediatric Brain Tumors Using Raman Spectroscopy: A Machine Learning Approach. Neurooncol. Adv. 2022, 4, vdac118. [Google Scholar] [CrossRef] [PubMed]
  11. Jermyn, M.; Mok, K.; Mercier, J.; Desroches, J.; Pichette, J.; Saint-Arnaud, K.; Bernstein, L.; Guiot, M.-C.; Petrecca, K.; Leblond, F. Intraoperative Brain Cancer Detection with Raman Spectroscopy in Humans. Sci. Transl. Med. 2015, 7, 274ra19. [Google Scholar] [CrossRef] [PubMed]
  12. Eichberg, D.G.; Shah, A.H.; Di, L.; Semonche, A.M.; Jimsheleishvili, G.; Luther, E.M.; Sarkiss, C.A.; Levi, A.D.; Gultekin, S.H.; Komotar, R.J.; et al. Stimulated Raman Histology for Rapid and Accurate Intraoperative Diagnosis of CNS Tumors: Prospective Blinded Study. J. Neurosurg. 2021, 134, 137–143. [Google Scholar] [CrossRef]
  13. Butler, H.J.; Ashton, L.; Bird, B.; Cinque, G.; Curtis, K.; Dorney, J.; Esmonde-White, K.; Fullwood, N.J.; Gardner, B.; Martin-Hirsch, P.L.; et al. Using Raman Spectroscopy to Characterize Biological Materials. Nat. Protoc. 2016, 11, 664–687. [Google Scholar] [CrossRef]
  14. Fairley, J.A.; Gilmour, K.; Walsh, K. Making the Most of Pathological Specimens: Molecular Diagnosis in Formalin-Fixed, Paraffin Embedded Tissue. Curr. Drug Targets 2012, 13, 1475–1487. [Google Scholar] [CrossRef] [PubMed]
  15. Mathieson, W.; Thomas, G.A. Why Formalin-Fixed, Paraffin-Embedded Biospecimens Must Be Used in Genomic Medicine: An Evidence-Based Review and Conclusion. J. Histochem. Cytochem. 2020, 68, 543–552. [Google Scholar] [CrossRef] [PubMed]
  16. Yi, Q.; Yang, R.; Shi, J.; Zeng, N.; Liang, D.; Sha, S.; Chang, Q. Effect of Preservation Time of Formalin-Fixed Paraffin-Embedded Tissues on Extractable DNA and RNA Quantity. J. Int. Med. Res. 2020, 48, 0300060520931259. [Google Scholar] [CrossRef] [PubMed]
  17. Faoláin, E.Ó.; Hunter, M.B.; Byrne, J.M.; Kelehan, P.; Lambkin, H.A.; Byrne, H.J.; Lyng, F.M. Raman Spectroscopic Evaluation of Efficacy of Current Paraffin Wax Section Dewaxing Agents. J. Histochem. Cytochem. 2005, 53, 121–129. [Google Scholar] [CrossRef] [PubMed]
  18. Mariani, M.M.; Lampen, P.; Popp, J.; Wood, B.R.; Deckert, V. Impact of Fixation on in Vitro Cell Culture Lines Monitored with Raman Spectroscopy. Analyst 2009, 134, 1154–1161. [Google Scholar] [CrossRef] [PubMed]
  19. Klamminger, G.G.; Klein, K.; Mombaerts, L.; Jelke, F.; Mirizzi, G.; Slimani, R.; Husch, A.; Mittelbronn, M.; Hertel, F.; Kleine Borgmann, F.B. Differentiation of Primary CNS Lymphoma and Glioblastoma Using Raman Spectroscopy and Machine Learning Algorithms. Free Neuropathol. 2021, 2, 26. [Google Scholar] [CrossRef]
  20. Fullwood, L.M.; Clemens, G.; Griffiths, D.; Ashton, K.; Dawson, T.P.; Lea, R.W.; Davis, C.; Bonnier, F.; Byrne, H.J.; Baker, M.J. Investigating the Use of Raman and Immersion Raman Spectroscopy for Spectral Histopathology of Metastatic Brain Cancer and Primary Sites of Origin. Anal. Methods 2014, 6, 3948–3961. [Google Scholar] [CrossRef]
  21. Livermore, L.J.; Isabelle, M.; Bell, I.M.; Scott, C.; Walsby-Tickle, J.; Gannon, J.; Plaha, P.; Vallance, C.; Ansorge, O. Rapid Intraoperative Molecular Genetic Classification of Gliomas Using Raman Spectroscopy. Neurooncol. Adv. 2019, 1, vdz008. [Google Scholar] [CrossRef]
  22. Klamminger, G.G.; Gérardy, J.-J.; Jelke, F.; Mirizzi, G.; Slimani, R.; Klein, K.; Husch, A.; Hertel, F.; Mittelbronn, M.; Kleine-Borgmann, F.B. Application of Raman Spectroscopy for Detection of Histologically Distinct Areas in Formalin-Fixed Paraffin-Embedded Glioblastoma. Neurooncol. Adv. 2021, 3, vdab077. [Google Scholar] [CrossRef] [PubMed]
  23. Fullwood, L.M.; Griffiths, D.; Ashton, K.; Dawson, T.; Lea, R.W.; Davis, C.; Bonnier, F.; Byrne, H.J.; Baker, M.J. Effect of Substrate Choice and Tissue Type on Tissue Preparation for Spectral Histopathology by Raman Microspectroscopy. Analyst 2014, 139, 446–454. [Google Scholar] [CrossRef] [PubMed]
  24. Crystran Ltd. Raman Substrate Materials. Available online: https://www.crystran.co.uk/raman-substrate-materials/ (accessed on 13 November 2020).
  25. Mian, S.A.; Colley, H.E.; Thornhill, M.H.; Rehman, I. Development of a Dewaxing Protocol for Tissue-Engineered Models of the Oral Mucosa Used for Raman Spectroscopic Analysis. Appl. Spectrosc. Rev. 2014, 49, 614–617. [Google Scholar] [CrossRef]
  26. Bury, D.; Morais, C.; Ashton, K.; Dawson, T.; Martin, F. Ex Vivo Raman Spectrochemical Analysis Using a Handheld Probe Demonstrates High Predictive Capability of Brain Tumour Status. Biosensors 2019, 9, 49. [Google Scholar] [CrossRef]
  27. Jermyn, M.; Desroches, J.; Mercier, J.; Tremblay, M.-A.; St-Arnaud, K.; Guiot, M.-C.; Petrecca, K.; Leblond, F. Neural Networks Improve Brain Cancer Detection with Raman Spectroscopy in the Presence of Operating Room Light Artifacts. J. Biomed. Opt. 2016, 21, 094002. [Google Scholar] [CrossRef]
  28. Barton, S.J.; Hennelly, B.M. An Algorithm for the Removal of Cosmic Ray Artifacts in Spectral Data Sets. Appl. Spectrosc. 2019, 73, 893–901. [Google Scholar] [CrossRef]
  29. Louis, D.N.; Perry, A.; Wesseling, P.; Brat, D.J.; Cree, I.A.; Figarella-Branger, D.; Hawkins, C.; Ng, H.K.; Pfister, S.M.; Reifenberger, G.; et al. The 2021 WHO Classification of Tumors of the Central Nervous System: A Summary. Neuro-Oncology 2021, 23, 1231–1251. [Google Scholar] [CrossRef]
  30. Amharref, N.; Beljebbar, A.; Dukic, S.; Venteo, L.; Schneider, L.; Pluot, M.; Manfait, M. Discriminating Healthy from Tumor and Necrosis Tissue in Rat Brain Tissue Samples by Raman Spectral Imaging. Biochim. Biophys. Acta Biomembr. 2007, 1768, 2605–2615. [Google Scholar] [CrossRef]
  31. Kast, R.; Auner, G.; Yurgelevic, S.; Broadbent, B.; Raghunathan, A.; Poisson, L.M.; Mikkelsen, T.; Rosenblum, M.L.; Kalkanis, S.N. Identification of Regions of Normal Grey Matter and White Matter from Pathologic Glioblastoma and Necrosis in Frozen Sections Using Raman Imaging. J. Neurooncol. 2015, 125, 287–295. [Google Scholar] [CrossRef] [PubMed]
  32. Jermyn, M.; Desroches, J.; Mercier, J.; St-Arnaud, K.; Guiot, M.-C.; Leblond, F.; Petrecca, K. Raman Spectroscopy Detects Distant Invasive Brain Cancer Cells Centimeters beyond MRI Capability in Humans. Biomed. Opt. Express 2016, 7, 5129–5137. [Google Scholar] [CrossRef] [PubMed]
  33. Klein, K.; Klamminger, G.G.; Mombaerts, L.; Jelke, F.; Arroteia, I.F.; Slimani, R.; Mirizzi, G.; Husch, A.; Frauenknecht, K.B.M.; Mittelbronn, M.; et al. Computational Assessment of Spectral Heterogeneity within Fresh Glioblastoma Tissue Using Raman Spectroscopy and Machine Learning Algorithms. Molecules 2024, 29, 979. [Google Scholar] [CrossRef] [PubMed]
  34. Quesnel, A.; Coles, N.; Angione, C.; Dey, P.; Polvikoski, T.M.; Outeiro, T.F.; Islam, M.; Khundakar, A.A.; Filippou, P.S. Glycosylation Spectral Signatures for Glioma Grade Discrimination Using Raman Spectroscopy. BMC Cancer 2023, 23, 174. [Google Scholar] [CrossRef] [PubMed]
  35. Koljenović, S.; Schut, T.B.; Vincent, A.; Kros, J.M.; Puppels, G.J. Detection of Meningioma in Dura Mater by Raman Spectroscopy. Anal. Chem. 2005, 77, 7958–7965. [Google Scholar] [CrossRef] [PubMed]
  36. Jelke, F.; Mirizzi, G.; Borgmann, F.K.; Husch, A.; Slimani, R.; Klamminger, G.G.; Klein, K.; Mombaerts, L.; Gérardy, J.-J.; Mittelbronn, M.; et al. Intraoperative Discrimination of Native Meningioma and Dura Mater by Raman Spectroscopy. Sci. Rep. 2021, 11, 23583. [Google Scholar] [CrossRef]
  37. Morais, C.L.M.; Lilo, T.; Ashton, K.M.; Davis, C.; Dawson, T.P.; Gurusinghe, N.; Martin, F.L. Determination of Meningioma Brain Tumour Grades Using Raman Microspectroscopy Imaging. Analyst 2019, 144, 7024–7031. [Google Scholar] [CrossRef]
  38. Zhang, L.; Zhou, Y.; Wu, B.; Zhang, S.; Zhu, K.; Liu, C.; Yu, X.; Alfano, R.R. Intraoperative Detection of Human Meningioma Using a Handheld Visible Resonance Raman Analyzer. Lasers Med. Sci. 2021, 37, 1311–1319. [Google Scholar] [CrossRef]
  39. Mirizzi, G.; Jelke, F.; Pilot, M.; Klein, K.; Klamminger, G.G.; Gérardy, J.-J.; Theodoropoulou, M.; Mombaerts, L.; Husch, A.; Mittelbronn, M.; et al. Impact of Formalin- and Cryofixation on Raman Spectra of Human Tissues and Strategies for Tumor Bank Inclusion. Molecules 2024, 29, 1167. [Google Scholar] [CrossRef]
Figure 1. General overview of our experimental workflow. (A) Standard microtome used for cutting FFPE blocks. (B) Left: H&E-stained glioblastoma tissue on a glass slide. Right: consecutive section (unstained) on a CaF2 slide. (C) Raman measurement. (D) Mean Raman spectra of various primary brain tumors.
Figure 1. General overview of our experimental workflow. (A) Standard microtome used for cutting FFPE blocks. (B) Left: H&E-stained glioblastoma tissue on a glass slide. Right: consecutive section (unstained) on a CaF2 slide. (C) Raman measurement. (D) Mean Raman spectra of various primary brain tumors.
Brainsci 14 00301 g001
Figure 2. Mean Raman spectra (range: 500 cm−1–2000 cm−1) for gliomas (blue), meningiomas (orange), and carcinoma brain metastases (green).
Figure 2. Mean Raman spectra (range: 500 cm−1–2000 cm−1) for gliomas (blue), meningiomas (orange), and carcinoma brain metastases (green).
Brainsci 14 00301 g002
Figure 3. ROC and PR curves of the initial 3-class classification task (black dotted line = random performance level) (A) with their corresponding confusion matrix, (B) as well as metrics of precision, recall, and F1-score (C). Confusion matrix of the second multi-class classification (D). Within the 7-class classification model, a preserved assignment to the primary tumor origin (blue box = tumor origin: carcinoma brain metastases; orange box = tumor origin: primary brain tumors) can be seen, despite a notable confusion among the individual tumor types. Green shading = highest-class prediction; orange shading = lower-class prediction.
Figure 3. ROC and PR curves of the initial 3-class classification task (black dotted line = random performance level) (A) with their corresponding confusion matrix, (B) as well as metrics of precision, recall, and F1-score (C). Confusion matrix of the second multi-class classification (D). Within the 7-class classification model, a preserved assignment to the primary tumor origin (blue box = tumor origin: carcinoma brain metastases; orange box = tumor origin: primary brain tumors) can be seen, despite a notable confusion among the individual tumor types. Green shading = highest-class prediction; orange shading = lower-class prediction.
Brainsci 14 00301 g003
Figure 4. Above: (A) ROC/PR curves and corresponding AUROC/AUPR values for the carcinoma brain metastases classifier, as well as the corresponding confusion matrix. Bottom: (B) ROC/PR curves and corresponding AUROC/AUPR values for the glioma classifier with the corresponding confusion matrix. Black dotted line = random performance level.
Figure 4. Above: (A) ROC/PR curves and corresponding AUROC/AUPR values for the carcinoma brain metastases classifier, as well as the corresponding confusion matrix. Bottom: (B) ROC/PR curves and corresponding AUROC/AUPR values for the glioma classifier with the corresponding confusion matrix. Black dotted line = random performance level.
Brainsci 14 00301 g004
Figure 5. Visualization of the RS approach and subsequent data analyses. In case of only necrotic tissue fragments, an RS measurement followed by machine learning classification may determine the tumor origin. Left upper corner: H&E image of necrosis in a glioblastoma; right bottom corner: ROC/PR curves and confusion matrix of our established binary necrosis classification, showing the good separability based on spectral properties—the different color lines express the individual class performances in contrast to the random performance level (black dotted line).
Figure 5. Visualization of the RS approach and subsequent data analyses. In case of only necrotic tissue fragments, an RS measurement followed by machine learning classification may determine the tumor origin. Left upper corner: H&E image of necrosis in a glioblastoma; right bottom corner: ROC/PR curves and confusion matrix of our established binary necrosis classification, showing the good separability based on spectral properties—the different color lines express the individual class performances in contrast to the random performance level (black dotted line).
Brainsci 14 00301 g005
Table 1. Overview of different tumor types measured by means of Raman spectroscopy.
Table 1. Overview of different tumor types measured by means of Raman spectroscopy.
Tumor Group/
Tumor Type
Number of Cases
n = 82
Number of Measurements
n = 679
Astrocytoma of grades 2,3, IDH mutant974
Oligodendroglioma of grades 2,3, 1p19q co-deleted760
Ependymoma544
Glioblastoma, IDH wildtype27179
Meningothelial meningioma436
Transitional meningioma656
Breast carcinoma metastases853
Colorectal carcinoma metastases665
Non-small cell lung carcinoma (NSCLC) metastases10112
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Klamminger, G.G.; Mombaerts, L.; Kemp, F.; Jelke, F.; Klein, K.; Slimani, R.; Mirizzi, G.; Husch, A.; Hertel, F.; Mittelbronn, M.; et al. Machine Learning-Assisted Classification of Paraffin-Embedded Brain Tumors with Raman Spectroscopy. Brain Sci. 2024, 14, 301. https://0-doi-org.brum.beds.ac.uk/10.3390/brainsci14040301

AMA Style

Klamminger GG, Mombaerts L, Kemp F, Jelke F, Klein K, Slimani R, Mirizzi G, Husch A, Hertel F, Mittelbronn M, et al. Machine Learning-Assisted Classification of Paraffin-Embedded Brain Tumors with Raman Spectroscopy. Brain Sciences. 2024; 14(4):301. https://0-doi-org.brum.beds.ac.uk/10.3390/brainsci14040301

Chicago/Turabian Style

Klamminger, Gilbert Georg, Laurent Mombaerts, Françoise Kemp, Finn Jelke, Karoline Klein, Rédouane Slimani, Giulia Mirizzi, Andreas Husch, Frank Hertel, Michel Mittelbronn, and et al. 2024. "Machine Learning-Assisted Classification of Paraffin-Embedded Brain Tumors with Raman Spectroscopy" Brain Sciences 14, no. 4: 301. https://0-doi-org.brum.beds.ac.uk/10.3390/brainsci14040301

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop