|
1.IntroductionCancer is a disease of uncontrolled cell growth, beginning with one cell or a small group of cells, spreading over to different organs, and ultimately causing death if not controlled. According to GLOBOCAN fact sheet 2012, cancer-related mortality accounts for 8.2 million worldwide, out of which breast cancer alone accounts for 0.52 million.1 Breast cancer begins as a premalignant disease and reaches to its advanced and/or metastatic stage through various phases of tumor progression. The nonmodifiable factors responsible for breast cancer are the female gender itself, age, BRCA1 and BRCA2 genes, and personal factors such as a long menstrual history, use of oral contraceptives, and having first child after age 30 or nulliparous.2 Among the modifiable factors, the major risks are being overweight, lifestyle, taking hormone replacement therapy, taking birth control pills, drinking alcohol, having tobacco, etc. Once a breast tumor is developed, its growth and progression are always supported by the surrounding tumor microenvironment. It is supposed to be composed of a variety of cell types including endothelial cells, pericytes, smooth-muscle cells, fibroblasts of various phenotypes, myofibroblasts, immune cells, natural killer lymphocytes, and antigen-presenting cells, such as macrophages and also dendritic cells. They are often characterized by conditions like hypoxia, nutrient deprivation, acidosis, abnormal tumor angiogenesis, aberrant stroma, etc.3,4 There are several proteins secreted during the progression of the disease. These proteins play a key role in the biological pathway leading to the development and prognosis of the disease. The identification and characterization of these proteins (biomarkers) may lead to a better understanding of the disease patterns. However, the heterogeneous nature of breast cancer makes biomarker identification a challenging task and alternative strategies are needed to address this problem by detecting the cancer early. The current available techniques for breast cancer detection are clinical breast exam, mammography, ultrasounds, molecular breast imaging, and magnetic resonance imaging (MRI), especially for women with a high risk of breast cancer.5 Mammography and ultrasonography are considered to be the most effective screening techniques for the premenopausal and postmenopausal women, respectively.6 Besides their applicability, these techniques have some disadvantages as well.2,5 For example, mammograms, the low dose x-ray images of the breast, are effective in detecting breast cancer, but despite their effectiveness, sometimes produce false-negative results, especially in the case of young subjects.5 The mammographic detection sometimes shows poor prognosis and repeated exposure may lead to other side effects as well.5,6 Many women find having a mammography uncomfortable or painful. The MRI technique is costly and may provide false-negative results. It requires additional follow-ups and biopsies and is usually not recommended for patients with life time risk for breast cancer.5 It may cause claustrophobia, may not show all calcifications, and can induce alterations in the ECG of the patient for a short period of time. The best option could be developing a new tool to detect breast cancer early when the disease has just started, providing enough scope for treatment planning and cure by avoiding the complications. When a cancer is detected early, it increases the survival rate by several folds as opposed to late detection where the mortality rate always prevails over survival.5,6 The decrease in mortality rates signifies progress in both the early detection methodology and an improved treatment strategy. The main objective of the present study is developing a suitable tool for early detection and/or stratification of breast cancer. The attempts to search for tools for early diagnosis of cancer are in progress all over the world in several disciplines of science and technology. Among them, the optical and related techniques are being explored exponentially.7–11 The optical spectroscopy techniques, such as fluorescence, reflectance, Raman scatterings, or a combination of a few or all were tested as potential early detection tools.7–11 Photoacoustic (PA) spectroscopy, the hybrid of optics and acoustics, is a simple and sensitive technique with several advantages over others, such as minimal sample preparation, nondestructive, and no scattering loss in turbid media.12–17 The key advantage of the technique is the signal generation based on differential optical absorption by the constituents of a specimen like biological tissues and acoustic detection.15–17 When a pulsed or modulated light of suitable wavelength is incident on a sample, the absorption of light energy by the sample’s molecules leads to expansion at the excitation volume of the sample, followed by compression in a repetitive sequence of light modulation, generating transient pressure variations or acoustic signals at the excitation volume, which propagate away from the source point.12–15 The transient pressure changes can be detected by fast pressure transducers like piezoelectric transducers (PZT) or microphones and the information about the sample can be obtained.15–17 Since the PA signal solely depends on the absorption of radiation by the biomolecules in the samples under study, the recorded spectral patterns exhibit various characteristics of the biomolecules in the sample. Any change or modification in the biochemical makeup of the sample upon disease initiation is reflected in the corresponding PA patterns. This provides information on the status of the sample under study as well as the microstructure surrounding the change.18 When the absorber is in abundance, the same can be visualized in the imaging as well by using appropriate pressure sensors.19 The PA spectral features in imaging also demonstrate the optical absorption and the geometrical properties of the absorbing structures in the samples under study.20 There are reports on characterizing histological microstructures of the biological specimens using PA spectral analysis.21,22 The technique has been used in vivo and ex vivo to discriminate the various cancer types from the normal based on respective spectral features.20,23–27 The technique has also found applicability in the detection of circulating tumor cells in flow cytometry.28 PA imaging is being put to use from detection and diagnosis of various cancers to nanomedicine for cancer therapy.29–31 Even though the imaging techniques produce a wide array of information about the biological variation, the abundance/size of the sample and the choice of imaging contrast agents are the major constraints with the technique.19,20,29–32 The technique is very popular in studying various cancers and other diseases, but its application in early detection of asymptomatic tumors needs to be explored, experimented with, and validated. The main idea in the present study is to correlate the spectral patterns of the biochemical changes linked to disease association for the early detection of cancer using PA spectroscopy. In the present study, an attempt is made to evaluate breast tumor progression in a xenograft model, an ex vivo study, to evaluate the effectiveness of PA spectroscopy in early detection of breast cancer. The idea of selecting a xenograft model is based on the growing body of evidence of using such models for understanding human diseases.33–36 The laboratory mice models are considered as one of the best model systems available for investigating cancers in humans mostly because of the extensive physiological and molecular similarities, small size, short life span, and availability of its entirely sequenced genome with a chance of further manipulation if any.34,35 Many studies have been carried out all over the world using the immune-compromised mice for the understanding of cancer progression, drug delivery pathways, effects of chemicals on the cancer progression, biomarker discovery, etc.34–39 2.Material and Methods2.1.Experimental SetupThe block diagram of the experimental setup used in the present study is shown in Fig. 1. The setup consists of an excitation source, PA cell, preamplifier, and the signal processing unit. The excitation source is a combination of an Nd-YAG laser (LM1278 LPY707G-10, LITRON Lasers, United Kingdom) and a frequency doubled dye laser (PULSARE Pro, FINE ADJUSTMENT, Germany). The second harmonic (532 nm) of the Nd-YAG laser is used to pump the Rhodamin 6G dye to obtain lasing in the 545 to 580 nm region. The dye laser output is frequency doubled for obtaining 281 nm laser light that is used in the present study. The PA cell, an arrangement of the sample holder and the PZT housing, is fabricated in-house. The PZT (Model PIC 181, length 10 mm, diameter 5 mm, PI Ceramics, Germany) is connected to the preamplifier using a Bayonet Neilsl–Concelman (BNC) type cable connector, which is then coupled to a cathode ray oscilloscope (Tektronix, TDS 5034B) for signal processing and recording. The laser light is focussed onto the sample (laser spot size ) kept in the sample holder of the PA cell by using a 5 cm focusing lens. All the experiments have been conducted with a laser energy per pulse. The energy density for the laser excitation used in the present study is , which is well within the maximum permissible exposure of as described in ANSI guidelines for the wavelength range of 180 to 302 nm for both ocular and skin exposures.40 2.2.Optimization of Protocol for Breast Xenograft ModelIn order to grow human tumors in mice, the immune-compromised murine models are currently being used to prevent the immune rejection. Such models eliminate the role of the immune system in tumor growth/progression in xenograft and have the advantage of serving as a better and reliable model for analyzing cancers in a syngeneic background for in vivo studies.39,41–43 In the present study, BALB/c nude mice, an immune deficient mouse model, is used for inducing breast tumor. The workflow for tumor establishment in nude mice using breast cancer cell lines is shown in Fig. 2(a). Before carrying out this study, animal ethical approval from the Institutional Animal Ethical Committee was obtained, and the study was conducted as per the stated guidelines. Six- to eight-week-old female nude mice were used in the study. The animals were kept under pathogen-free conditions in individually ventilated cages under standardized environmental conditions (22°C room temperature, relative humidity, 12 h light-dark rhythm) and received sterilized food, bedding, and drinking water. The MCF-7 breast cancer cell line was used to subcutaneously induce tumor in nude mice. It was observed that MCF-7 cells suspended in matrigel were optimal for establishing the tumor xenograft model. Matrigel (BD Matrigel™ Basement Membrane Matrix), which is a gelatinous basement membrane extracted from the Engelbreth-Holm-Swarm mouse sarcoma, was used in the inoculation procedures as a supporting and growth medium. For injection, the cells were drawn into a 1 cc syringe using a 26 gauge needle and injected subcutaneously into the lower flank region of the mice. The animals were routinely monitored for the appearance of tumor development, and the tumor dimensions were measured using a digital Vernier caliper. The tumor volumes were calculated in using the equation: .41–44 2.3.Collection of SamplesOnce the xenograft model was standardized, a fresh set of animals were used for tumor development by injecting MCF-7 cells and monitoring over a period of 20 days until the tumor volume reached . Three time points, days 10, 15, and 20 (D10, D15, and D20) following MCF-7 cells injection, were decided for monitoring the tumor growth, allotting six animals () to each time point. The experiements were performed in duplicates, making the total number of animals 36 () for the study. On the respective day of each time point, the animals were sacrificed, the tumor volume was measured using the digital caliper, and the tumor tissues were extracted. The tissue samples were stored at by snap freezing in liquid nitrogen until further use. The stored samples were thawed to room temperature for recording of PA spectra at 281 nm excitation. 2.4.Photoacoustic Spectral Recordings and PreprocessingThe PA spectra of all 36 samples, 12 each from days 10, 15, and 20, were recorded by taking each one of them in a cuvette in the PA cell. Further, to avoid any sample contamination, the cuvette was used to hold the samples, washed thoroughly using distilled water twice followed by alcohol cleaning. The minimum size of the tumor tissue used for the recording of PA spectra was . A total of six spectra per sample were recorded from six different sites of the samples, generating 216 spectra (). The PA spectra were fast Fourier transformed (FFT) using MATLAB® @R2015a (V8.5) algorithms. 3.Data AnalysisThe FFT patterns were preprocessed by performing the background subtraction and normalization. The normalized FFT patterns were later wavelet transformed and used for feature extraction in the spectral region from 14.375 to 80.625 kHz. In order to accomplish the step mentioned above, the normalized FFT spectra were subjected to Haar wavelet analysis and the resultant wavelet coefficients were subsequently subjected to Morlet wavelet analysis. Following this, the feature extraction was carried out. Seven features were extracted: mean, median, area under the curve, variance, standard deviation, skewness, and kurtosis extracted from each and a feature matrix () was prepared. Principal component analysis (PCA) was performed on the data for reducing the variability, followed by logistic regression analysis for discriminating the data by plotting a decision boundary. All the data analysis, i.e., extraction of the wavelet coefficients, spectral features selection, PCA, and logistic regression analysis, were carried out in MATLAB® @R2015a (V8.5). The significance tests [Student test and one-way analysis of variance (ANOVA)] were carried out using Graph Pad Prism 5 (V 5.01). 3.1.Wavelet AnalysisThe wavelet transform analysis is a time-frequency decomposition tool for data analysis, which finds applications in a multitude of diverse physical phenomena such as analysis of climate, financial indices, video images, and medical signals.45,46 It is a mathematical tool utilized for converting a signal or function into another form that makes certain features of the signal more amenable, cleaner, and easier to study.47,48 The wavelets are particularly useful in analyzing the signals that are nonuniform, noisy in nature, and usually in temporal domains. There are two types of wavelets; i.e., if the wavelet analysis has been done throughout the length of the signal, it is called continuous wavelet transform (CWT), and if the wavelet has been separately used for the analysis of the segments of the signal, it is called discrete wavelet transform. It is very crucial to choose the right type of wavelets for the analysis for optimal outcomes. In this study, as the signals are continuous, the CWT is found suitable for the analysis. In CWT analysis, the wavelet coefficients are extracted by convolving the wavelet with the signal (or scaling and shifting the wavelet across the length of the signal). The equation for CWT48 is given by where denotes the wavelet coefficients, is the scaling parameter, and is the translational parameter. The scaling parameter is used to convert the mother wave (obtained when and ) into a more compressed or expanded form that would better suit the type of spectra being analyzed, and the translational parameter is used to translate the wavelet obtained by scaling along the length of the signal. is the weighing function, which is set to for energy conservation as the wavelet at every scale should have the same energy. The function represents the signal that is being analyzed. The complex conjugate (denoted by *) of the wavelet is used to compute the wavelet coefficients. The signal and the wavelet are the functions of variable .The wavelet, a finite nonuniform signal, is used to extract the local features from the signals under study. This is done by transforming the given signal into another representation that is more useful. The dilating and the translation properties allow the wavelet to be equipped with flexible and variable time. The frequency windows that narrow down at high frequencies and broaden at low frequencies make the wavelet available to localize on any detail of the signal to be analyzed. These properties make the wavelet transform suitable for analyzing the serpentine signals. This way, the wavelets obtained are localized in time and in frequency, having the unique scaling properties and the wavelet coefficients. As a result, the multiresolution analysis of the wavelet is advantageous in both the time and the frequency domains. In the present study, we have used Haar and Morlet wavelet methods46–49 to analyze the PA spectra and attempted to evaluate the subtle differences between them. 3.2.Feature Extraction and Principal Component AnalysisOnce the wavelet analysis is done on the FFT patterns and the Morlet coefficients are obtained, the statistical features representing the unique characteristics in each group of the samples under study are extracted. As mentioned above, there are seven statistical features extracted: mean, median, area under the curve/sum (measures of central tendency), variance and standard deviation (measures of dispersion), skewness and kurtosis (measures of shape).7,16,49 The PCA, a data minimization and dimension reduction tool, is performed on the feature matrix of three sample types under study and the first two principal components (PCs) showing maximum variance within the data are identified. The scores of the identified PCs in the logarithmic form are used for logistic regression analysis seeking the data discrimination and decision boundary visualization. 3.3.Logistic RegressionThe logistic regression is a machine learning algorithm used to classify the datasets that are categorical in nature. The main goal in logistic regression is to find the best fitting decision boundary for a set of data that determines the class of the data based on the dichotomous characteristics.50–53 It allows one to estimate the probability of an event occurring and provides a linear classifier. It uses one or more independent variables for determining the data classification. It classifies various data by creating a decision boundary based on the internal characteristics of the data. For making the decision boundary a reference, a sigmoid function is used for separating the positive and negative classes of data. The sigmoid function [], also referred as the logistic regression function,43–46 which ranges from 0 to 1 as the value of varies over the horizontal axis from positive to negative infinity, is defined as in Eq. (3). where represents the hypothesis equation or the equation of the decision boundary. The hypothesis equation [Eq. (4)] segregates the two classes. This equation is governed by the set of features () used and the parameters () that assign a specific weight to each feature for the probability that or 1 in the classification of a test sample. The function can be estimated for the probability that or 1 and is usually referred to as the hypothesis .The key requirement of the algorithm, dealing with the above hypothesis, is that the amount of deviation of each sample belonging to the positive and negative classes from the decision boundary should be minimum. Hence, the values of the parameters have been chosen in such a way that the above requirement is completely fulfilled and computed using the cost function. The cost function gives the amount of deviation that the decision boundary has from each sample in the dataset. As the deviation from the sample should be minimum, the values of must be chosen such that the cost function is minimized. The cost function used for logistic regression is shown in Eq. (5). The values of the parameters () used to plot the decision boundary are calculated using Eq. (6) as shown below. In the above equation, is the learning rate. This equation is used to compute the optimal values of that have been obtained for the least values that the cost function takes. The logistic regression analysis not only determines the boundary between the two classes of the samples but also signifies the class based on the probability depending on the separation of the class elements from the boundary. The strength of the logistic regression has been utilized in the present study to categorize and discriminate the samples from different phases of breast tumor growth in a xenograft model. A supervised mode of logistic regression analysis50–54 was performed using the log of score values of PC1 and PC2. Initially, we compared two groups at a time, such as D10 versus D15, D15 versus D20, and D10 versus D20, as well as taking all groups together (D10 versus D15 versus D20) to understand the distribution, variability and scatterness in the data. A calibration set comprising 20 samples (spectra) from each group (D10, D15, and D20) under study was formed, and by performing logistic regression analysis on the data, a decision boundary of classification with 100% specificity and sensitivity was created. The decision boundary was utilized for testing the remaining samples (spectra) from each group (52 spectra) under study once it was established for the calibration data. 4.Results and DiscussionIn this study, in order to establish the animal xenograft model for the breast tumor development, attempts have been made with MCF-7 breast cancer cell lines. It was tested with various numbers of cells and optimal growth was found when cells were subcutaneously injected into the mice along with matrigel. This study was carried out using three mice. During the experiment, two animals out of the three were found weak and died in between, and in the third animal, which was alive, a palpable tumor bulge was visible on day 7 postinjection followed by a gradual increase in the tumor volume in the subsequent days. It was also noticed that by day 25, the tumor volume had crossed the mark of . As per animal ethical considerations, all of the animals under study were sacrificed once the tumor volume reached . The tumor volume (, ) measurements are shown in Fig. 2(b). One-way ANOVA was carried out on the mean tumor volumes of three different time point groups and a significant difference () was observed. Further, Bonferroni’s test for multiple pairs was also conducted showing the significant difference () with each case. Subsequently, the tumor tissues were excised for the three groups on the respective days and the corresponding PA spectra were recorded. Since PA spectroscopy works based on the absorption of radiation by the samples under study, the selection of a suitable wavelength for excitation is very important for optimal signal generation. Further, the absorption properties of the biological samples depend on their physiological states as well as their biochemical natures.55 These samples have many intrinsic chromophores that exhibit the PA effect at specific wavelength excitations, such as proteins with 250 to 300 nm, nicotinamide adenine dinucleotide (NADH) with 300 to 390 nm, collagen and elastin with 325 to 360 nm, hemoglobin and melanin with 450 to 1000 nm, and lipids with 340 to 450 nm, respectively.55–57 The absorption coefficient values for these chromophores rely on the wavelength of excitations and the condition of the sample, differing in spectral patterns for diseased and normal states. Also, the advantage of using and targeting the intrinsic chromophores avoids any chance of cell toxicity and allows evaluation of the samples on the basis of their physiological and metabolic states.55–57 It is a well known fact that the proteomic and metabolomics profiles of malignant conditions are different from normal conditions. The type of proteins and their concentrations vary widely in the two given conditions. Proteins that mostly absorb at UV range due to amino acid residues such as tryptophan (280 nm), tyrosine (274 nm), phenylalanine (250 nm), and histidine and cysteine residues (260 nm), are the dynamic entities that host a plethora of information about potential biomarkers for disease diagnosis.57–59 There are reports in the literature with cell line studies indicating the presence of an increased amount of tryptophan in aggressive cancerous cells.58 Fluorescence spectroscopy studies as well have demonstrated increased tryptophan concentrations in breast carcinoma samples in comparison to the normal breast tissues.59 Many early breast cancer diagnostic attempts using serum samples by mass spectrometry studies have also indicated that the blood serum amino acid levels increase with tumor development in breast cancer patients, and these profiles are organ and site specific.60,61 The purpose of selecting 281 nm laser light in the present study is to target tryptophan and tyrosine residues of the proteins in the tumor tissues under study for PA signal generation and capturing the corresponding change. Hence, in the present study targeting proteins containing tryptophan and tyrosine residues of tumor tissues, 281 nm pulsed laser light was used for excitation of the samples and the corresponding PA spectra were recorded. The average PA spectra in the time domain (left) of tumor tissues for three different time points of tumor development (days 10, 15, and 20) and the corresponding FFT patterns in the frequency domain (right) are shown in Fig. 3. As the PA cell is a resonant cavity, the Fourier transforms of the acoustic waves along with the resonant frequencies of the cell for the different types of samples were found to be more informative for further analysis. By examining the signal in the frequency domain, the modal frequencies developed upon excitation of the samples for different phases/stages were observed to vary with the pathological states of the tumor. However, this variation of modal frequencies for the three groups was not found to change systematically. For further analysis of the data, finding clear indication of tumor growth and development, suitable spectral features correlating systematic changes of tumor progression involving wavelet analysis were undertaken. There have been considerable efforts already going on all over the world for making use of the spectral variations associated with the disease states/conditions in providing diagnostic assistance. The main idea behind these analyses was to utilize the information associated with the biochemical changes upon the disease initiation for objective assessment of a disease. The statistical tools normally assign the spectral samples under study to one or more predefined categories/groups and perform analysis on them applying either supervised or nonsupervised modes of operations. In the present study, we have employed a supervised mode of analysis to achieve the objective discrimination among the different groups of samples used for the study. Initially, we looked into the frequency patterns of the samples belonging to D10, D15, and D20, and a gradual increase in the strength of certain frequencies 29.37, 29.69, and 30 kHz was observed with the progression of the tumor growth as shown in Fig. 4. The increase of frequency strengths was also found to be significant between D10 and D20 groups. However, no significant difference in the frequency strengths for D10 and D15 as well as D15 and D20 was observed. Although there were significant differences in the strengths of some other frequencies (33.75, 34.06, and 50 kHz) observed for D10 and D15 as well as D15 and D20, no specific trend was noticed in the variation. Hence, we found this region of 14 to 80 kHz had a high potential for inherent information, which may contribute to grouping the time points under study. It is an established fact that the low-frequency components always travel faster through a medium. It can be correlated with the tumor growth/development observed in the present study in the case of low-frequency (0 to 30 kHz) components of the PA spectra discussed above. Further, there are reports in the literature62–64 showing desmoplastic reactions in the stroma of neoplastic tissues in breast cancer conditions, resulting in overdeposition of dense fibrous tissues (mainly collagen I, III, and IV) and other extracellular matrix components. This leads to the formation of the solid lump within the tissue providing it with tensile strength, stiffness, and tightness. The acoustic waves in the present study are produced due to the absorption of 281 nm excitation by the tryptophan moieties of proteins in the tumor, which subsequently propagate through the tumor mass. Once the tumor tissue becomes stiffer and the free space gets filled, it facilitates the movement of the acoustic waves in a faster manner. This may be one of the reasons for the gradual increase observed in the strengths of the low-frequency (0 to 30 kHz) peaks in the present study. The irregularities in the peak intensity behavior above 30 kHz may be attributed to the inhomogeneity of the acoustic field in the tumor mass.65 Also, in cancer cells, it has been observed that the cellular and the nuclear morphology differ from a normal cell. The major noticeable characteristic of the nuclei of cancer cells is that they are abnormally enlarged and have a folded shape.66–68 Though a direct relationship is still not established between the nuclear morphology and the cancer phenotype, the nuclear changes in the cancer cells may be tumor type and stage specific.67 Further, it has been noticed that even though the absorption maxima of DNA is at 260 nm, for proteins at 280 nm, the ratio of absorption coefficient for DNA to proteins is much less at 280 nm.65–68 Thus, the low optical contrast between DNA and protein absorption at 281 nm might have contributed to the inhomogeneity of the acoustic field and affected the signal propagation. In order to accomplish the discrimination among different time point groups under study, the corresponding PA FFT patterns were subjected to Haar and Morlet wavelet analysis one after the other, and the Morlet wavelet coefficients so obtained were used for extracting features for further analysis. As mentioned above, there were seven spectral features (mean, median, area under the curve, variance, standard deviation, skewness, and kurtosis) extracted, and using them, a feature matix of dimensions was prepared. This feature matrix was used for PCA and the first three PCs showing the maximum variance in the data that were selected. The percentage variance of PC1, PC2, and PC3 scores for day 10, day 15, and day 20 groups were found to be 87.40, 12.58, 0.009; 97.54, 2.44, 0.006; and 99.33, 0.65, 0.001%, respectively. From the above information, it is clear that the first two PC scores for all the three groups were sufficient to represent the maximum variability within the respective group of samples. Further, a comparison of log PC1 and log PC2 score values for the three groups by the Kruskal-Wallis test also revealed a significant difference among them. PC1 captured the maximum variability among the three groups ( ***). Dunn’s multiple comparison test has also shown a significant ( ***) difference with PC1 for D10 versus D15, D10 versus D20, and D15 versus D20, respectively. PC2 was also found to be significant (, ***), and Dunn’s multiple comparison test with PC2 for D10 versus D15 and D10 versus D20 was found to be significant. Hence, the subsequent logistic regression analysis was conducted using the logs of PC1 and PC2 scores only. Once the first two PC components were found to contribute the maximum variance in the data as well as significance in their classification, the logs of PC1 and PC2 scores for 20 samples of all three groups under study were randomly chosen to form a calibration set and were used for making an appropriate decision boundary in the logistic regression with 100% specificity and sensitivity, respectively. Subsequently, log PC1 and PC2 scores of the remaining 52 samples in each group were tested against the calibration set decision boundary and the corresponding sensitivity, specificity, and accuracy of the classification was calculated. The correct prediction accuracy of the classification as well as the sample membership association with a particular group for D10 versus D15 was found to be 90.38 and 94.23%, respectively, with an overall accuracy of 92.31% as shown in Fig. 5(a). In the case of D15 versus D20, the correct prediction accuracy as well as the sample membership association to a particular group was 80.77 and 94.23%, respectively, along with overall accuracy of 87.5% for the analysis as shown in Fig. 5(b). The prediction accuracy and the membership association for D10 versus D20 analysis was found to be 92.31 and 98.08%, respectively, along with the overall accuracy of 95.2% as shown in Fig. 5(c). The prediction accuracy of the multiple comparision analysis performed with all three calibration sets taken together, making a decision boundary for each and testing the remaining samples of each group against the respective decision boundaries was found to be 90.38, 73.08, and 94.23% for D10 versus D15, D15 versus D20, and D10 versus D20, respectively [Fig. 5(d)]. The overall accuracy of the multiple comparison analysis was found to be 85.9% as shown in Fig. 5(d). The observed misclassifications in D15 versus D20 comparison may be due to the tumor heterogeneity during the growth period and the physiology of the cancer cells. Further, there are reports showing morphological changes within the developing tumors and their microenvironment influencing the light propagation and hence the acoustic velocity through them affecting the corresponding spectral characteristics.47,48,65 In order to test the reproducibility of the discrimination models for D10 versus D15, D15 versus D20, and D10 versus D20 analysis, the receiver operator characteristics (ROC) curves for the same were also plotted as shown in Fig. 6. The area under the curves (AUC) for the ROC plots of D10 versus D15, D15 versus D20, and D10 versus D20 analysis were found to be 0.95, 0.85, and 0.93, respectively. These observations clearly demonstrate that the ROC-AUC values are approaching 1, indicating the potentiality of the analysis to show the significant discrimination among the different groups () under study, suggesting the clinical implication of the analysis. 5.ConclusionThe PA spectroscopy used to monitor the breast tumor development in the present study has clearly demonstrated its ability to capture the minor biochemical changes upon tumor development. The wavelet-PCA based logistic regression analysis, applied for the assessment of the tumor growth, has distinctly identified the unique spectral patterns in each group of the samples under study and accordingly differentiated the tumor progression. The sensitivity of the technique in capturing the minor biochemical changes along with the accuracy of identifying the tumor progression by using wavelet-PCA based logistic regression analysis has evidently provided its possible clinical implications. However, for identifying specific biomarkers of tumor progression and drawing a correlation with the early detection of the disease, further in-depth PA studies are warranted. AcknowledgmentsThe authors would like to acknowledge the patronage of the Manipal University, Manipal, TIFAC CORE in Pharmacogenomics at the School of Life Sciences, Manipal University, and the Vision Group of Science & Technology, Karnataka State, India. We would also like to thank Professor K. Satyamoorthy, director of the School of Life Sciences, Manipal University, Manipal, for his constant encouragement and support. One of the authors, Ms. Mallika Priya, acknowledges the support of the Manipal University for providing a research fellowship under its structured PhD scheme during November 2010 to March 2014 and the Indian Council of Medical Research, Govt. of India, New Delhi, for a senior research fellowship during April 2014 to March 2015. We express our gratitude to Mr. M. S. Shankar Rao for technical English corrections of the manuscript. ReferencesJ. Ferlay et al.,
“GLOBOCAN 2012 v1.0, Cancer Incidence and Mortality Worldwide: IARC CancerBase No. 11,”
(2015) http://globocan.iarc.fr/Pages/fact_sheets_cancer.aspx?cancer=all September ). 2015). Google Scholar
A. Jemal et al.,
“Global cancer statistics,”
CA Cancer J. Clin., 61
(2), 69
–90
(2011). http://dx.doi.org/10.3322/caac.20107 CAMCAM 0007-9235 Google Scholar
M. Hu and K. Polyak,
“Microenvironmental regulation of cancer development,”
Curr. Opin. Genet. Dev., 18
(1), 27
–34
(2008). http://dx.doi.org/10.1016/j.gde.2007.12.006 COGDET 0959-437X Google Scholar
J. A. Joyce and J. W. Pollard,
“Microenvironmental regulation of metastasis,”
Nat. Rev. Cancer, 9
(4), 239
–252
(2008). http://dx.doi.org/10.1038/nrc2618 NRCAC4 1474-175X Google Scholar
Breast Cancer Facts and Figures 2013-2014, American Cancer Society, Inc., Atlanta
(2013). Google Scholar
K. Kalogerakos, C. Sofoudis and N. Baltayiannis,
“Early breast cancer: a review,”
Cancer Ther., 6 463
–476
(2008). Google Scholar
S. D. Kamath et al.,
“Autofluorescence of ovarian, normal, benign and malignant tissues: a pilot study,”
Photomed. Laser Surg., 27
(2), 325
–35
(2009). http://dx.doi.org/10.1089/pho.2008.2261 Google Scholar
J. S. Soares et al.,
“Diagnostic power of diffuse reflectance spectroscopy for targeted detection of breast lesions with microcalcifications,”
Proc. Natl. Acad. Sci. U S A, 110
(2), 471
–476
(2013). http://dx.doi.org/10.1073/pnas.1215473110 Google Scholar
B. Brozek-Pluska et al.,
“Raman spectroscopy and imaging: applications in human breast cancer diagnosis,”
Analyst, 137
(16), 3773
–3780
(2012). http://dx.doi.org/10.1039/c2an16179f ANLYAG 0365-4885 Google Scholar
Z. Volynskaya et al.,
“Diagnosing breast cancer using diffuse reflectance spectroscopy and intrinsic fluorescence spectroscopy,”
J. Biomed. Opt., 13
(2), 024012
(2008). http://dx.doi.org/10.1117/1.2909672 JBOPFO 1083-3668 Google Scholar
I. Georgakoudi et al.,
“Trimodal spectroscopy for the detection and characterization of cervical precancers in vivo,”
Am. J. Obstet. Gynecol., 186
(3), 374
–382
(2002). http://dx.doi.org/10.1067/mob.2002.121075 Google Scholar
A. Tam and H. Coufal,
“Pulsed opto-acoustics: theory and applications,”
J. Phys. Colloq., 44
(C6), C6–9
–C6–20
(1983). http://dx.doi.org/10.1051/jphyscol:1983602 Google Scholar
A. Rosencwaig,
“Photoacoustic spectroscopy,”
Annu. Rev. Biophys. Bioeng., 9 31
–54
(1980). http://dx.doi.org/10.1146/annurev.bb.09.060180.000335 Google Scholar
J. B. Kinney and R. H. Staley,
“Applications of photoacoustic spectroscopy,”
Annu. Rev. Mat. Sci., 12 295
–321
(1982). http://dx.doi.org/10.1146/annurev.ms.12.080182.001455 ARMSCX 0084-6600 Google Scholar
C. K. N. Patel and A. C. Tam,
“Optical absorption coefficients of water,”
Nature (London), 280
(5720), 302
–304
(1979). http://dx.doi.org/10.1038/280302a0 NATUAS 0028-0836 Google Scholar
S. D. Kamath, S. Ray and K. K. Mahato,
“Photoacoustic spectroscopy of ovarian normal, benign, and malignant tissues: a pilot study,”
J. Biomed. Opt., 16
(6), 067001
(2011). http://dx.doi.org/10.1117/1.3583573 JBOPFO 1083-3668 Google Scholar
M. Priya et al.,
“Prediction of absorption coefficients by pulsed laser induced photoacoustic measurements,”
Spectrochim. Acta, Part A, 127 85
–90
(2014). http://dx.doi.org/10.1016/j.saa.2014.02.021 Google Scholar
F. L. Lizzi et al.,
“Theoretical framework for spectrum analysis in ultrasonic tissue characterization,”
J. Acoust. Soc. Am., 73
(4), 1366
–1373
(1983). http://dx.doi.org/10.1121/1.389241 Google Scholar
J. Xia, J. Yao and L. V. Wang,
“Photoacoustic tomography: principles and advances,”
Electromagn. Waves (Cambridge), 147 1
–22
(2014). Google Scholar
M. P. Patterson et al.,
“Optoacoustic characterization of prostate cancer in an in vivo transgenic murine model,”
J. Biomed. Opt., 19
(5), 056008
(2014). http://dx.doi.org/10.1117/1.JBO.19.5.056008 JBOPFO 1083-3668 Google Scholar
G. Xu et al.,
“The functional pitch of an organ: quantification of tissue texture with photoacoustic spectrum analysis,”
Radiology, 271
(1), 248
–254
(2014). http://dx.doi.org/10.1148/radiol.13130777 RADLAX 0033-8419 Google Scholar
G. Xu et al.,
“Photoacoustic spectrum analysis for microstructure characterization in biological tissue: analytical model,”
Ultrasound Med. Biol., 41
(5), 1473
–1480
(2015). http://dx.doi.org/10.1016/j.ultrasmedbio.2015.01.010 Google Scholar
K. H. Song and L. V. Wang,
“Deep reflection-mode photoacoustic imaging of biological tissue,”
J. Biomed. Opt., 12
(6), 060503
(2007). http://dx.doi.org/10.1117/1.2818045 JBOPFO 1083-3668 Google Scholar
A. Aguirre et al.,
“Potential role of coregistered photoacoustic and ultrasound imaging in ovarian cancer detection and characterization,”
Transl. Oncol., 4
(1), 29
–37
(2011). http://dx.doi.org/10.1593/tlo.10187 Google Scholar
R. E. Kumon, C. X. Deng and X. Wang,
“Frequency domain analysis of photoacoustic imaging data from prostate adeno carcinoma tumors in a murine model,”
Ultrasound Med. Biol., 37
(5), 834
–839
(2011). http://dx.doi.org/10.1016/j.ultrasmedbio.2011.01.012 Google Scholar
M. Priya et al.,
“Photoacoustic spectroscopy in the monitoring of breast tumor development: a pre-clinical study,”
Proc. SPIE, 8943 894347
(2014). http://dx.doi.org/10.1117/12.2039365 PSISDG 0277-786X Google Scholar
M. Priya et al.,
“Photoacoustic spectroscopy based evaluation of breast cancer condition,”
Proc. SPIE, 9303 93032L
(2015). http://dx.doi.org/10.1117/12.2079159 PSISDG 0277-786X Google Scholar
V. P. Zharov et al.,
“In vivo photoacoustic flow cytometry for monitoring of circulating single cancer cells and contrast agents,”
Opt. Lett., 31
(24), 3623
–3625
(2006). http://dx.doi.org/10.1364/OL.31.003623 Google Scholar
S. Mallidi, G. P. Luke and S. Emelianov,
“Photoacoustic imaging in cancer detection, diagnosis, and treatment guidance,”
Trends Biotechnol., 29
(5), 213
–221
(2011). http://dx.doi.org/10.1016/j.tibtech.2011.01.006 TRBIDM 0167-7799 Google Scholar
C. Sim et al.,
“Photoacoustic-based nanomedicine for cancer diagnosis and therapy,”
J. Control Release, 203 118
–125
(2015). http://dx.doi.org/10.1016/j.jconrel.2015.02.020 Google Scholar
S. Zackrisson, S. M. W. Y. van de Ven and S. S. Gambhir,
“Light in and sound out: emerging translational strategies for photoacoustic imaging,”
Cancer Res., 74
(4), 979
–1004
(2014). http://dx.doi.org/10.1158/0008-5472.CAN-13-2387 CNREA8 0008-5472 Google Scholar
K. Geng et al.,
“Thermoacoustic and photoacoustic tomography of thick biological tissues toward breast imaging,”
Technol. Cancer Res. Treat., 4 559
–566
(2005). http://dx.doi.org/10.1177/153303460500400509 Google Scholar
J. B. Kim, M. J. O’Hare and R. Stein,
“Models of breast cancer: is merging human and animal models the future?,”
Breast Cancer Res., 6
(1), 22
–30
(2004). http://dx.doi.org/10.1186/bcr645 BCTRD6 Google Scholar
A. Sierra,
“Animal models of breast cancer for the study of pathogenesis and therapeutic insights,”
Clin. Transl. Oncol., 11
(11), 721
–726
(2009). http://dx.doi.org/10.1007/s12094-009-0434-7 Google Scholar
E. A. Hendrickson,
“The SCID mouse: relevance as an animal model system for studying human disease,”
Am. J. Pathol., 143
(6), 1511
–1522
(1993). AJPAA4 0002-9440 Google Scholar
T. A. Martin et al.,
“Growth and angiogenesis of human breast cancer in a nude mouse tumor model is reduced by NK4, a HGF/SF antagonist,”
Carcinogenesis, 24
(8), 1317
–1323
(2003). http://dx.doi.org/10.1093/carcin/bgg072 CRNGDP 1460-2180 Google Scholar
W. Rodenburg et al.,
“Identification of breast cancer biomarkers in transgenic mouse models: a proteomics approach,”
Proteomics Clin. Appl., 4
(6–7), 603
–612
(2010). http://dx.doi.org/10.1002/prca.200900175 Google Scholar
A. Fantozzi and G. Christofori,
“Mouse models of breast cancer metastasis,”
Breast Cancer Res., 8
(4), 212
(2006). http://dx.doi.org/10.1186/bcr1530 BCTRD6 Google Scholar
V. M. Peut and A. M. Rice,
“A NOD/SCID model of primary human breast cancer,”
Open Transplant. J., 2
(1), 1
–8
(2008). http://dx.doi.org/10.2174/1874418400802010001 Google Scholar
K. Schulmeister et al.,
“Review of exposure limits and experimental data for corneal and lenticular damage from short pulsed UV and IR laser radiation,”
J. Laser Appl., 20
(2), 98
–105
(2008). http://dx.doi.org/10.2351/1.2900539 Google Scholar
M. M. Tomayko and C. P. Reynolds,
“Determination of subcutaneous tumor size in athymic (nude) mice,”
Cancer Chemother. Pharmacol., 24
(3), 148
–154
(1989). http://dx.doi.org/10.1007/BF00300234 CCPHDZ 0344-5704 Google Scholar
M. M. Jensen et al.,
“Tumor volume in subcutaneous mouse xenografts measured by microCT is more accurate and reproducible than determined by 18F-FDG-microPET or external caliper,”
BMC Med. Imaging, 8
(16), 1
–9
(2008). http://dx.doi.org/10.1186/1471-2342-8-16 Google Scholar
D. M. Euhus et al.,
“Tumor measurement in the nude mouse,”
J. Surg. Oncol., 31
(4), 229
–234
(1986). http://dx.doi.org/10.1002/jso.2930310402 JSONAU 0022-4790 Google Scholar
Protocol Online, “Xenograft tumor model protocol,”
(2005) http://www.protocol-online.org/prot/Protocols/Xenograft-Tumor-Model-Protocol-3435.html December 2015). Google Scholar
P. Lio,
“Wavelets in bioinformatics and computational biology: state of art and perspectives,”
Bioinformatics, 19
(1), 2
–9
(2003). http://dx.doi.org/10.1093/bioinformatics/19.1.2 BOINFP 1367-4803 Google Scholar
P. S. Addison,
“The continuous wavelet transform,”
The Illustrated Wavelet Transform Handbook: Introductory Theory and Applications in Science, Engineering, Medicine and Finance, 06
–64 1st ed.CRC Press, IOP Publishing Limited(2002). Google Scholar
L. Zhifang, L. Hui and X. Wenming,
“Characterization of photoacoustic signal using wavelet analysis,”
Proc. SPIE, 7845 78450I
(2010). http://dx.doi.org/10.1117/12.868613 PSISDG 0277-786X Google Scholar
A. H. Gharekhan et al.,
“Distinguishing autofluorescence of normal, benign and cancerous breast tissues through wavelet domain correlation studies,”
J. Biomed. Opt., 16
(8), 087003
(2011). http://dx.doi.org/10.1117/1.3606563 JBOPFO 1083-3668 Google Scholar
G. V. Belle et al., Biostatistics—A Methodology for the Health Sciences, 25
–60 2nd ed.John Wiley & Sons, Inc.(2004). Google Scholar
A. El-Koka, K. H. Cha and D. K. Kang,
“Regularization parameter tuning optimization approach in logistic regression,”
in 15th Int. Conf. on Advanced Communication Technology,
13
–18
(2013). Google Scholar
P. C. Austin and J. V. Tu,
“Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality,”
J. Clin. Epidemiol., 57
(11), 1138
–1146
(2004). http://dx.doi.org/10.1016/j.jclinepi.2004.04.003 JCEPEE 0895-4356 Google Scholar
S. C. Bagley, H. White Bagley and B. A. Golomb,
“Logistic regression in the medical literature: standards for use and reporting, with particular attention to one medical domain,”
J. Clin. Epidemiol., 54
(10), 979
–985
(2001). http://dx.doi.org/10.1016/S0895-4356(01)00372-9 JCEPEE 0895-4356 Google Scholar
S. Dreiseitl and L. Ohno-Machado,
“Logistic regression and artificial neural network classification models: a methodology review,”
J. Biomed. Inform., 35
(5–6), 352
–359
(2002). http://dx.doi.org/10.1016/S1532-0464(03)00034-0 Google Scholar
C. Persello and L. Bruzzone,
“Active learning for domain adaption in the supervised classification of remote sensing images,”
IEEE Trans. Geosci. Remote Sens., 50
(11), 4468
–4483
(2012). http://dx.doi.org/10.1109/TGRS.2012.2192740 IGRSD2 0196-2892 Google Scholar
J. R. Mourant et al.,
“Predictions and measurements of scattering and absorption over broad wavelength ranges in tissue phantoms,”
Appl. Opt., 36
(4), 949
–957
(1997). http://dx.doi.org/10.1364/AO.36.000949 Google Scholar
N. Ramanujam,
“Fluorescence spectroscopy of neoplastic and non-neoplastic tissues,”
Neoplasia, 2
(1–2), 89
–117
(2000). http://dx.doi.org/10.1038/sj.neo.7900077 Google Scholar
C. E. Webb, J. D. C. Jones,
“Lasers applications in biology and biotechnology,”
Handbook of Laser Technology and Applications: Applications, 2123
–2128 CRC Press(2003). Google Scholar
L. Zhang et al.,
“Tryptophan as the fingerprint for distinguishing aggressiveness among breast cancer cell lines using native fluorescence spectroscopy,”
J. Biomed. Opt., 19
(3), 037005
(2014). http://dx.doi.org/10.1117/1.JBO.19.3.037005 JBOPFO 1083-3668 Google Scholar
L. A. Sordillo et al.,
“Optical spectral fingerprints of tissues from patients with different breast cancer histologies using a novel fluorescence spectroscopic device,”
Technol. Cancer Res. Treat., 12
(5), 455
–461
(2013). http://dx.doi.org/10.7785/tcrt.2012.500330 Google Scholar
I. Poschke et al.,
“Tumor-dependent increase of serum amino acid levels in breast cancer patients has diagnostic potential and correlates with molecular tumor subtypes,”
J. Transl. Med., 11 290
(2013). http://dx.doi.org/10.1186/1479-5876-11-290 Google Scholar
Y. Miyagi et al.,
“Plasma free amino acid profiling of five types of cancer patients and its application for early detection,”
PLoS One, 6
(9), e24143
(2011). http://dx.doi.org/10.1371/journal.pone.0024143 POLNCL 1932-6203 Google Scholar
C. Frantz, K. M. Stewart and V. M. Weaver,
“The extracellular matrix at a glance,”
J. Cell Sci., 123 4195
–4200
(2010). http://dx.doi.org/10.1242/jcs.023820 Google Scholar
F. R. Balkwill, M. Capasso and T. Hagemann,
“The tumor microenvironment at a glance,”
J. Cell Sci., 125 5591
–5596
(2012). http://dx.doi.org/10.1242/jcs.116392 Google Scholar
Y. Saijo, H. Sasaki,
“High frequency acoustic properties of tumor tissue,”
Ultrasonic Tissue Characterization, 217
–229 Springer, Japan
(1996). Google Scholar
V. Backman et al.,
“Detection of pre-invasive cancer cells,”
Nature, 406 35
–36
(2000). http://dx.doi.org/10.1038/35017638 Google Scholar
D. K. Yao et al.,
“Optimal ultraviolet wavelength for in vivo photoacoustic imaging of cell nuclei,”
J. Biomed. Opt., 17
(5), 056004
(2012). http://dx.doi.org/10.1117/1.JBO.17.5.056004 Google Scholar
D. Zink, A. H. Fischer and J. A. Nickerson,
“Nuclear structure in cancer cells,”
Nat. Rev. Cancer, 4
(9), 677
–687
(2004). http://dx.doi.org/10.1038/nrc1430 Google Scholar
D. K. Yao et al.,
“In vivo label-free photoacoustic microscopy of cell nuclei by excitation of DNA and RNA,”
Opt. Lett., 35
(24), 4139
–4141
(2010). http://dx.doi.org/10.1364/OL.35.004139 Google Scholar
BiographyMallika Priya, MTech, has earned post-graduation in biotechnology from School of Biotechnology and Bioinformatics, Padmashree Dr. D. Y. Patil University, Mumbai, India (2010). She is currently working as a PhD student at the Department of Biophysics, School of Life Sciences, Manipal University, Karnataka, India. Her interdisciplinary doctoral study involves discrimination of breast cancer stages by photoacoustic spectroscopy signatures in a xenograft model and clinical samples by PCA, wavelet and logistic regression analysis. Bola Sadashiva Satish Rao, PhD, is currently serving as professor and head of the Department of Radiation Biology and Toxicology, School of Life Sciences, Manipal University, Karnataka, India. He received his PhD degree in radiobiology from Mangalore University in 1994. He has been associated with Manipal University since 1987. His research interests are in cellular and molecular biology, radiation biology, experimental oncology and therapeutics, environmental toxicology and occupational hazards and heavy metal/nanotechnology. Subhash Chandra is currently working as a junior engineer in the Department of Biophysics, School of Life Sciences, Manipal University, Karnataka, India. He has completed his Diploma in Engineering (E&C) from N.R.A.M.P. NITTE, Karnataka. He is associated with the Department of Biophysics, School of Life Sciences since 2010 and has been a major hand in maintaining lasers and fabrication of mechanical components to develop photoacoustic and fluorescence based setups. Anirbit Datta, BE, is a graduate in electronics and communication (2015), from the Department of Electronics and Communication, Manipal Institute of Technology, Manipal University, Karnataka, India. He has carried out his BE final-year project work in the Department of Biophysics, School of Life Sciences, Manipal. His research interests include signal processing, machine learning and big data analysis. He is currently working as a trainee decision scientist in Mu Sigma Inc., Bangalore, India. Subramanya G. Nayak, PhD, is currently serving as professor at the Department of Electronics and Communication, Manipal Institute of Technology, Manipal University Karnataka, India. He has been associated with Manipal Institute of Technology, Manipal University, Karnataka for the past 16 years. He has a PhD degree in electrical and electronics engineering. He has been actively involved in academics related activities at the institute, and his research areas are bio signal recognition, and embedded system design and applications. Krishna Kishore Mahato, PhD, is currently serving as professor and head of the Department of Biophysics, School of Life Sciences, Manipal University, Karnataka, India. He has received his PhD degree in the field of laser spectroscopy from Banaras Hindu University, Varanasi, India (1998). He has been associated with Manipal University since 2002. His areas of interest are photoacoustic/fluorescence spectroscopy based cancer research and diagnosis, low level laser therapy, wound healing, fluorescence based protein characterization and bioengineering. |