Optimization of combination chemotherapy based on the calculation of network entropy for protein-protein interactions in breast cancer cell lines
© Carels et al. 2015
Received: 16 April 2015
Accepted: 20 July 2015
Published: 11 August 2015
In this report, we show how entropy computation can be applied to the characterization of a protein-protein interaction networks to assist the selection of personalized chemotherapeutic strategy for cancer treatment.
With seven malignant (BT-20, BT-474, MDA-MB-231, MDA-MB-468, MCF-7, T-47D, ZR-75-1) and one healthy (MCF10A) cell lines, we combined interactome and transcriptome data as well as Shanon entropy computation to classify drugs according to their inhibitory potential and to identify the top-5 protein targets best suited for personalized chemotherapy.
We have investigated breast cancer cell lines and found that the entropy of their protein interaction networks is negatively correlated with their sensitivity to target-specific drugs of high potency. This sensitivity is defined as half cell growth inhibition (GI50) with respect to drug administration. By contrast, we found no correlation for drugs that are either of low potency or with no specific molecular targets (cytotoxic). As a result, drugs can be divided into target specific and generally cytotoxic according to the GI50 they produce in malignant cell lines. By extrapolation, we predict that the inactivation of the top-5 up-regulated protein hubs by specific drugs will reduce the protein network entropy by ~2 %, on average, which is expected to substantially increase the benefit of a personalized chemo-therapeutic strategy for patient survival.
We propose several novel drug combinations using only the approved drugs for the inactivation of the target identified in this study with the purpose of increasing patient survival and lowering the deleterious side effects of cancer chemotherapy.
KeywordsBreast cancer Entropy Interaction network Histological subtype Chemotherapy
The text of this report may appear somewhat specialized to some readers not familiar with drug development or systems biology. In order to improve readability of this paper, we introduce the definitions of the key concepts used in what follows.
In molecular biology, an interactome is the whole set of molecular interactions in a particular cell. It refers specifically to physical interactions among proteins, also known as protein-protein interactions, i.e., physical contacts established between two or more proteins as a result of biochemical events and/or biophysical forces. Here, we more particularly refer to transient interactions among proteins in the context of signaling networks, i.e., the protein pathways that connect protein receptors on the cell surface with transcription factors that (up- or down-) regulate gene expression. Mathematically, interactomes are generally displayed as graphs (networks).
Complex networks are ubiquitous in nature. Mathematically, a network may be described by either a directed or undirected graph G = (V, E) with vertex and edge sets V and E, respectively. An edge appears in the graph if there is a known interaction of the two partners, for example two interacting proteins in a cell, either by direct binding or by enzymatic catalysis. A node is referred to as a node of degree k if it is connected to other nodes by k edges. The connectivity level (or rate) of a network characterizes the average number of interactions (edges) per node. When, a node has a number of interactions (connections or edges) significantly larger than the average, it is called a hub. Top-5 (or 10, or more) refers to the 5 (or 10) best items in a list for a given feature under consideration.
In information theory, entropy (the so-called Shannon entropy) is the negative of the expected value of the information contained in a message received. Mathematically speaking the Shannon entropy, H, of a discrete random variable X is a measure of the amount of uncertainty associated with the value of X when only its distribution is known. So, for example, if the distribution associated with a random variable is constant (i.e. equal to some known value with probability 1), then entropy is minimal and equal to 0.
Here, the biological system studied represents the interactome structure for a cell, i.e., the number of edges (interactions with neighbor proteins) per node (proteins in the network). The probability distribution of the events (the probability of a given number of edges per node), coupled with the information amount (the probability of a given number of edges for the node considered multiplied by its base 2 logarithm) of every event (node), forms a random variable whose average (also termed expectation value) is the average amount of information. Its inverse is the network entropy generated by this distribution.
Half cell growth inhibition (GI50)
In the context of whole-cell assays, GI50 is the concentration of a drug that is needed to inhibit 50 % of cell proliferation.
Breast cancer is a global disease. It is the most common cancer in women (25 % of all cancers), with nearly 281,840 estimated new cases, and 40,290 estimated deaths in 2015 in US population (http://seer.cancer.gov). Breast cancer is also becoming an increasingly urgent problem in low- and middle-income countries.
Endocrine receptors, i.e., estrogen (ER) or progesterone (PR) receptors. Breast tumors that grow in response to estrogen are classified as ER+ while those that grow in response to progesterone are classified as PR+. ER+ or/and PR+ tumors (60 % of the cases) are likely to respond to endocrine therapies while ER- and PR- tumors (5 to 10 % of the cases) are not.
Human epidermal growth factor receptor 2 (HER2). Malignant cells up-regulate a protein known as HER2/neu in about 20 to 25 % of breast tumors and results in a HER2+ phenotype. These breast tumors tend to be much more aggressive and fast-growing.
Triple negative (TN). About 15–25 % of breast tumors do not over-express any of estrogen, progesterone, or HER2 receptors. TNs are more difficult to treat, since most chemotherapeutic agents target one of the ER, PR or HER receptors and often require combination therapies . The name TN is sometimes used as a surrogate term for basal-like and comprises a very heterogeneous group of cancers. There is no standard classification scheme for TNs, but these malignant cells are frequently defined by cytokeratin 5/6 and EGFR staining. However, no clear criteria or cutoff values have been standardized yet.
The chemotherapy regimens used in breast cancer have a relatively low level of molecular specificity with a wide range of acute and long-term side effects that can be substantially deleterious to patients. In addition, clinicians cannot accurately predict the risk of metastasis development in individual patients. Currently, among about 80 % of patients that received adjuvant chemotherapy, approximately 40 % relapse and ultimately die of metastatic tumors. A further complicating factor in these analyses is that many women who would be cured by local treatment alone, which includes surgery and radiotherapy, will be ‘over-treated’ and suffer the toxic side effects of chemotherapy needlessly. Based on this context, new strategies, models or paradigms are urgently needed to identify patients, who are at the highest risk for developing metastases, and which might benefit from specific drugs. This approach is at the core of personalized medicine (also referred to as precision medicine) today.
A tremendous effort is ongoing worldwide to improve treatment success and decrease deleterious side effects in patients. With that concern, cell-lines are very useful models for the identification of clinically relevant molecular determinants of tumor response to drugs. It has been reported that cell lines are, indeed, worthwhile models of primary tumors at both the transcript and genome copy-number levels . The comparative analysis of pathways has shown that the majority of subtype-specific signaling sub-networks are conserved between cell lines and tumors. This similarity is important, given the very different environments between a cell line growing in axenic culture and a primary or metastatic tumor exposed to in vivo conditions. This supports the consistency of in vitro investigations as relevant inferences for clinical testing .
As a fruit of ~30 years of investigations, the interactions between cellular proteins reached a sufficiently high level of description for modeling complex molecular processes such as those involved in cancer. Here, we applied this vast systems biology knowledge base to better understand the behavior of malignant cell lines subjected to drug treatments. We used network entropy as a quantitative measure according to the definition of Shannon  to characterize the complexity of protein interaction networks as described by Breitkreutz et al. . We used Eq. (3) to evaluate the network entropy of each of the protein-protein interaction networks considered. This means, we first generated a rank-order distribution function for each network and associated the frequency of a particular number of edges connected to nodes with a probability function, p(k). This was repeated for each particular network with its rearrangement as a result of removing the edges corresponding to the inhibition of a specific protein-protein interaction due to a targeted pharmacological agent. We have chosen the top 5 protein hubs as predicted best targets for inhibition by drug molecules Our objective has been to quantify the benefit associated to the target inactivation of top-5 protein hubs in up-regulated genes rather than non-hub proteins . We found that the proportion of total entropy represented by top-5 hub proteins is ~2 % of total protein network, on average, which by extrapolation, in the case of breast cancer is expected to bring the 5-year survival in the majority of the cases to 100 % [6, 8], i.e., to improve the 10-year survival expectancy or perhaps even to result in a permanent cure. Consequently, we proposed a few optimized drug combinations based on our inferences using the approved inhibitors available on the market. Each combination is specific to a particular subtype of breast cancer due to their differences in the topology of the corresponding interaction networks.
The protein connectivity that serves as data for entropy calculation in the present work is based on the protein interactions given in the file intact-micluster.zip available from ftp://ftp.ebi.ac.uk/pub/databases/intact/current/psimitab/ (accessed on 04.04.2014). We selected the two columns of UniprotKB identifiers (UID) in the intact-micluster.zip file and eliminated the incomplete pairs (marked as “-”, i.e., when an intact access number has no UniprotKB equivalent known). The resulting file contained 308,314 protein pairs. This interaction file was then processed to form a non-redundant UID list used to retrieve the corresponding protein sequences (68,504) by querying UniprotKB at http://www.uniprot.org/help/uniprotkb. Since some UID were obsolete, we substituted them with their current name retrieved by querying the field search at UniprotKB using the format ‘replaces:obsolete UID’. The equivalence between UID and human genes was obtained by homology search (tBLASTn) of protein sequences (68,504) used as queries and human coding sequences (CDS) used as subjects from the dataset (hs37p1.EID.tar.gz) of Fedorov’s laboratory  available at http://bpg.utoledo.edu/~afedorov/lab/eid.html. Homologous hits were considered significant when their score was ≥120, E-value ≤10−4 and identity rate ≥80 % over ≥50 % of query size (http://mitointeractome.kobic.kr/supplement.php). After elimination of subject redundancy (keeping the hit matching the largest identity rate), the final size of human CDS dataset fully described by protein interactions was 17,301.
We recovered transcriptome datasets of cell lines (BT-20, BT-474, MDA-MB-231, MDA-MB-468, MCF-7, MCF10A, T-47D, ZR-75-1, see information at http://www.atcc.org/) from http://www.illumina.com/science/data_library.ilmn. The gene expression profile was evaluated through a homology search with the human CDS sample of Fedorov’s laboratory. The fifty bp sequences from transcriptome tags were used as queries in homology searches (BLASTn) in human CDSs. The homology redundancy in the BLASTn output file gave us the tag count per gene, i.e., a profile of human gene expression for the considered sample. Homologous hits were considered significant when covering ≥25 bp (50 % of size).
Each gene expression profile (tag count per gene) was normalized according to CDS size and whole tag count using the formula (109*C)/(N*L), where 109 is a correction factor, C is the number of reads that match a gene, N is the total mappable tags in the experiment, and L is the CDS size . When tags were counted for more than one gene isoform (alternative splicing forms), we cumulated counts and allocated them to just one form (the largest one); this strategy means that we looked for gene expression and not isoform expression.
Net entropy differences occur between cell lines because of a combination of the interactome (network of protein interactions) that define in a fixed way the number of interactions between a protein and its neighbors in the network and the transcriptome that shows whether a gene (corresponding to a node in the protein network) is expressed or not. The interactome does not change from one cell line to another in our computational experiments because it is the product of ~30 years of wet lab experimentation. By contrast, the transcriptome (gene expression) is relatively easy to measure by high throughput sequencing techniques, which allow the identification whether a gene for a protein of the network is expressed or not according to the cell line under consideration. If the gene is not expressed, the corresponding node in the network does not exist in the cell line in the expression state considered and its entropy is not included. In the other cases where the expression is larger than zero, the entropy is computed with the consequence that the network is Boolean in essence, which is an approximation in the sense that each node could be modulated by its level of expression to compute the entropy. However, the Shannon entropy does not account for relative statistical weights and hence this level of information has been neglected.
Classification of genes according to expression rates
Since genes with a low expression rate are the most numerous, the distribution of gene frequency according to normalized tag counts is Poisson like. To classify genes into down- or up-regulated, a symmetrical distribution is necessary in order to estimate a p-value on a Gaussian curve resulting from the best fit with the observed distribution.
To obtain a symmetrical distribution, we subtracted the normalized (according to size and number) data from the transcriptome of a malignant cell line from the non-tumoral cell line (MCF10A). After normalization using Q-norm, the distribution’s mean was close to zero for any comparison between a malignant cell line and the control. The log10(x i + 1) transformation brought the observed distribution closer to a Gaussian distribution. We used PRISM to perform the best fit (95 %) with a Gaussian distribution of log10(x i + 1) data classified by increasing values from the largest negative number to the largest positive number. In this investigation, we only considered up-regulated genes since it is those genes that encode proteins targeted under the classical concept of protein inhibition by drug binding. The boundaries corresponding to p-values of 1 % considering a one-tails p-value (up-regulated side) on the best fit of a Gaussian distribution were used to calculate the classification threshold of down- and up-regulated genes on the observed distribution using the inverse function, i.e., 10log10(xi+1) and subtracting 1 from the result of the exponential. The up-regulated genes at p-values <0.001 were those with positive values higher than the classification thresholds of +150.
To calculate the entropy correlation between potentially up-regulated genes (≥150 reads per gene) and the total gene sample, we identified the list of genes for which up-regulation occurred at least once over all seven malignant cell lines and summed the entropy per cell line over these gene subsets for the eight cell lines (including the reference MCF10A). We did the same calculation for the set of genes corresponding to the total set minus the potentially up-regulated genes referred to as the complement and verified that the total entropy was indeed the sum of those of potentially up-regulated genes and the complement over the eight cell lines. Finally, we calculated correlations of entropies by pairs considering the potentially up-regulated, complement and whole sets of genes.
The GI50 were derived from the sd02 datasheet (http://www.pnas.org/lookup/suppl/doi:10.1073/pnas.1018854108/-/DCSupplemental/sd02.xlsx) from Heiser et al.  and the target annotation associated to these drugs from the sd04 datasheet of the same source. The 74 drugs selected for screening cover a wide range of targets and processes implicated in cancer biology and progression and can be classified into two major groups: (i) agents that target specific receptors (n =54) and (ii) general cytotoxic chemotherapeutics (n = 20), defined as various. Thus, we analyzed the correlation between the –log10(GI50), associated to both target-specific and broadly cytotoxic drugs, and the corresponding entropy per node of the protein network in the control MCF10A cell line as well as in luminal A (MCF-7, T-47D, ZR-75-1), luminal B (BT-474) and triple-negative (BT-20, MDA-MB-231, MDA-MB-468) malignant cell lines.
Benefit of targeting up-regulated protein hubs as therapeutic targets
Since the selection of up-regulated protein targets is expected to reduce as much as possible the incidence of adverse side effects for the patient, we calculated the benefit, in terms of entropy per node, that could be associated to the inactivation of top-5 most connected proteins (hubs). To do this, we simply computed the entropy per node associated to top-5 most connected proteins in the context of the up-regulated sub-network and computed the relative difference of entropy per node of this sub-network with and without these top-5 hub proteins. According to Cheang et al. , the average probability of 5-year survival of patients with luminal breast cancer is ~90 % while that of the patients with triple-negatives is ~70 % and the control is of course 100 %. Thus, we measured the benefit of protein inactivation by the reduction of entropy of its protein network with the consequence that it comes closer to that of the non-tumoral cell, which can be predicted in terms of benefit (%) to the patient by interpolation using the orthogonal regression line through the average protein network entropy of luminal, triple-negative and control cell lines as well as their associated 5-year patient survival. Finally, we searched the database of clinically approved drugs that potentially inhibit top-5 hub targets and proposed their optimized combination for new cocktails in the treatment of breast cancer.
Statistics of sample size and entropy per node of cell line samples
Since it may seem unwarranted to draw conclusions from (i) data that only differ in the second decimal place and (ii) the size of the cell sample addressed here is too small to conduct statistical testing based on the variance, we analyzed entropy patterns among cell lines to detect whether some internal consistency may justify the general trends reported here. We found a positive correlation (r = 0.72, P = 0.04) between the entropies of the subsets of potentially up-regulated genes (n = 923) taking the eight cell lines into account and the entropies of the total gene set (n = 9724) (Fig. 1a). By contrast, we did not find any significant correlation (r = −0.29, P = 0.51) when comparing entropies of the gene set (n = 8801) corresponding to the total sample minus the potentially up-regulated ones (referred to as the complement) with the total sample (n = 9724) (Fig. 1b). However, the comparison of potentially up-regulated genes (n = 923) to the complement (n = 8801) demonstrated a correlation of r = −0.93 (P = 0.0002) (Fig. 1c). This pattern is unlikely to occur just by chance or by some sample bias and demonstrates that the general conclusions drawn in this paper are consistent.
The panel of cytotoxic drugs classified according to their therapeutic targets, primary effector pathways, or signaling pathway; and the sensitivity for each malignant cell lines
The panel of targeted drugs classified according to their therapeutic targets, primary effector pathways, or signaling pathway; and the sensitivity for each malignant cell lines
Cell cycle ∘
Expected benefit of drug combination in breast cancer therapy using entropy data from Additional file 2: Table S2 and the relationship y = −0.0005*x + 11.4732
Gefitinib, erlotinib, cetuximab, lapatinib, panitumumab, vandetanib, trastuzumab, pertuzumab, afatinib, neratinib, AZD9291, CLO-1686a 
Gefitinib, erlotinib, cetuximab, lapatinib, panitumumab, vandetanib, trastuzumab, pertuzumab, afatinib, neratinib, AZD9291, CLO-1686a 
Trastuzumab, pertuzumab, NeuVax vaccinea 
A drug such as fusicoccin is expected to be effective against most breast cell lines because its protein target is almost always among the top differentially expressed hub proteins (except in MCF-7). By contrast, to increase the patient’s 10-year survival, one should complement fusicoccin with some other drugs according to the cell line under consideration. Of course, due to tumor heterogeneity, several cellular phenotypes can be identified at the same time complicating the issue of optimal drug selection immensely. However, the same reasoning should be applied to all of the phenotypes represented, perhaps with a statistical weight applied, but an in-depth discussion on this issue is beyond the scope of the present study so we only address isolated cell lines in what follows. To complement fusicoccin, one may consider the following panel of drugs from the theoretically most efficacious to the less efficacious according to the entropy of their respective target in cell signaling networks: gefitinib, erlotinib, cetuximab, lapatinib, panitumumab, vandetanib, trastuzumab, pertuzumab, afatinib, neratinib, AZD9291 or CLO-1686 > difopein or R18 (triple-negative); CGP78850 or C90 > HDGF-H3 or NSC348884 (Luminal A); and difopein or R18 > trastuzumab, pertuzumab or NeuVax vaccine (Luminal B).
The protein network representation used in this study can be considered very large since from a total of 9724 genes, an average of 7207 were included at the same time for all eight cell lines. In spite of the fact that all conclusions drawn in this paper rely on entropy differences in the second decimal place, we believe that they are significant because an internal pattern was found in the computational experiment in the form of negative and positive correlations that cannot be explained by chance. Of course, the pattern is not only due to up-regulated genes since, for entropy comparison, the same sample size must be taken for all seven malignant cell lines and the number of significantly up-regulated genes is not the same in each of these lines. Thus, according to the cell line a sizeable number of non up-regulated genes may contaminate that sample. However, it is in the potentially up-regulated genes that one must look for an increase of entropy correlating with cell malignancy, which is also consistent with the increased metabolism of malignant cells.
The exercise of correlating potentially up-regulated genes to the gene complement demonstrates the internal consistency of our sample according to the entropy calculation made here. The pattern of entropy distribution found recapitulates the notion that the more malignant a cell line is, the larger is the associated entropy of its network. Interestingly the protein network being finite by nature, if the entropy of up-regulated genes increases, a compensation effect occurs at a cost represented by the entropy of the total network. However, still a higher level of significance exists when considering entropy differences over the whole sample because this also takes into account genes that are not necessarily significantly up-regulated, but still over-expressed compared to the reference. The sum total of entropies over these genes makes a difference at the whole sample level. Therefore, a sample of potentially up-regulated genes cannot be taken into account to calculate the benefit of drug treatment to the patient; the right choice involves the full set of genes.
A correlation between malignancy and PNE was first shown by Breitkreutz et al.  by considering different types of tumors, which incidentally did not include breast cancer. Here, we studied several breast cancer cell lines and a similar trend has been observed (Table 1). It is conceivable that the small entropy differences observed here are at least in part due to this specific situation of dealing with a single cancer type, but also due to the different methodology used by Breitkreutz et al. .
The boundary between both types of targets (broadly cytotoxic and target-specific) is not always clear because some specific targeting drugs may have unintended off-target effects leading to inhibition of DNA synthesis, for instance, such as is the case with methotrexate, which is specific for the folate receptor, hence inhibiting purine and pyrimidine base biosynthesis and ultimately blocking DNA synthesis. When grouping drugs by more narrowly defined activities, the noise in the data tended to be reduced and a positive correlation appeared for cytotoxic drugs (data not shown), but the negative correlation associated to specific targets remained. Here, we took a conservative position and presented the data without additional potentially confounding filtering operations. However, it is interesting to note that if the positive correlation between –log10(GI50) of cytotoxic drugs and PNE were to be confirmed in the future, it would mean that cytotoxic drugs are involved in another type of relationship regarding cell sensitivity compared to target-specific drugs. Rather, it means that the system relies much more on the mechanism that is inactivated when the entropy of the system is high than when it is low. As a metaphor for this concept, the consequences of a central power plant destruction are much greater for an industrialized country than for a developing one, simply because the entire system depends on it due to strong interconnectedness.
A negative correlation between –log10(GI50) and PNE has the consequence that cells with more complex protein networks (higher entropy) have more options to explore as alternative pathways in order to cope with target inhibition, which is not the case with cells characterized by less complex protein networks (lower entropy) which, as a consequence, are expected to take more time to adapt (or eventually die). This notion is reminiscent of the gene-for-gene concept described by Harold Henry Flor  in plant pathology. In plants, the gene-for-gene relationship is generally seen as the collapse of a host’s resistance to a parasite that may occur as a response to a mutation in a parasite’s gene of virulence that allows it to overcome the host’s resistance and invade its tissues. For this reason, plant breeding has traditionally coped with resistance collapse through the accumulation of genes encoding host resistance. This process of gene accumulation, which has been called gene pyramidation , lowers the probability of parasite adaptation by increasing its virulence because the accumulation of virulence genes in a parasite generally decreases its fitness in its environment and, as a consequence, decreases its likelihood.
Conversely, in the case of cancer cell resistance to drugs, the more complex the protein network, the more alternative escape routes/pathways it has compared to a specific target inhibition. The fact that susceptibility of malignant cell lines is always higher for target-specific than cytotoxic drugs suggests that drug development efforts should be concentrated on target-specific drugs and combinations thereof. Thus, it is natural to expect the development of resistance to specific drugs by malignant cell according to the gene-for-gene or, here, the gene-for-inhibitor concept. Following this logic, one may consider a gene-for-inhibitor relationship in the case of malignant cell lines vis-à-vis drugs. Accordingly, formulating drugs into a cocktail should overcome malignant cell’s resistance, which is actually one of the modern therapeutic trends [6, 7]. However, formulating a drug combination should also account for the dose-limiting negative side effects for normal cells and to protect the immune system’s integrity. Thus, we first seek the most probable protein targets (top-5) for drug inhibition in order to maximize the patient benefit from such a therapeutic combination. Top-5 is justified by the fact that more than five drugs cannot be realistically fit within one drug capsule. Of course, drugs could be administrated in several capsules or through intravenous injections. However, such developments should be seen in the scope of clinical trials that we do not address here.
The concept of patient benefit maximization is closely related to the choice of protein targets that act as connectivity hubs in the signaling pathway, but are up-regulated in malignant cells compared to normal cells in order to minimize deleterious side effects for the patient’s health. We found that in the majority of cases, one hub-specific drug would be enough to bring the 5-year survival expectancy close to that of normal cells, based on entropy calculations. When the benefit of 5-year survival is estimated to exceed 100 %, it simply means that the benefit should be seen in more than 5-year survival expectancy, i.e., 10-year survival expectancy, which is now the state of the art in breast cancer statistical evaluation.
In general, it is hard to determine what cocktail should be applied to maximize the 10-year survival expectancy and minimize deleterious side effects on patients. Currently, the only way to shed light on this issue is through a trial-and-error experimentation. However, drug combinations from Table 4 could be good starting points based on rational arguments and they can be evaluated immediately in clinical trials since most of the drugs involved are already approved. Interestingly, without taking clinical considerations into account, our investigation shows that fusicoccin should be a basic cocktail component, which should be complemented with other drugs according to the specific breast cancer type developed by the patient in order to maximize the 10-year survival expectancy, which opens an important avenue for personalized medicine.
The response rate to a chemotherapeutic drug treatment may be relatively low in a population of unselected patients. To improve the effectiveness of cancer therapies, a repurposing strategy should include tumor phenotype characterization by molecular techniques in order to design a treatment regimen optimal for the patient outcome. We found that the susceptibility of malignant cells to drugs that are specific for their target is negatively correlated with the entropy of their protein-protein interaction network, which implicitly means that malignant cell resistance to specific drugs is due to the larger number of potential alternative routes in their signaling network. The consequence of the positive correlation between protein network entropy and malignant cell resistance to specific drugs is that drug cocktails addressing a large number of protein targets are expected to be more effective for the treatment of malignant cell lines with high entropy levels than cocktails targeting only a few proteins. We show that the best protein targets to be addressed for drug development are those (i) whose entropy is large (interaction hubs) and (ii) that are up-regulated in malignant cells compared to normal cells. It is easy to understand that the larger the interaction rate of a protein hub is, the greater its inactivation effect will be on the protein network. It is also readily appreciated that the inactivation of up-regulated hub targets in malignant cells compared to normal ones is more beneficial to the patient because this will minimize negative side effects of a drug treatment. Since specific drugs are more potent and potentially safer than cytotoxic ones, we propose a rational methodology based on protein network entropy to choose the best cocktail of specific drugs according to the protein profile of malignant cells for a given tumor. Our approach differs from the traditional drug repurposing since it allows the application of personalized therapies that should affect essential breast cancer pathways resulting in malignant cell death with minimal side effects for normal cells. In addition, the strategy outlined here should be easy to extend to the personalized therapy of other cancer types.
This research was supported by a fellowship from CAPES-Fiocruz (cooperation term 001/2012 CAPES-Fiocruz) to T. M. Tilli, the National Institute for Science and Technology on Innovation on Neglected Diseases (INCT/IDN, CNPq, 573642/2008-7), the Canadian Breast Cancer Foundation, the Allard Foundation and the Alberta Cancer Foundation.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
- Onitilo AA, Engel JM, Greenlee RT, Mukesh BN. Breast cancer subtypes based on ER/PR and Her2 expression. Comparison of clinicopathologic features and survival. Clin Med Res. 2009;7:4–13.View ArticleGoogle Scholar
- Hudis CA, Gianni L. Triple-negative breast cancer: an unmet medical need. Oncologist. 2011;16:1–11.View ArticleGoogle Scholar
- Neves RM, Chin K, Fridlyand J, Yeh J, Baehner FL, et al. A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. Cancer Cell. 2006;10:515–27.View ArticleGoogle Scholar
- Heiser LM, Sadanandam A, Kuo W-L, Benz SC, Goldstein TC, et al. Subtype and pathway specific responses to anticancer compounds in breast cancer. Proc Natl Sci Acad USA. 2012;109:2724–9.ADSView ArticleGoogle Scholar
- Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27(3):379–423. doi:10.1002/j.1538-7305.1948.tb01338.x.MathSciNetView ArticleMATHGoogle Scholar
- Breitkreutz D, Hlatky L, Rietman EA, Tuszynski JA. Molecular signaling network complexity is correlated with cancer patient survivability. Proc Natl Acad Sci U S A. 2012;109:9209–12.ADSView ArticleGoogle Scholar
- Carels N, Tilli T, Tuszynski JA. A computational strategy to select optimized protein targets for drug development toward the control of cancer diseases. PLoS One. 2015;10:e0115054.View ArticleGoogle Scholar
- Breitkreutz D, Rietman EA, Hinow P, Healey L, Tuszynski JA. Complexity of molecular signaling networks for various types of cancer and neurological diseases correlates with patient survivability. In: BIOMAT 2013. Singapore: World Scientific; 2014. p. 250–62.Google Scholar
- Shepelev V, Fedorov A. Advances in the exon-intron database. Brief Bioinform. 2006;7:178–85.View ArticleGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–8.View ArticleGoogle Scholar
- Bolstad BM, Irizarry RA, Astrand M, Speed TP. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003;19:185–93.View ArticleGoogle Scholar
- Cheang MCU, Voduc D, Bajdik C, Leung S, McKinney S, et al. Basal-like breast cancer defined by five biomarkers has superior prognostic value than triple-negative phenotype. Clin Cancer Res. 2008;14:1368–76.View ArticleGoogle Scholar
- Carels N, Frias D. A statistical method without training step for the classification of coding frame in transcriptome sequences. Bioinformatics Biol Insights. 2013;7:35–54.View ArticleGoogle Scholar
- Kim M-S, Pinto SM, Getnet D, Nirujogi RS, Manda SS, et al. A draft map of the human proteome. Nature. 2014. doi:10.1038/nature13302.Google Scholar
- Arteaga CL, Engelman JA. ERBB receptors: from oncogene discovery to basic science to mechanism-based cancer therapeutics. Cancer Cell. 2014;25:282–303.View ArticleGoogle Scholar
- Bury M, Andolfi A, Rogister B, Cimmino A, Mégalizzi V, et al. Fusicoccin a, a phytotoxic carbotricyclic diterpene glucoside of fungal origin, reduces proliferation and invasion of glioblastoma cells by targeting multiple tyrosine kinases. Transl Oncol. 2013;6:112–23.View ArticleGoogle Scholar
- Cao W, Yang X, Zhou J, Teng Z, Cao L, et al. Targeting 14-3-3 protein, difopein induces apoptosis of human glioma cells and suppresses tumor growth in mice. Apoptosis. 2010;15:230–41.View ArticleGoogle Scholar
- Dong S, Kang S, Lonial S, Khoury HJ, Viallet J, Chen J. Targeting 14-3-3 sensitizes native and mutant BCR-ABL to inhibition with U0126, rapamycin and Bcl-2 inhibitor GX15-070. Leukemia. 2008;22:572–7.View ArticleGoogle Scholar
- Gay B, Suarez S, Caravatti G, Furet P, Meyer T, Schoepfer J. Selective GRB2 SH2 inhibitors as anti-Ras therapy. Int J Cancer. 1999;83:235–41.View ArticleGoogle Scholar
- Giubellino A, Gao Y, Lee S, Lee MJ, Vasselli JR, et al. Inhibition of tumor metastasis by a growth factor receptor bound protein 2 Src homology 2 domain-binding antagonist. Cancer Res. 2007;67:6012–6.View ArticleGoogle Scholar
- Qi W, Shakalya K, Stejskal A, Goldman A, Beeck S, et al. NSC348884, a nucleophosmin inhibitor disrupts oligomer formation and induces apoptosis in human cancer cells. Oncogene. 2008;27:4210–20.View ArticleGoogle Scholar
- Ren H, Chu Z, Mao L. Antibodies targeting hepatoma-derived growth factor as a novel strategy in treating lung cancer. Mol Cancer Ther. 2009;8:1106–12.View ArticleGoogle Scholar
- Schneble EJ, Berry JS, Trappey FA, Clifton GT, Ponniah S, et al. The HER2 peptide nelipepimut-S (E75) vaccine (NeuVax™) in breast cancer patients at risk for recurrence: correlation of immunologic data with clinical response. Immunotherapy. 2014;6:519–31.View ArticleGoogle Scholar
- Flor HH. Current status of the gene-for-gene concept. Annu Rev Phytopathol. 1971;9:275–96.View ArticleGoogle Scholar
- Patocchi A, Walser M, Tartarini S, Broggini GAL, Gennari F, Sansavini S, et al. Identification by genome scanning approach (GSA) of a microsatellite tightly associated with the apple scab resistance gene Vm. Genome. 2005;48:630–6.View ArticleGoogle Scholar