|
|
ORIGINAL ARTICLE |
|
Year : 2015 | Volume
: 11
| Issue : 4 | Page : 846-851 |
|
Protein-protein interaction networks and modules analysis for colorectal cancer and serrated adenocarcinoma
Hualong Yu1, Lan Ye2, Jianxin Wang1, Lei Jin1, Yanfeng Lv1, Miao Yu1
1 Department of Anorectal Surgery, The Second Hospital of Shandong University, Jinan, Shandong Province 250033, China 2 Department of Oncology, The Second Hospital of Shandong University, Jinan, Shandong Province 250033, China
Date of Web Publication | 15-Feb-2016 |
Correspondence Address: Jianxin Wang Department of Anorectal Surgery, The Second Hospital of Shandong University, Jinan, Shandong Province 250033 China
 Source of Support: None, Conflict of Interest: None  | Check |
DOI: 10.4103/0973-1482.140805
Purpose: To screen key modules and explore the potential mechanism of conventional colorectal cancer (CRC) and colorectal serrated adenocarcinoma (SAC). Materials and Methods: The microarray data of GSE36758 and GSE8671 were downloaded from Gene Expression Omnibus database. The differentially expressed genes (DEGs) in SAC versus colon carcinoma (CC) and CC versus normal control (NC) group were analyzed and the protein-protein interaction (PPI) networks for DEGs were constructed. The modules of PPI networks were further analyzed and the function enrichment analysis of all enrolled DEGs was carried out based on ToppGene database. Results: Total eight DEGs (SAC vs. CC) and 445 DEGs (CC vs. NC) were extracted based on the gene expression profile of GSE36758 and GSE8671, respectively. Total three PPI networks were constructed with DEGs in CC versus NC, SAC versus CC group, and DEGs in both two groups. Three modules were extracted from the PPI network of CC versus NC. Meanwhile, three modules were extracted from the network of DEGs in both two groups. Function enrichment analysis showed that DEGs involved in these modules were mainly associated with cellular activities. Conclusion: DEGs in modules of SAC and CRC were mainly involved in cellular activities pathways. The PPI networks and modules might contribute to the further study of pathogenesis for CRC and SAC based on the molecular level. Keywords: Colorectal cancer, modules, pathways, protein-protein network analysis, serrated adenocarcinoma
How to cite this article: Yu H, Ye L, Wang J, Jin L, Lv Y, Yu M. Protein-protein interaction networks and modules analysis for colorectal cancer and serrated adenocarcinoma. J Can Res Ther 2015;11:846-51 |
How to cite this URL: Yu H, Ye L, Wang J, Jin L, Lv Y, Yu M. Protein-protein interaction networks and modules analysis for colorectal cancer and serrated adenocarcinoma. J Can Res Ther [serial online] 2015 [cited 2021 Jan 23];11:846-51. Available from: https://www.cancerjournal.net/text.asp?2015/11/4/846/140805 |
> Introduction | |  |
Colorectal cancer (CRC) ranks the third disease for the high incidence and mortality in most western countries. [1] Serrated adenocarcinoma (SAC) has been recognized as a new subtype of CRC according to the WHO classification of tumors in the digestive system. [2] SAC accounts for about 7.5% of all CRCs and 17.5% of the most proximal CRC. [3] Different from CRC, there are different histological diagnosis criteria for SAC. [4],[5] Furthermore, patients with SAC tend to have a worse prognosis than patients with conventional colon carcinoma (CC). [3] Compared with CC, the higher frequency of adverse histological features [6] and the different histochemical expression pattern may contribute to the poorer prognosis of SAC. [7] Although there is a rapid decline in the incidence of CRC for medical technology improvement, the potential mechanism of CRC has not been clarified clearly, especially in the molecular level. [8]
Recent studies show that genetic alternation is an event during the colorectal tumor development. [9],[10] It is reported that the frequent mutation of KRAS (Kirsten rat sarcoma viral oncogene homolog) in addition to BRAF (v-Raf murine sarcoma viral oncogene homolog B1) contributes to the process of SAC. [11] The chromosome 5q21 genes in familial adenomatous polyposis is also proved to be important for the colorectal-tumor development. [12] Although gene expression profiles contribute to the molecular diagnosis of colorectal tumors, [13],[14] a single profile analysis of SAC or CC can hardly explain the whole potential interactions of mechanism underlying SAC and CC development. Thus, a systematical analysis of potential interactions for SAC and CC using bioinformatics study based on the related gene expression profile is needed.
In this study, the differentially expressed genes (DEGs) were selected based on the gene expression profiles of SAC and CC via the construction and analysis of interaction networks, we hoped to dig out key modules and the potential diseases-related genes for the further researches.
> Materials and methods | |  |
Differentially expressed genes and microarray data
The microarray data of GSE36758 [15] and GSE8671 [16] were downloaded from Gene Expression Omnibus) database (http://www.ncbi.nlm.nih.gov/geo/). The gene expression profiles data of GSE36758 was developed by 11 SAC tumor tissues and 15 CC tumor tissues and the GSE8671 data was derived from 32 CC tumor tissues and 32 corresponding normal colon tissues normal control (NC). The raw data and the probe annotation files were downloaded based on the platform of GPL4133 (Agilent-014850 Whole Human Genome Microarray 4 x 44K G4112F) and GPL570 (Affymetrix Human Genome U133 Plus 2.0 Array).
The probe files were translated into gene level data and the datasets were normalized by affy package in R. [17] The DEGs in SAC versus CC and CC versus NC were analyzed by limma package in R [18] and were tested by the Beniamini-Hochberg method. [19] P < 0.05 and |log FC | >1 were considered to be the cut-off value.
Protein-protein interaction networks construction
All the protein-protein interaction (PPI) pairs of the human whole genome were downloaded from Human Protein Reference Database (HPRD) (http://hprd.org/). The protein interaction related data were processed by getting rid of the repeated interactions and the interactions between single genes. The DEGs related protein pairs were selected and the PPI networks for DEGs in SAC versus CC, CC versus NC group and both SAC versus CC and CC versus NC group were constructed by MATLAB software (MathWorks, Natick, MA). [20]
Extraction and evaluation of modules in protein-protein interaction networks
In order to obtain the modules that have close relationships, we performed modules analysis for the three PPI networks. The modules were clustered by Molecular Complex Detection plugin [21] in Cytoscape software (Institute for Systems Biology, Seattle, Washington, USA). [22] The modules were evaluated by comparing the function similarity of the gene set in one module with the application of Gene ontology (GO). [23] The detailed calculation method was performed according to Ruths et al. [24] The accuracy of modules was proportional to the modules' ranking. We selected the module score larger than two as the threshold value.
Function enrichment analysis
ToppGene [25] (http://toppgene.cchmc.org/) can be used to carry out the gene function enrichment analysis or optimized related genes. In this study, we performed function enrichment analysis for DEGs identified in this paper based on the ToppGene database (Cincinnati Children's Hospital Medical Center (BMI CCHMC), Cincinnati, OH,USA). The enriched biological processes (BPs) were mainly analyzed. P <0.05 was considered to be significant.
> Results | |  |
Identification of differentially expressed genes
Total eight DEGs in SAC versus CC group were obtained based on gene expression profile of GSE36758 and 445 DEGs in CC versus NC group were obtained based on the gene expression profile of GSE8671. All these DEGs were enrolled in the present study.
Protein-protein interaction networks analysis
There were total 9240 protein interaction pairs for human whole genome based on HPRD database. After preprocessed, 37,080 protein pairs were obtained, among which we selected the DEGs related protein pairs to construct the PPI networks. The PPI network for CC versus NC group was constructed with 362 nodes and 512 edges [Figure 1]. There are 12 nodes and 12 edges in the PPI network of SAC versus CC group [Figure 2] and 371 nodes and 551 edges in the PPI network of all the DEGs in both two groups [Figure 3]. Compared with other two PPI networks, the PPI network of SAC versus CC had the fewest genes involved (such as CTNNB1 and SRC). | Figure 1: Protein-protein interaction network of colorectal cancer versus control: Squares represent the differentially expressed genes; lines represent the interactions among genes
Click here to view |
 | Figure 2: Protein-protein interaction network of serrated adenocarcinoma versus colorectal cancer: Squares represent the differentially expressed genes; lines represent the interactions among genes
Click here to view |
 | Figure 3: The co-protein-protein interaction network of differentially expressed genes (DEGs) in two gene expression profiles: Squares represent the DEGs; lines represent the interactions among genes
Click here to view |
Modules analysis of protein-protein interaction networks
Molecular Complex Detection in Cytoscape software was used for the modules analysis. When the threshold of module score was larger than two, three modules was extracted from the PPI network of CC versus NC group [Figure 4] and the detailed information for the three modules were listed in [Table 1]. Meanwhile, three modules were extracted from the PPI network of all DEGs identified in this paper [Figure 5] and the detail information for the three modules were listed in [Table 2]. No module was extracted from the network of SAC versus CC group due to the unsuitable module score. The similarity (Funsim value) showed that compared to the modules containing fewer number of genes, the similarity of the modules that contain a large number of genes was low [Table 1] and [Table 2]. | Figure 4: Three modules extracted from the protein-protein interaction network of colorectal cancer versus control: Squares represent the differentially expressed genes; lines represent the interactions among genes
Click here to view |
 | Figure 5: Three modules extracted from the co-protein-protein interaction network of differentially expressed genes (DEGs) in two gene expression profiles: Squares represent the DEGs; lines represent the interactions among genes
Click here to view |
 | Table 1: The results of modules extracted from PPI network of colorectal cancer versus normal control
Click here to view |
 | Table 2: The result of modules extracted from co-PPI network of two gene profiles
Click here to view |
Function enrichment analysis of differentially expressed genes
ToppGene database was used for the function enrichment analysis to obtain the main function of all enrolled DEGs. Since BP in GO can better reflect the function of genes, we select BP as the main target in the function enrichment analysis of DEGs in this study. The results of GO enrichment analysis showed that the significantly enriched GO terms of DEGs were mainly related with cellular activities. Top 10 of GO terms enriched by DEGs were listed in [Table 3] according to the P value.
> Discussion | |  |
In the present study, we performed a PPI network and key modules analysis based on the gene expression profiles of SAC and CC. The results showed that total three PPI networks were constructed for DEGs in CC versus NC, SAC versus CC group, and DEGs in both two groups. Three modules were extracted from the PPI network of CC versus NC group. Meanwhile, three modules were extracted from the co-PPI network of two gene profiles. No module was extracted from the PPI network of SAC versus CC group. Function enrichment analysis indicated that DEGs involved in these modules were mainly associated with the BP of cellular activities.
Although CRC are proved to undergo the pathological, [26] extensive clinical [27],[28] and molecular analysis, [29],[30] little is currently known about gene expression changes accompanying their formation. Previous study indicates that the genetic inactivation of AKT1, AKT2, and PDPK1 are vital to the process of human CRC. [31] Gene expression is also proved to improve prognosis prediction of Stage II and III CRC. [32] Sabates-Bellver et al. [16] indicates that KIAA1199 is a novel target of the Wnt signaling pathway and a putative marker of colorectal adenomatous transformation. As a distinct variant of CRC, SAC accounts for about 7.5% of all CRCs. [15] Previous studies prove that the mutation of some genes and pathways are associated with the process of SAC. [11],[33],[34] Some outstanding results of CC and SAC based on the gene level are reported, but a systematical bioinformatics analysis of two diseases based on the gene expression was lack. In this study, the PPI networks analysis revealed three important networks among CC, SAC and healthy controls. Although there were few genes involved in the PPI network of CC versus SAC, some potential key gene including CTNNB1 might play an important role in the relationship between CC and SAC. The CTNNB1 (β-catenin gene) and CTNNB1 related pathway are proved to be important in many tumor or cancer in vivo, such as endometrioid ovarian carcinomas, [35] prostate cancers [36] and desmoid tumors. [37] Morikawa et al. indicate that the alteration of CTNNB1 is associated with CRC. [38] Our result was accordance with the previous studies that CTNNB1 was associated with the process of CC. However, the result in this study indicated that there was still a relation between CTNNB1 and SAC. Thus, we speculated that the genes including CTNNB1 in the PPI network of CC versus SAC might have a potential relationship with the disease-related gene expression via the process of CC and SAC. However, a further investigation with a large sample size based on gene expression profile analysis was needed to prove this speculation.
The high gene functional similarity (Funsim value) in the modules extracted from PPI networks showed the high accurateness of these modules in the present study. Compared with the modules that involved with a few genes, the modules that involved with large number of genes tended to have a low gene functional similarity; this phenomenon might be explained by the close relations among genes within module. Function enrichment analysis showed the DEGs in modules were mainly enriched in the cell activities pathways, such as cell migration. Cell migration is a highly integrated multistep process that contributes to tissue repair and regeneration, and drives disease progression in cancer. [39] Previous study indicates that some influential factors such as CXCR4 and CXCL12 are inversely expressed in CRC cells and modulate cancer cell migration. [40] Cell migration contributes to the morphogenesis and cancer development. [41] Thus, the results of enrichment analysis indicated that the modules obtained in the present study were closely related to the process of SAC and CC. However, due to the limitations (such as sample size) in the present study, a further investigation is needed.
> Conclusion | |  |
Genes in the modules clustered in our paper showed close association with CC and SAC development. CTNNB1 may play a key role in both CC and SAC. The module analysis in our paper provided a new perspective to explore the mechanism of CC and SAC.
> References | |  |
1. | Parker SL, Tong T, Bolden S, Wingo PA. Cancer statistics, 1997. CA Cancer J Clin 1997;47:5-27. |
2. | McSherry CK, Cornell GN, Glenn F. Carcinoma of the colon and rectum. CA Cancer J Clin 1969;19:248-50. |
3. | García-Solano J, Pérez-Guillermo M, Conesa-Zamora P, Acosta-Ortega J, Trujillo-Santos J, Cerezuela-Fuentes P, et al. Clinicopathologic study of 85 colorectal serrated adenocarcinomas: Further insights into the full recognition of a new subset of colorectal carcinoma. Hum Pathol 2010;41:1359-68. |
4. | Mäkinen MJ, George SM, Jernvall P, Mäkelä J, Vihko P, Karttunen TJ. Colorectal carcinoma associated with serrated adenoma - Prevalence, histological features, and prognosis. J Pathol 2001;193:286-94. |
5. | Tuppurainen K, Mäkinen JM, Junttila O, Liakka A, Kyllönen AP, Tuominen H, et al. Morphology and microsatellite instability in sporadic serrated and non-serrated colorectal cancer. J Pathol 2005;207:285-94. |
6. | García-Solano J, Conesa-Zamora P, Trujillo-Santos J, Mäkinen MJ, Pérez-Guillermo M. Tumour budding and other prognostic pathological features at invasive margins in serrated colorectal adenocarcinoma: A comparative study with conventional carcinoma. Histopathology 2011;59:1046-56. |
7. | García-Solano J, Conesa-Zamora P, Trujillo-Santos J, Torres-Moreno D, Mäkinen MJ, Pérez-Guillermo M. Immunohistochemical expression profile of ß-catenin, E-cadherin, P-cadherin, laminin-5?2 chain, and SMAD4 in colorectal serrated adenocarcinoma. Hum Pathol 2012;43:1094-102. |
8. | Mäkinen MJ. Colorectal serrated adenocarcinoma. Histopathology 2007;50:131-50. |
9. | Vogelstein B, Fearon ER, Hamilton SR, Kern SE, Preisinger AC, Leppert M, et al. Genetic alterations during colorectal-tumor development. N Engl J Med 1988;319:525-32. |
10. | Lengauer C, Kinzler KW, Vogelstein B. Genetic instability in colorectal cancers. Nature 1997;386:623-7. |
11. | Stefanius K, Ylitalo L, Tuomisto A, Kuivila R, Kantola T, Sirniö P, et al. Frequent mutations of KRAS in addition to BRAF in colorectal serrated adenocarcinoma. Histopathology 2011;58:679-92. |
12. | Nishisho I, Nakamura Y, Miyoshi Y, Miki Y, Ando H, Horii A, et al. Mutations of chromosome 5q21 genes in FAP and colorectal cancer patients. Science 1991;253:665-9. |
13. | Lin YM, Furukawa Y, Tsunoda T, Yue CT, Yang KC, Nakamura Y. Molecular diagnosis of colorectal tumors by expression profiles of 50 genes expressed differentially in adenomas and carcinomas. Oncogene 2002;21:4120-8. |
14. | Lanza G, Ferracin M, Gafà R, Veronese A, Spizzo R, Pichiorri F, et al. mRNA/microRNA gene expression profile in microsatellite unstable colorectal cancer. Mol Cancer 2007;6:54. |
15. | Conesa-Zamora P, García-Solano J, García-García F, Turpin Mdel C, Trujillo-Santos J, Torres-Moreno D, et al. Expression profiling shows differential molecular pathways and provides potential new diagnostic biomarkers for colorectal serrated adenocarcinoma. Int J Cancer 2013;132:297-307. |
16. | Sabates-Bellver J, Van der Flier LG, de Palo M, Cattaneo E, Maake C, Rehrauer H, et al. Transcriptome profile of human colorectal adenomas. Mol Cancer Res 2007;5:1263-75. |
17. | Gautier L, Cope L, Bolstad BM, Irizarry RA. affy - Analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 2004;20:307-15. |
18. | Diboun I, Wernisch L, Orengo CA, Koltzenburg M. Microarray analysis after RNA amplification can detect pronounced differences in gene expression using limma. BMC Genomics 2006;7:252. |
19. | Yekutieli D, Benjamini Y. Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics. J Stat Plan Inference 1999;82:171-96. |
20. | Demuth H, Beale M. Neural Network Toolbox for Use with MATLAB. copyright: 1992-1997 by The Math works, Inc. 1993. |
21. | Bandettini WP, Kellman P, Mancini C, Booker OJ, Vasu S, Leung SW, et al. MultiContrast Delayed Enhancement (MCODE) improves detection of subendocardial myocardial infarction by late gadolinium enhancement cardiovascular magnetic resonance: A clinical validation study. J Cardiovasc Magn Reson 2012;14:83. |
22. | Kohl M, Wiese S, Warscheid B. Cytoscape: Software for visualization and analysis of biological networks. Data Mining in Proteomics. Humana Press: Springer; 2011. p. 291-303. |
23. | Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000;25:25-9. |
24. | Ruths T, Ruths D, Nakhleh L. GS2: An efficiently computable measure of GO-based similarity of gene sets. Bioinformatics 2009;25:1178-84. |
25. | Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 2009;37:W305-11. |
26. | Jass JR, Smyrk TC, Stewart SM, Lane MR, Lanspa SJ, Lynch HT. Pathology of hereditary non-polyposis colorectal cancer. Anticancer Res 1994;14:1631-4. |
27. | Winawer S, Fletcher R, Rex D, Bond J, Burt R, Ferrucci J, et al. Colorectal cancer screening and surveillance: Clinical guidelines and rationale-Update based on new evidence. Gastroenterology 2003;124:544-60.  [ PUBMED] |
28. | Winawer SJ, Fletcher RH, Miller L, Godlee F, Stolar MH, Mulrow CD, et al. Colorectal cancer screening: Clinical guidelines and rationale. Gastroenterology 1997;112:594-642. |
29. | Pritchard CC, Grady WM. Colorectal cancer molecular biology moves into clinical practice. Gut 2011;60:116-29. |
30. | Nosho K, Baba Y, Tanaka N, Shima K, Hayashi M, Meyerhardt JA, et al. Tumour-infiltrating T-cell subsets, molecular changes in colorectal cancer, and prognosis: Cohort study and literature review. J Pathol 2010;222:350-66. |
31. | Ericson K, Gan C, Cheong I, Rago C, Samuels Y, Velculescu VE, et al. Genetic inactivation of AKT1, AKT2, and PDPK1 in human colorectal cancer cells clarifies their roles in tumor growth regulation. Proc Natl Acad Sci U S A 2010;107:2598-603. |
32. | Salazar R, Roepman P, Capella G, Moreno V, Simon I, Dreezen C, et al. Gene expression signature to improve prognosis prediction of Stage II and III colorectal cancer. J Clin Oncol 2011;29:17-24. |
33. | Hiyama T, Yokozaki H, Shimamoto F, Haruma K, Yasui W, Kajiyama G, et al. Frequent p53 gene mutations in serrated adenomas of the colorectum. J Pathol 1998;186:131-9. |
34. | Chan TL, Zhao W, Leung SY, Yuen ST, Cancer Genome Project. BRAF and KRAS mutations in colorectal hyperplastic polyps and serrated adenomas. Cancer Res 2003;63:4878-81. |
35. | Palacios J, Gamallo C. Mutations in the beta-catenin gene (CTNNB1) in endometrioid ovarian carcinomas. Cancer Res 1998;58:1344-7. |
36. | Gerstein AV, Almeida TA, Zhao G, Chess E, Shih IeM, Buhler K, et al. APC/CTNNB1 (beta-catenin) pathway alterations in human prostate cancers. Genes Chromosomes Cancer 2002;34:9-16. |
37. | Le Guellec S, Soubeyran I, Rochaix P, Filleron T, Neuville A, Hostein I, et al. CTNNB1 mutation analysis is a useful tool for the diagnosis of desmoid tumors: A study of 260 desmoid tumors and 191 potential morphologic mimics. Mod Pathol 2012;25:1551-8. |
38. | Morikawa T, Kuchiba A, Yamauchi M, Meyerhardt JA, Shima K, Nosho K, et al. Association of CTNNB1 (beta-catenin) alterations, body mass index, and physical activity with survival in patients with colorectal cancer. JAMA 2011;305:1685-94. |
39. | Ridley AJ, Schwartz MA, Burridge K, Firtel RA, Ginsberg MH, Borisy G, et al. Cell migration: Integrating signals from front to back. Science 2003;302:1704-9. |
40. | Brand S, Dambacher J, Beigel F, Olszak T, Diebold J, Otte JM, et al. CXCR4 and CXCL12 are inversely expressed in colorectal cancer cells and modulate cancer cell migration, invasion and MMP-9 activation. Exp Cell Res 2005;310:117-30. |
41. | Friedl P, Hegerfeldt Y, Tusch M. Collective cell migration in morphogenesis and cancer. Int J Dev Biol 2004;48:441-9. |
[Figure 1], [Figure 2], [Figure 3], [Figure 4], [Figure 5]
[Table 1], [Table 2], [Table 3]
|