Identification of Biomarkers Associated With Pathological Stage and Prognosis of Clear Cell Renal Cell Carcinoma by Co-expression Network Analysis

Chen, Liang; Yuan, Lushun; Qian, Kaiyu; Qian, Guofeng; Zhu, Yuan; Wu, Chin-Lee; Dan, Han C.; Xiao, Yu; Wang, Xinghuan

doi:10.3389/fphys.2018.00399

ORIGINAL RESEARCH article

Front. Physiol., 18 April 2018

Sec. Integrative Physiology

Volume 9 - 2018 | https://doi.org/10.3389/fphys.2018.00399

Identification of Biomarkers Associated With Pathological Stage and Prognosis of Clear Cell Renal Cell Carcinoma by Co-expression Network Analysis

$\r\nLiang Chen$ Liang Chen¹

Lushun Yuan¹

Kaiyu Qian^2,3

Guofeng Qian⁴

Yuan Zhu^2,3

Chin-Lee Wu⁵

Han C. Dan⁶

Yu Xiao^1,2,3^*

Xinghuan Wang¹^*

¹Department of Urology, Zhongnan Hospital of Wuhan University, Wuhan, China
²Department of Biological Repositories, Zhongnan Hospital of Wuhan University, Wuhan, China
³Laboratory of Precision Medicine, Zhongnan Hospital of Wuhan University, Wuhan, China
⁴Department of Endocrinology, The First Affiliated Hospital of Zhejiang University, Hangzhou, China
⁵Department of Urology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, United States
⁶Greenebaum Cancer Center, School of Medicine, University of Maryland, Baltimore, MD, United States

Clear cell renal cell carcinoma (ccRCC) is the most common subtype among renal cancer whose prognostic is affected by the tumor progression associated with complex gene interactions. However, there is currently no available molecular markers associated with ccRCC progression and used or clinical application. In our study, microarray data of 101 ccRCC samples and 95 normal kidney samples were analyzed and 2,425 differentially expressed genes (DEGs) were screened. Weighted gene co-expression network analysis (WGCNA) was then conducted and 11 co-expressed gene modules were identified. Module preservation analysis revealed that two modules (red and black) were found to be most stable. In addition, Pearson's correlation analysis identified the module most relevant to pathological stage(patho-module) (r = 0.44, p = 3e-07). Functional enrichment analysis showed that biological processes of the patho-module focused on cell cycle and cell division related biological process and pathway. In addition, 29 network hub genes highly related to ccRCC progression were identified from the stage module. These 29 hub genes were subsequently validated using 2 other independent datasets including GSE53757 (n = 72) and TCGA (n = 530), and the results indicated that all hub genes were significantly positive correlated with the 4 stages of ccRCC progression. Kaplan-Meier survival curve showed that patients with higher expression of each hub gene had significantly lower overall survival rate and disease-free survival rate, indicating that all hub genes could act as prognosis and recurrence/progression biomarkers of ccRCC. In summary, we identified 29 molecular markers correlated with different pathological stages of ccRCC. They may have important clinical implications for improving risk stratification, therapeutic decision and prognosis prediction in ccRCC patients.

Introduction

Renal carcinoma is a common malignancy of the urinary system and accounts for 2–3% of adult malignancies (Siegel et al., 2017). Renal cell carcinoma (RCC) accounts for about 90% of renal cancers, the vast majority (70–85%) of which are clear cell subtypes (Cairns, 2010). RCC can be divided into 4 pathological stages according to tumor size, the extend of invasion and metastasis (Motzer et al., 2015). Surgery treatment is effective for localized RCC, however, once the RCC becomes metastatic the survival rate of patients will drop sharply. In addition, about 30% of patients with RCC had metastasis when diagnosed. The standard treatment for localized RCC is surgery including radical or partial nephrectomy due to its insensitive to radiotherapy and chemotherapy (Motzer et al., 1996). For metastatic RCC, immunotherapy with interleukin-2 (IL-2) and interferon (IFN) was once the standard treatment, but it had limited curative effects and strong side effects (Negrier et al., 1998). Targeted therapies including sorafenib (Hutson et al., 2010) and sunitinib (Motzer et al., 2006) were approved for metastatic RCC in 2005 and 2006 respectively, with better effects and fewer side effects compared with immunotherapy. However, targeted therapies were still limited and prone to drug resistance (Coppin et al., 2011; Ljungberg et al., 2015). Therefore, more effective diagnosis biomarkers and therapeutic targets are in urgent need.

Gene expression profiles such as microarray and RNA-sequencing have been widely used to identify biomarkers associated with clear cell renal cell carcinoma (ccRCC) progression (Dahinden et al., 2010; Gerlinger et al., 2014). However, most of the published studies focused on the screening of differently expressed genes (DEGs), ignoring the high correlations between genes, although genes with similar expression patterns might be functionally related (Tavazoie et al., 1999). Weighted gene co-expression network analysis (WGCNA) was applied to explore the correlations between gene clusters and clinical features (Langfelder and Horvath, 2008). Recently, many studies related to WGCNA regarding biological information and systems biology have been published in well-known journals (Kunowska et al., 2015; Luo et al., 2015). The WGCNA algorithm had been applied to screen for biological processes and treatment targets of cancer as well as specific biomarkers related to complex disease, such as familial combination of hyperlipidemia (Plaisier et al., 2009), Alzheimer's disease (Miller et al., 2010) and osteoporosis (Farber, 2010). Similarly, WGCNA was also used to identify key genes significantly correlated with clinical indicators of tumor progression including tumor stages, grades and metastasis for different tumor types (Chen P. et al., 2017; He et al., 2017).

Thus, our study aims to identify network-centric genes associated with ccRCC progression by constructing a co-expression network using DEGs through weighted gene co-expression network WGCNA (Clarke et al., 2013; Chou et al., 2014). To our knowledge, it is the first attempt to use WGCNA to identify a series of hub genes as biomarkers significantly associated with pathological stages and prognosis of ccRCC and to distinguish localized and non-localized ccRCC.

Materials and Methods

Data Collection

Raw gene expression profile and clinical data were obtained from Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/). Datasets GSE53757 Von Roemeling et al., 2014 and GSE36895 (Peña-Llopis et al., 2012) performed on the same platform Affymetrix Human Genome U133 Plus 2.0 Array (HG U133 Plus 2.0) were combined and analyzed to screen DEGs. Dataset GSE73731 (Wei et al., 2017) was also performed on the same platform included 265 ccRCC samples, of which 125 samples with complete clinical data were used to identify hub genes with WGCNA. Two other independent datasets GSE40355 based on the platform of Whole Human Genome Microarray 4 × 44 K v2 (Agilent-026652) and GSE36895 were used to conduct module preservation analysis. In addition, RNA sequencing data of 530 ccRCC samples was downloaded from The Cancer Genome Atlas (TCGA) database (https://genome-cancer.ucsc.edu/) for further validation. Gene expression data was based on Illumina Hiseq's RNA sequencing technology. The detailed information of the datasets used in this study was summarized in Supplementary Table S1.

Data Preprocessing and Differentially Expressed Genes (DEGs) Screening

Raw expression data was calculated following the pre-processing procedures: RMA background correction, log2 transformation, quantile normalization and median polish algorithm summarization using the “affy” (Gautier et al., 2004) package of R software (version 3.3.1). Besides, “sva” (Leek and Storey, 2007) R package was used to remove batch effects between dataset GSE53757 and GSE36895. Probes were annotated by the Affymetrix annotation files. R package “limma” (Ritchie et al., 2015) was applied to select the DEGs between 101 ccRCC samples and 95 normal kidney samples. The cut-off criteria for screening DEGs was the false discovery rate (FDR) <0.01 and |log2(fold change)| ≥ 1.

Weighted Gene Co-Expression Network Construction

The “WGCNA” (Langfelder and Horvath, 2008) package in R was applied to performed co-expression network using the expression values of 2,425 DEGs from 125 tumor samples and complete clinical data (GSE73731). First, one outlier sample was excluded from subsequent analysis (Supplementary Figure S2). The detailed procedure for WGCNA construction could be found in our previous study (Chen L. et al., 2017). Briefly, we constructed the weighted adjacency matrix using a power function based on a soft-thresholding parameter β. After that, the adjacency was transformed into topological overlap matrix (TOM), and average linkage hierarchical clustering was performed according to the TOM-based dissimilarity measure. In this study, we chose a minimum size (gene group) of 30 for the genes dendrogram and a cut-line (0.25) for module dendrogram and merged some modules.

Module Preservation Analysis

To access the stability of the each module identified above, we conducted module preservation analysis using the modulePreservation (Langfelder et al., 2011) method (nPermutations = 200) in the “WGCNA” package. The two datasets GSE36895 and GSE40355 used for preservation analysis contained gene expression profiles of 29 and 16 ccRCC samples, respectively.

Identifying Clinically Significant Modules and Module Functional Annotation

WGCNA identifies gene modules based on their expression similarities in samples and calculates the correlation between the external clinical information and gene modules to identify clinically significant gene modules. The gene modules most correlated with clinical features were selected as modules of interest, combined with the correlative clinical feature. To further clarify the mechanism underlying the impact of module genes on correlative clinical feature, genes in interest module were uploaded to the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (Dennis et al., 2003) for GO functional annotation and KEGG pathway enrichment analysis. FDR < 0.05 was used as cut-off criteria.

Hub Genes Identification and Validation

Hub genes were a series of genes with the highest degree of connectivity in a gene module and determined the characteristics of a module. Hub genes were defined by module connectivity, measured by absolute value of the Pearson's correlation (cor.geneModuleMembership > 0.8) and clinical trait relationship, and measured by absolute value of the Pearson's correlation (cor.geneTraitSignificance > 0.2). In this study, we identified hub genes in the module which significantly correlated with certain clinical feature. In addition, we used boxplot to show the relationship between the hub genes and the corresponding clinical features, and the statistical significance between them was analyzed by one-way ANOVA.

Another independent dataset GSE53757 was analyzed to validate the hub genes. In addition, 530 ccRCC samples from TCGA database were analyzed to compare expression of hub genes between different pathological stages of ccRCC.

Survival Analysis

In order to evaluate the impact of all hub genes on ccRCC patient's prognosis, overall and disease-free survival were analyzed. For overall survival analysis, 530 patients were divided into 2 groups according to median expression of each hub gene (high vs. low). Similarly, for disease free survival, 433 patients with complete recurrence/progression survival time were also analyzed. Then we adopted “survival” of R software for log-rank test and Kaplan-Meier survival analysis (Goel et al., 2010).

Results

Identification of Differentially Expressed Genes in ccRCC Tissue Samples

A workflow of this study was shown in Supplementary Figure S4. The expression matrices for 196 samples containing 101 ccRCC samples and 95 normal kidney samples in datasets GSE53757 and GSE36895 were obtained after data preprocessing. A total of 2,425 DEGs including 1,259 up-regulated and 1166 down-regulated were screened between ccRCC and normal kidney under the threshold of FDR < 0.01 and |log2FC| ≥ 1. These 2,425 DEGs were then selected for subsequent analysis. The heatmap for DEGs were shown in Supplementary Figure S1.

Weighted Co-expression Network Construction and Module Preservation Analysis

Co-expression network was constructed using independent dataset GSE73731 including 124 ccRCC samples associated with complete clinical data (Figure 1). The expression values of 2,425 DEGs were included for co-expression network constructing by adopting “WGCNA” package. In current study, to ensure a scale-free network, we selected β = 7 (scale free R² = 0.87) as the soft-thresholding power (Figures 2A–D), and identified a total of 11 modules (Figure 2E).

FIGURE 1

Figure 1. Clustering dendrogram of 124 tumor samples and the clinical traits. The clustering was based on the expression data of DEGs between tumor samples and non-tumor samples in ccRCC. The color intensity was proportional to older age as well as higher pathological stage and tumor grade.

FIGURE 2

Figure 2. Determination of soft-thresholding power in WGCNA. (A) Analysis of the scale-free fit index for various soft-thresholding powers (β). (B) Analysis of the mean connectivity for various soft-thresholding powers. (C) Histogram of connectivity distribution when β = 7. (D) Checking the scale free topology when β = 7. (E) Dendrogram of all differentially expressed genes clustered based on a dissimilarity measure (1-TOM).

To determine whether the identified network can also be found in another independent network, we performed module preservation analysis by comparing the GSE73731 dataset with 2 other test datasets GSE36895 and GSE40355. As shown in Figure 3, red and black modules were found to be most stable due to their Zsummary statistics (Figures 3B,D) above 10 and Median rank statistics (Figures 3A,C) close to minimum both in the two test datasets.

FIGURE 3

Figure 3. The medianRank and Zsummary statistics of the module preservation of the DEG modules using two independent dataset GSE36895 (A,B) and GSE40355 (C,D). The medianRank of the modules close to zero indicates a high degree of module preservation (A,C). The dashed blue and green lines indicate the thresholds Z = 2 and Z = 10, respectively (B,D). These horizontal lines indicate the Zsummary thresholds for strong evidence of conservation (above 10) and for low to moderate evidence of conservation (above 2).

Identification of Key Modules and Functional Annotation

There were great biological implications to identify modules most significantly related to clinical features. We found that red module showed the highest correlation with pathological stage (r = 0.44, p = 3e-7, Figure 4A). We defined the module most relevant to pathological stage(red) as patho-module.

FIGURE 4

Figure 4. Identification of modules associated with the clinical traits of ccRCC. (A) Heatmap of the correlation between module eigengenes and clinical traits of ccRCC. (B) Distribution of average gene significance and errors in the modules associated with pathological stage of ccRCC. (C) GO functional and KEGG pathway enrichment analyses for genes in the red module. The –log10 (P-value) of each term is colored according to the legend. (D) Scatter plot for correlation between gene module membership in the red module(patho-module) and gene significance.

Besides, in relation with pathological stage, the patho-module also exhibited the highest gene significance (Figure 4B). Therefore, we selected the patho-module as module of interest and analyzed subsequently. Interestingly, the patho-module also showed high correlation with tumor grade. The genes in each module were listed in Supplementary Table S2.

To figure out the functional involvement of the patho-module, 209 genes in patho-module were uploaded into DAVID database for Gene Ontology (GO) analysis and KEGG pathway enrichment analysis. Under the threshold of FDR < 0.05, biological processes of patho-module were suggested to focus on cell division (p = 1.24e-18), mitotic nuclear division (p = 3.61e-14), sister chromatid cohesion (p = 1.26E-09), mitotic cytokinesis (p = 2.07e-08), mitotic sister chromatid segregation (p = 2.30e-07), chromosome segregation (p = 7.22e-07), kinetochore assembly (p = 6.31e-06) and microtubule-based movement (p = 2.86e-05, Figure 4C). While KEGG pathway enrichment analysis suggested that 209 genes in patho-module was significantly enriched in two pathways including cell cycle (2.15e-07) and p53 signaling pathway (p = 1.36e-05, Figure 4C).

Identification of Hub Genes

Under the threshold of module connectivity (cor.geneModuleMembership) more than 0.8 and clinical trait relationship (cor.geneTraitSignificance) more than 0.2, 29 genes with the high connectivity in patho-module were selected as hub genes (Figure 4D). Hub genes were significantly correlated with pathological stage were listed in Table 1.

TABLE 1

Table 1. Hub genes in the modules related with pathological stages.

The relationship between all hub gene and pathological stages in the training dataset GSE73731 was shown in Figure 5. Expression values of all hub genes in different pathological stages were compared and statistical differences were calculated with one-way ANOVA, suggesting significant difference (p < 0.01) of each hub gene across different pathological stages. Interestingly, we also found significant difference (p < 0.01) of each hub gene across different tumor grades (Supplementary Figure S3).

FIGURE 5

Figure 5. Boxplots of hub genes across different pathological stages in the GSE73731. The boxplots show the medians and dispersions of the samples of different pathological stages for each hub gene. (A) Boxplots of hub genes from ANLN to FOXM1 (sort in alphabetical order) in different pathological stages. (B) Boxplots of hub genes from KIF11 to UHRF1 in different pathological stages. P-values are the results of one-way ANOVA for different pathological stages. ***p < 0.001.

Validation of Hub Genes

Then all hub genes were selected for validation using 2 other independent datasets including GSE53757 and TCGA dataset. In the test set GSE53757, significant difference was detected for each hub gene expression across 4 pathological stages using one-way ANOVA (Figure 6). Moreover, one-way ANOVA and independent sample t-tests based on RNA sequencing data showed that all hub genes were also effective to distinguish local ccRCC (pathological stage I or II) and non-localized ccRCC (pathological stage III or IV, Figure 7).

FIGURE 6

Figure 6. Boxplots of hub genes across different pathological stages in the validation dataset GSE53757. The boxplots show the medians and dispersions of the samples of different pathological stages for each hub gene. (A) Boxplots of hub genes from ANLN to FOXM1 (sort in alphabetical order) in different pathological stages. (B) Boxplots of hub genes from KIF11 to UHRF1 in different pathological stages. P-values are the results of one-way ANOVA for different pathological stages. *p < 0.05, **p < 0.01, ***p < 0.001.

FIGURE 7

Figure 7. Boxplots of hub genes across different pathological stages in the TCGA dataset. The boxplots show the medians and dispersions of the samples of different pathological stages for each hub gene. P-values are the results of independent sample t-test between pathological stage I/II and III/IV and one-way ANOVA for different pathological stages.

Survival Analysis

Base on TCGA RNA-sequencing data and clinical information, patients were divided into 2 groups according to median expression of each hub gene, and Kaplan-Meier survival curve was then plotted. Patients with higher each hub gene showed significantly shorter overall survival rate and disease-free survival rate, indicating that all hub genes could act as prognosis and recurrence biomarkers of ccRCC (Figures 8, 9).

FIGURE 8

Figure 8. Overall survival analyses on hub genes in the TCGA data set. Survival curves for patients in different groups. Red lines represent high expression of hub genes, while blue lines represent low expression of hub genes.

FIGURE 9

Figure 9. Diseases-free survival analyses on hub genes in the TCGA data set. Survival curves for patients in different groups. Red lines represent high expression of hub genes, while blue lines represent low expression of hub genes.

Discussion

ccRCC is the most common subtype among renal cancer whose prognostic is affected by the tumor progression associated with complex gene interactions. Exploring molecular markers of ccRCC is important for the diagnosis and treatment of patients with ccRCC. However, there is currently no available molecular markers associated with the pathological stage of ccRCC for clinical application. Here, clinical information and mRNA expression profiling of patients with ccRCC from microarray data were analyzed by using a systematic biological method WGCNA. We identified 29 molecular markers correlated with different pathological stages of ccRCC. They may have important clinical implications for improving risk stratification, therapeutic decision-making and prognosis prediction in patients with ccRCC.

WGCNA, an algorithm to mine gene module information from expression profiling microarray, was widely applied in the RNA-sequencing data (Langfelder and Horvath, 2008). In this method, module was defined as a group in which genes had similar expression variation trend in a physiological process or in different tissue samples. In another word, genes with similar expression variation trend across different samples could be defined as a gene module. The WGCNA clustering criteria had a great biological significance which was completely different from the clustering method based on the geometric distance between data. After the gene module was identified, the stability of module was calculated. Then the correlation between stable modules and clinical features such as age, gender, stage of disease, and tumor grade were calculated. By this method, the most clinically related gene modules which could be used to explore the primary cause of disease development. Scale-free network was characterized by the existence of a few nodes that have significantly higher connectivity than the general nodes. The “few” node genes were defined as hub genes. So, the studies for the correlation between interest module and certain clinical feature could be simplified as correlation between hub genes of interest module and clinical feature, and thus providing an important molecular basis for exploring the mechanism of disease development.

The aim of the study was to identify biomarkers associated with progression of ccRCC using WGCNA to mine expression profiling combined with clinical data of large numbers of patients with ccRCC. In the study of cancer, candidate molecular biomarkers must be well distinguished between cancerous and normal tissues. Based on DEGs screened from ccRCC, the weighted co-expression network was constructed and 11 co-expression modules were identified through dynamic tree cutting method. It was found that among the 11 modules the red one showed the highest positive correlation with tumor pathological stage by correlation analysis. In addition, the preservation statistics suggested that the stability of the patho-module is good. Therefore, the patho-module was considered as a clinically significant gene cluster. Hub genes with the highest connectivity in a gene module largely decided the characteristics of the module.

Functional annotation of patho-module was suggested to focus on cell division, cell cycle, mitotic nuclear division, sister chromatid cohesion, mitotic cytokinesis, etc. Cell cycle and cell division are the basic process of cell proliferation, the abnormal mediation of which will lead to the tumor progression. In current study, 29 hub genes of patho-module significantly correlated with pathological stage were identified and validated, which could distinguish localized (pathological stage I/II) and non-localized ccRCC (pathological stage III/IV). Interestingly, expression value of each hub gene across different tumor grades could also be found significant difference, suggesting that the 29 hub genes were positively correlated with tumor grade of ccRCC as well. These findings may contribute to the improvement of therapeutic decision-making, risk stratification, and prognosis prediction for ccRCC patients.

Previous studies using TCGA ccRCC data accurately predicted molecular markers associated with histological grading of ccRCC. Wan et. al identified 8 genes that could be used to distinguish different grades of ccRCC by comparing different histological grades of ccRCC (grade I/II vs. grade III/IV) (Wan et al., 2017). However, this analysis method did not utilize a global level system biological analysis method, which might cause large false-positive results. Other studies using the collected tissue samples for immunohistochemical analysis showed that EphA1 (Wang L. et al., 2015), EphA2 (Wang X. et al., 2015), and VEGFR-1 (Lkhagvadorj et al., 2014) were associated with different pathological stages of ccRCC by directly comparing gene expression differences. However, the results lacked large sample support.

WGCNA is a method that can be applied to explore potential biological mechanisms and to identify genes associated with patient's prognosis. The practical utility of this approach can predict new prognostic markers. Our research may contribute to the personalized treatment of patients with ccRCC. Nonetheless, prior to the clinical use of these molecular markers, multiple center randomized controlled clinical trials and in vitro/vivo experiments should be conducted to assess the potential application of molecular features, to predict survival and to functionally characterize hub genes in clinical applications.

A larger number of clinical samples were required to validate our findings and elucidate the underlying mechanisms of how these hub genes impacted pathological stage of ccRCC, which were our subsequent research work.

In summary, we established a gene co-expression network to identify and validate network hub genes associated with the progression of ccRCC, based on systems biology-based WGCNA. Our work might have important clinical implications for improving risk stratification, therapeutic decision-making and prognosis prediction in patients with ccRCC.

Author Contributions

LC, LY, YX, and XW conceived and designed the study; LC, LY, KQ, and YX performed the analysis procedures; LC, GQ, YZ, HD, and YX analyzed the results; LC, C-LW, YX, and XW contributed analysis tools; LC, LY, and YX contributed to the writing of the manuscript. All authors reviewed the manuscript.

Funding

This study was supported in part by grants from the Hubei Province Health and Family Planning Scientific Research Project (grant number WJ2017H0002) and National Natural Science Foundation of China (grant number 81772730). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The excellent technical assistance of Shanshan Zhang and Danni Shan, Zhongnan Hospital of Wuhan University, is gratefully acknowledged. We also would like to acknowledge the GEO and TCGA databases for free use.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2018.00399/full#supplementary-material

References

Cairns, P. (2010). Renal cell carcinoma. Cancer Biomark. 9, 461–473. doi: 10.3233/CBM-2011-0176

CrossRef Full Text | Google Scholar

Chen, L., Yuan, L., Wang, Y., Wang, G., Zhu, Y., Cao, R., et al. (2017). Co-expression network analysis identified FCER1G in association with progression and prognosis in human clear cell renal cell carcinoma. Int. J. Biol. Sci. 13, 1361–1372. doi: 10.7150/ijbs.21657

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, P., Wang, F., Feng, J., Zhou, R., Chang, Y., Liu, J., et al. (2017). Co-expression network analysis identified six hub genes in association with metastasis risk and prognosis in hepatocellular carcinoma. Oncotarget 8, 48948–48958. doi: 10.18632/oncotarget.16896

PubMed Abstract | CrossRef Full Text | Google Scholar

Chou, W. C., Cheng, A. L., Brotto, M., and Chuang, C. Y. (2014). Visual gene-network analysis reveals the cancer gene co-expression in human endometrial cancer. BMC Genomics 15:300. doi: 10.1186/1471-2164-15-300

PubMed Abstract | CrossRef Full Text | Google Scholar

Clarke, C., Madden, S. F., Doolan, P., Aherne, S. T., Joyce, H., O'Driscoll, L., et al. (2013). Correlating transcriptional networks to breast cancer survival: a large-scale coexpression analysis. Carcinogenesis 34, 2300–2308. doi: 10.1093/carcin/bgt208

PubMed Abstract | CrossRef Full Text | Google Scholar

Coppin, C., Kollmannsberger, C., Le, L., Porzsolt, F., and Wilt, T. J. (2011). Targeted therapy for advanced renal cell cancer (RCC): a Cochrane systematic review of published randomised trials. BJU Int. 108, 1556–1563. doi: 10.1111/j.1464-410X.2011.10629.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Dahinden, C., Ingold, B., Wild, P., Boysen, G., Luu, V. D., Montani, M., et al. (2010). Mining tissue microarray data to uncover combinations of biomarker expression patterns that improve intermediate staging and grading of clear cell renal cell cancer. Clin. Cancer Res. 16, 88–98. doi: 10.1158/1078-0432.CCR-09-0260

PubMed Abstract | CrossRef Full Text | Google Scholar

Dennis, G., Sherman, B. T., Hosack, D. A., Yang, J., Gao, W., Lane, H. C., et al. (2003). DAVID: database for annotation, visualization, and integrated discovery. Genome Biol. 4:P3. doi: 10.1186/gb-2003-4-5-p3

PubMed Abstract | CrossRef Full Text | Google Scholar

Farber, C. R. (2010). Identification of a gene module associated with BMD through the integration of network analysis and genome-wide association data. J. Bone Miner. Res. 25, 2359–2367. doi: 10.1002/jbmr.138

PubMed Abstract | CrossRef Full Text | Google Scholar

Gautier, L., Cope, L., Bolstad, B. M., and Irizarry, R. A. (2004). affy–analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315. doi: 10.1093/bioinformatics/btg405

PubMed Abstract | CrossRef Full Text | Google Scholar

Gerlinger, M., Horswell, S., Larkin, J., Rowan, A. J., Salm, M. P., Varela, I., et al. (2014). Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing. Nat. Genet. 46, 225–233. doi: 10.1038/ng.2891

PubMed Abstract | CrossRef Full Text | Google Scholar

Goel, M. K., Khanna, P., and Kishore, J. (2010). Understanding survival analysis: Kaplan-Meier estimate. Int. J. Ayurveda Res. 1, 274–278. doi: 10.4103/0974-7788.76794

PubMed Abstract | CrossRef Full Text | Google Scholar

He, Z., Sun, M., Ke, Y., Lin, R., Xiao, Y., Zhou, S., et al. (2017). Identifying biomarkers of papillary renal cell carcinoma associated with pathological stage by weighted gene co-expression network analysis. Oncotarget 8, 27904–27914. doi: 10.18632/oncotarget.15842

PubMed Abstract | CrossRef Full Text | Google Scholar

Hutson, T. E., Bellmunt, J., Porta, C., Szczylik, C., Staehler, M., Nadel, A., et al. (2010). Long-term safety of sorafenib in advanced renal cell carcinoma: follow-up of patients from phase III TARGET. Eur. J. Cancer 46, 2432–2440. doi: 10.1016/j.ejca.2010.06.121

PubMed Abstract | CrossRef Full Text | Google Scholar

Kunowska, N., Rotival, M., Yu, L., Choudhary, J., and Dillon, N. (2015). Identification of protein complexes that bind to histone H3 combinatorial modifications using super-SILAC and weighted correlation network analysis. Nucleic Acids Res. 43, 1418–1432. doi: 10.1093/nar/gku1350

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., and Horvath, S. (2008). WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9:559. doi: 10.1186/1471-2105-9-559

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., Luo, R., Oldham, M. C., and Horvath, S. (2011). Is my network module preserved and reproducible? PLoS Comput. Biol. 7:e1001057. doi: 10.1371/journal.pcbi.1001057

PubMed Abstract | CrossRef Full Text | Google Scholar

Leek, J. T., and Storey, J. D. (2007). Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLoS Genet. 3:e0030161. doi: 10.1371/journal.pgen.0030161

PubMed Abstract | CrossRef Full Text | Google Scholar

Ljungberg, B., Bensalah, K., Canfield, S., Dabestani, S., Hofmann, F., Hora, M., et al. (2015). EAU guidelines on renal cell carcinoma: 2014 update. Eur. Urol. 67, 913–924. doi: 10.1016/j.eururo.2015.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Lkhagvadorj, S., Oh, S. S., Lee, M. R., Jung, J. H., Chung, H. C., Cha, S. K., et al. (2014). VEGFR-1 expression relates to fuhrman nuclear grade of clear cell renal cell carcinoma. J. Lifestyle Med. 4, 64–70. doi: 10.15280/jlm.2014.4.1.64

PubMed Abstract | CrossRef Full Text | Google Scholar

Luo, Y., Coskun, V., Liang, A., Yu, J., Cheng, L., Ge, W., et al. (2015). Single-cell transcriptome analyses reveal signals to activate dormant neural stem cells. Cell 161, 1175–1186. doi: 10.1016/j.cell.2015.04.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, J. A., Horvath, S., and Geschwind, D. H. (2010). Divergence of human and mouse brain transcriptome highlights Alzheimer disease pathways. Proc. Natl. Acad. Sci. U.S.A. 107, 12698–12703. doi: 10.1073/pnas.0914257107

PubMed Abstract | CrossRef Full Text | Google Scholar

Motzer, R. J., Bander, N. H., and Nanus, D. M. (1996). Renal-cell carcinoma. N. Engl. J. Med. 335, 865–875. doi: 10.1056/NEJM199609193351207

PubMed Abstract | CrossRef Full Text | Google Scholar

Motzer, R. J., Jonasch, E., Agarwal, N., Beard, C., Bhayani, S., Bolger, G. B., et al. (2015). Kidney cancer, version 3.2015. J. Natl. Compr. Cancer Netw. 13, 151–159. doi: 10.6004/jnccn.2015.0022

PubMed Abstract | CrossRef Full Text | Google Scholar

Motzer, R. J., Rini, B. I., Bukowski, R. M., Curti, B. D., George, D. J., Hudes, G. R., et al. (2006). Sunitinib in patients with metastatic renal cell carcinoma. JAMA 295, 2516–2524. doi: 10.1001/jama.295.21.2516

PubMed Abstract | CrossRef Full Text | Google Scholar

Negrier, S., Escudier, B., Lasset, C., Douillard, J. Y., Savary, J., Chevreau, C., et al. (1998). Recombinant human interleukin-2, recombinant human interferon alfa-2a, or both in metastatic renal-cell carcinoma. groupe francais d'immunotherapie. N. Engl. J. Med. 338, 1272–1278. doi: 10.1056/NEJM199804303381805

CrossRef Full Text | Google Scholar

Peña-Llopis, S., Vega-Rubín-De-Celis, S., Liao, A., Leng, N., Pavía-Jiménez, A., Wang, S., et al. (2012). BAP1 loss defines a new class of renal cell carcinoma. Nat. Genet. 44, 751–759. doi: 10.1038/ng.2323

PubMed Abstract | CrossRef Full Text | Google Scholar

Plaisier, C. L., Horvath, S., Huertas-Vazquez, A., Cruz-Bautista, I., Herrera, M. F., Tusie-Luna, T., et al. (2009). A systems genetics approach implicates USF1, FADS3, and other causal candidate genes for familial combined hyperlipidemia. PLoS Genet. 5:e1000642. doi: 10.1371/journal.pgen.1000642

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43:e47. doi: 10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Siegel, R. L., Miller, K. D., and Jemal, A. (2017). Cancer Statistics, 2017. CA Cancer J. Clin. 67, 7–30. doi: 10.3322/caac.21387

CrossRef Full Text | Google Scholar

Tavazoie, S., Hughes, J. D., Campbell, M. J., Cho, R. J., and Church, G. M. (1999). Systematic determination of genetic network architecture. Nat. Genet. 22, 281–285. doi: 10.1038/10343

PubMed Abstract | CrossRef Full Text | Google Scholar

Von Roemeling, C. A., Radisky, D. C., Marlow, L. A., Cooper, S. J., Grebe, S. K., Anastasiadis, P. Z., et al. (2014). Neuronal pentraxin 2 supports clear cell renal cell carcinoma by activating the AMPA-selective glutamate receptor-4. Cancer Res. 74, 4796–4810. doi: 10.1158/0008-5472.CAN-14-0210

PubMed Abstract | CrossRef Full Text | Google Scholar

Wan, F., Zhu, Y., Han, C., Xu, Q., Wu, J., Dai, B., et al. (2017). Identification and validation of an eight-gene expression signature for predicting high Fuhrman grade renal cell carcinoma. Int. J. Cancer 140, 1199–1208. doi: 10.1002/ijc.30535

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Hu, H., Tian, F., Zhou, W., Zhou, S., and Wang, J. (2015). Expression of EphA2 protein is positively associated with age, tumor size and Fuhrman nuclear grade in clear cell renal cell carcinomas. Int. J. Clin. Exp. Pathol. 8, 13374–13380.

PubMed Abstract | Google Scholar

Wang, X., Liu, Y., Cao, G., Zhang, X., Xu, H., Xu, H., et al. (2015). Expression of the EphA1 protein is associated with Fuhrman nuclear grade in clear cell renal cell carcinomas. Int. J. Clin. Exp. Pathol. 8, 6821–6827.

PubMed Abstract | Google Scholar

Wei, X., Choudhury, Y., Lim, W. K., Anema, J., Kahnoski, R. J., Lane, B., et al. (2017). Recognizing the continuous nature of expression heterogeneity and clinical outcomes in clear cell renal cell carcinoma. Sci. Rep. 7:7342. doi: 10.1038/s41598-017-07191-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: clear cell renal cell carcinoma (ccRCC), differentially expressed genes (DEGs), weighted gene co-expression network analysis (WGCNA), survival prognosis, pathological stage

Citation: Chen L, Yuan L, Qian K, Qian G, Zhu Y, Wu C-L, Dan HC, Xiao Y and Wang X (2018) Identification of Biomarkers Associated With Pathological Stage and Prognosis of Clear Cell Renal Cell Carcinoma by Co-expression Network Analysis. Front. Physiol. 9:399. doi: 10.3389/fphys.2018.00399

Received: 13 December 2017; Accepted: 04 April 2018;
Published: 18 April 2018.

Edited by:

Angelica Merlot, University of Sydney, Australia

Reviewed by:

Fabio Grizzi, Humanitas Research Hospital, Italy
Antonio Longo, Università degli Studi di Catania, Italy
Philipp Kapranov, Huaqiao University, China

Copyright © 2018 Chen, Yuan, Qian, Qian, Zhu, Wu, Dan, Xiao and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yu Xiao, yu.xiao@whu.edu.cn
Xinghuan Wang, wangxinghuan@whu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.