P3DB: An Integrated Database for Plant Protein Phosphorylation

Yao, Qiuming; Bollinger, Curtis; Gao, Jianjiong; Xu, Dong; Thelen, Jay J.

doi:10.3389/fpls.2012.00206

ORIGINAL RESEARCH article

Front. Plant Sci., 07 September 2012

Sec. Plant Proteomics and Protein Structural Biology

Volume 3 - 2012 | https://doi.org/10.3389/fpls.2012.00206

This article is part of the Research Topic Plant Proteomic Resources View all 18 articles

P3DB: An Integrated Database for Plant Protein Phosphorylation

Qiuming Yao^1,2

Curtis Bollinger^1,2

Jianjiong Gao³

Dong Xu^1,2

Jay J. Thelen^2,4*

¹ Department of Computer Science, University of Missouri, Columbia, MO, USA
² Bond Life Science Center, University of Missouri, Columbia, MO, USA
³ Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, NY, USA
⁴ Department of Biochemistry, University of Missouri, Columbia, MO, USA

Protein phosphorylation is widely recognized as the most widespread, enzyme-catalyzed post-translational modification in eukaryotes. In particular, plants have appropriated this signaling mechanism as evidenced by the twofold higher frequency of protein kinases within the genome compared to other eukaryotes. While all aspects of plant protein phosphorylation research have grown in the past 10 years; phosphorylation site mapping using high-resolution mass spectrometry has grown exponentially. In Arabidopsis alone there are thousands of experimentally determined phosphorylation sites. To archive these events in a user-intuitive format we have developed P³DB, the Plant Protein Phosphorylation Database (p3db.org). This database is a repository for plant protein phosphorylation site data, currently hosting information on 32,963 non-redundant sites collated from 23 experimental studies from six plant species. These data can be queried for a protein-of-interest using an integrated BLAST module to query similar sequences with known phosphorylation sites among the multiple plants currently investigated. The paper demonstrates how this resource can help identify functionally conserved phosphorylation sites in plants using a multi-system approach.

Introduction

Protein phosphorylation is one of the most ubiquitous post-translational modifications (PTM), controlling signaling pathways, metabolic, and cellular processes. It is evident that plants are particularly adept at this form of protein modulation as the genome of the reference plant Arabidopsis thaliana contains slightly over 1000 protein kinases, at least twice as many on a gene frequency basis compared to man or fungi (Wang et al., 2007). As most kinases are capable of phosphorylating multiple proteins, it is likely that the majority of the plant proteome has the potential to be phosphorylated, if only transiently. Technology and methodology to experimentally catalog the plant phosphoproteome have existed for over a decade (Neubauer and Mann, 1999; Ficarro et al., 2002; Gruhler et al., 2005), and there has been a steady increase in the number of mapped phosphorylation sites collectively from model and crop plants (Nuhse et al., 2004, 2007; Wolschin and Weckwerth, 2005; Benschop et al., 2007; de la Fuente van Bentem et al., 2008; Sugiyama et al., 2008; Whiteman et al., 2008; Hsu et al., 2009; Ito et al., 2009; Jones et al., 2009; Li et al., 2009; Reiland et al., 2009; Wang et al., 2009; Chen et al., 2010; Grimsrud et al., 2010; Kline et al., 2010; Nakagami et al., 2010; Bi et al., 2011; Engelsberger and Schulze, 2012; Meyer et al., 2012).

The information related to phosphorylation site data is multi-dimensional and includes experiment-specific information (i.e., meta-data, such as experimental parameters and data quality metrics), peptide sequence, phosphorylation site, mass spectra, etc. As a result there is a clear need to hierarchically store and organize the increasingly large phosphoproteomics data, motivating the creation of phosphorylation databases in the past few years. HPRD, the Human Protein Reference Database (Prasad et al., 2009) provides a wide range of data of phosphorylation and other modifications, but the data are limited to human. Phosphosite (Hornbeck et al., 2012) also provides large datasets but mainly on human, rat, and mouse. Phospho.ELM (Diella et al., 2008; Dinkel et al., 2011) also contains data on human and mouse. PhosphoPep (Bodenmiller et al., 2008) archives phosphoproteomic data from yeast, worm, and fly in addition to human. PHOSIDA (Gnad et al., 2011) provides phosphoproteomic data from a comprehensive group of organisms from human to bacteria, but again no plant data are available. UniProt (Farriol-Mathis et al., 2004) is another source of protein sequences and annotations for all kind of modifications; however, phosphorylation data are limited (Table 1). As few of these databases accommodate plant phosphoproteomics data two recent databases were created, PhosPhAt and P³DB. PhosPhAt (Heazlewood et al., 2008; Durek et al., 2010) is a database that specifically maintains experimental phosphorylation site data for Arabidopsis, whereas the Plant Protein Phosphorylation DataBase, or P³DB (Gao et al., 2009), is a comprehensive repository for all plant phosphoproteomic data, including Arabidopsis.

TABLE 1

Table 1. Phosphorylation site data for plants in UniProt and P³DB 2.0.

P³DB is both a web portal and an integrated database specifically for archiving and disseminating plant protein phosphoproteomics data. It integrates data from different plants, experimental approaches, and spectral search algorithms. P³DB is also a reliable data source for computational analysis since all of its data are downloadable. It can help to easily test a hypothesis and make inferences based on the high-quality phosphoproteomics data. For example, disorder is a feature to measure the local structure stability of proteins. The higher the disorder score, the more flexible that local structure is likely to be. There are hypotheses that protein phosphorylation occurs preferentially within disordered regions (Iakoucheva et al., 2004). P³DB provides a resource to further explore this relationship.

Since the release of P³DB 1.0 (Gao et al., 2009), this database has been under extensive development in terms of software/tool infrastructure, phosphoproteome coverage, and integration of these datasets with contextual information about phosphorylation site assignment and function. In this paper, we highlight the current status of the database and some of the recent developments since the original publication of the database (Gao et al., 2009). We also further explore the relationship between protein phosphorylation and disorder analysis using this rich source of plant phosphoproteomics data.

Database Infrastructure

Data

In P³DB version 2.0, phosphoproteomics data were integrated from 23 experimental studies including 11,601 phosphoproteins harboring 32,962 non-redundant phosphorylation sites. This new dataset covered six different organisms (A. thaliana, Brassica napus, Glycine max, Medicago truncatula, Oryza sativa, and Zea mays), with different experimental designs including mass spectrometry instrumentation (e.g., chromatography method, dissociation techniques, etc.) as well as data mining algorithms (and associated output). Consequently, the data were of different quality. To maintain the high-quality data within P³DB, while also expanding this database in accordance with the growing body of global phosphoproteomic studies, simple quality control standards were implemented. To be deposited into P³DB, datasets must meet the following criteria: FDR <1%; precursor mass accuracy <15 ppm; and PTM-score >84, in which case identification of the phosphorylation site is statistically significant compared to background. Additionally, all database searching must be performed with homologous proteome databases.

Interface

P³DB 2.0 has a new user-friendly web interface with better visualization for phosphoproteomics data and for data searching. Hierarchical design was applied to the front-end display as well as the database schema which not only facilitates user access to the interested portion of the database but also provides a comprehensive viewer to explore the data from all aspects. The data were first classified by organisms as the highest level, then proteins, followed by peptides, finishing with phosphorylation sites. Annotated mass spectra were mapped to the peptides if provided from experimental side. The data sources were listed in a drop-down list, which can be used as a filter by user selection at any scale level. Phosphoproteins in P³DB are also seamlessly linked to prediction results from the phosphorylation site prediction tool Musite (Gao et al., 2010). More features can be seen directly from the example section.

Standard Output

Besides just exploring the details and information related to site phosphorylation, P³DB also allows the user to extract the batch data from the download module. Phosphoprotein report, phosphosite report, non-redundant phosphopeptide report, and redundant phosphopeptide report are four of our standardized file formats. Phosphoprotein reports contain all the protein sequences and phosphosite reports have all phosphorylated sites listed. Therefore, by simply combining these two report files, it is easy to convert to MusiteXML, which can be uploaded to Musite for additional analysis or prediction.

Implementation

MySQL was used for storage, maintenance, and operation of P³DB. The front-end web interface was implemented in PHP, which is a popular scripting language for dynamic webpages. In P³DB 2.0, a well-defined and packaged JavaScript called jQuery was used to enhance the interface of the website and improve user experience. The database, specifically the display of data, was both designed as a hierarchical structure supporting navigation among proteins, peptides, phosphorylation sites, and mass spectra (if available). Original citations were also linked to all of the archived data.

Example of Use

Data Analysis

Through the portal http://p3db.org, the new home page for the P³DB 2.0 website is shown (Figure 1). Table 1 lists the fraction of phosphorylated Ser, Thr, and Tyr in each organism and the comparison to the UniProt release (February 2012).

FIGURE 1

Figure 1. Main page of P3DB 2.0 website.

P³DB 2.0 provides a great source to analyze the disorder characteristic that could be potentially related to site phosphorylation. Disorder score for each phosphosite can be calculated by VSL2B (Obradovic et al., 2005) ranging from 0 to 1. All of the scores were classified into the 200 equally divided bins. Then the obtained histograms were normalized by the total number of counted sites and allowing the empirical distribution to be obtained. For each organism in P³DB 2.0, the empirical distributions on both phosphorylated and non-phosphorylated sites were plotted (Figure 2). The horizontal axis from left to right shows the local structure from non-disordered/low-disordered to high-disordered region. The normalized frequency in the vertical axis depicts the density of the empirical distribution.

FIGURE 2

Figure 2. Disorder score distribution in six organisms. (A) Arabidopsis thaliana, (B) Brassica napus, (C) Glycine max, (D) Medicago truncatula, (E) Oryza sativa, and (F) Zea mays.

Two-sample Komolgorov–Smirnov test was used to determine if the two disorder distributions were significantly different for phospho- and non-phosphosites. The results are listed in Table 2.

TABLE 2

Table 2. Results of Komolgorov–Smirnov test for disorder distributions of phospho- and non-phosphosites.

Specific Protein Study

Besides the meta-data analysis, P³DB provides multiple tools for searching, retrieving, and visualizing phosphoproteomics data. In the following example we show the process of retrieving the homologous proteins and exploring the potential conservative sites among different studies within the P³DB.

60S acidic ribosomal protein plays a very important role in the elongation step of protein biosynthesis, which exists as heteromeric complex of subunits P1, P2, and P3 (Tchorzewski, 2002). The Arabidopsis homologs (Figure 3), P1 subunit O23095 (AT4G00810) and P2 subunit AAC73029 (AT2G27710; Tchorzewski, 2002), can be obtained directly by typing O23095 and AAC73029 in the search box without even knowing the TAIR numbers, because the current P³DB 2.0 supports more than 25 types of different accession number (IDs) in the search module and does the mapping internally. Other than querying by accession number, data can also be searched using protein description, annotations, and protein names within the search module. The search module also allows the user to filter by organisms or studies to narrow down the search space. On the protein page, all accession numbers in the system can be obtained so that we can easily know or verify the TAIR numbers (AT4G00810 and AT2G27710) for our target proteins O23095 and AAC73029. The protein page shows the phosphorylation sites on the whole sequence with the sites highlighted. By looking at the protein pages for our P1 and P2 subunits (Figure 3), it is obvious that the location of the only phospho-serine is conserved in both proteins, with a motif EES(*)DDD.

FIGURE 3

Figure 3. Example of using search engine to access the homologous proteins.

To determine if this phospho-serine residue is conserved, we can perform a BLAST search through P³DB using the protein sequence O23095 and find the orthologous protein in rice (Oryza sativa). Although the best hit is a P3 subunit, it does contain the conserved phosphorylation site too. We can also try to search for the other homologous proteins within Arabidopsis, i.e., paralogs. With the best hit of AT3G44590, two sites are observed to be phosphorylated other than the one that is strongly conserved. Sequence conservation of phosphorylation sites can be a useful indicator for conservation of regulatory function.

Continuing with protein AT3G44590, we can explore for further information in a hierarchical manner (Figure 4). The protein page usually has multiple links for the phosphorylated sites where site details are shown like flanking sequence and the number of available spectra in P³DB. Since the same site might be observed in different peptides, each non-redundant peptide link leads to the specific peptide page. The peptide page contains the details of the experimental condition and parameters, each of which is mapped to the spectrum. If the annotated mass spectrum is available (we request this from all users who deposit data into P³DB), the visualization tool will bring the mass spectrum and annotated peptide fragmentation series in the new screen.

FIGURE 4

Figure 4. Detailed information for a certain phosphorylation site or peptide. (A) Protein page, (B) Phospho-site page, (C) Peptide page, (D) Spectra parameters on peptide page, (E) Spectra viewer.

Furthermore, the long green button “send to Musite for phosphorylation prediction” on the protein page will send the sequence to Musite on the fly to do prediction. Therefore, the standard output of P³DB facilitates the communication between the experimental reliable dataset with the more extensive computational tools and analyses.

Discussion

Table 1 shows that P³DB 2.0 contains considerably more plant phosphoproteomics data than the recent version (February 2012) of the comprehensive protein database UniProt. This is because the phosphorylation data within UniProt is almost exclusively from Arabidopsis. P³DB 2.0 has a wide range of different plant species, reflecting the expansion of phosphoproteomics research in Viridiplantae. The overall variability in phospho-amino acid frequency among the various plants likely reflects the preliminary nature of cataloging the plant phosphoproteome, particularly for crop species.

The plots in Figure 2 show the different disorder distribution for phospho- and non-phosphosites among different organisms. The quantitative analysis by Komolgorov–Smirnov test (Table 2) shows that in most of the cases the disorder scores for each amino acid were differentially distributed, and for phospho- and non-phosphosites the disorder scores were also significantly different.

Although the plots are more descriptive (Figure 2), they are comparable since they are calculated on a fixed scale. For the non-phosphorylated sites, the density curves are almost the same among six species. For the phosphorylated sites, since the data are much sparser, the density curves are not as smooth as the non-phosphorylated curves, and they also differ from each other. However, the relative relationship among these density curves is preserved among different species. First, the disorder scores of the non-phosphorylated sites tend to be distributed in the lower disordered regions more so than the phosphorylated sites. Especially for the curves of non-phosphorylated Thr and Tyr, the maxima, and the large population are in the regions of less than 0.5, while for the phosphorylated sites, the density curves almost grow exponentially in the highly disordered region. Although for Ser, both curves grow in the highly disordered region, it is noteworthy that the non-phosphorylated density is above the phosphorylated density in the lowly disordered region, and this relationship is inverted in the highly disordered region. This clearly shows that a large population of the phosphorylated sites has the tendency to be in the disordered region while the non-phosphorylated sites are more likely to be located in non-disordered regions for all the species in this study. Second, the distribution patterns for Ser, Thr, and Tyr differ; however, the relationship in both highly disordered and lowly disordered regions is the same in different species. In the non-phosphorylated case, the Tyr curve is above the Thr, and the Ser curve is at the bottom, while in the phosphorylated case, the relation flips as the order of high density to low is Ser, Thr, and Tyr.

P³DB 2.0 displays the data in a relational, hierarchical manner that integrates proteins, peptides, phosphosites, and spectra for each phosphorylation event. Various search and query tools (search by protein accession, description, annotations, protein names, and protein/peptide sequences) are embedded within the P³DB website framework allowing for seamless interrogation of the data without leaving the site. For example, the search module provides the user with multiple ways to access to proteins of interest and the BLAST search tool allows for comparative studies at the primary sequence level.

In summary, P³DB version 2.0 now has become an integrated data bank and portal driven by the rapidly growing field of high-throughput phosphorylation site mapping. For bioinformaticist or experimental biologist, P³DB provides both the tools and experimental resources for querying plant protein phosphorylation data.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was supported by National Science Foundation (grant numbers 0604439 and 1126992) and National Institute of Health (grant numbers R21/R33 GM078601 and R01-GM100701).

References

Benschop, J. J., Mohammed, S., O’Flaherty, M., Heck, A. J., Slijper, M., and Menke, F. L. (2007). Quantitative phosphoproteomics of early elicitor signaling in Arabidopsis. Mol. Cell. Proteomics 6, 1198–1214.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Bi, Y. D., Wang, H. X., Lu, T. C., Li, X. H., Shen, Z., Chen, Y. B., and Wang, B. C. (2011). Large-scale analysis of phosphorylated proteins in maize leaf. Planta 233, 383–392.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Bodenmiller, B., Campbell, D., Gerrits, B., Lam, H., Jovanovic, M., Picotti, P., Schlapbach, R., and Aebersold, R. (2008). PhosphoPep – a database of protein phosphorylation sites in model organisms. Nat. Biotechnol. 26, 1339–1340.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Chen, Y., Hoehenwarter, W., and Weckwerth, W. (2010). Comparative analysis of phytohormone-responsive phosphoproteins in Arabidopsis thaliana using TiO2-phosphopeptide enrichment and mass accuracy precursor alignment. Plant J. 63, 1–17.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

de la Fuente van Bentem, S., Anrather, D., Dohnal, I., Roitinger, E., Csaszar, E., Joore, J., Buijnink, J., Carreri, A., Forzani, C., Lorkovic, Z. J., Barta, A., Lecourieux, D., Verhounig, A., Jonak, C., and Hirt, H. (2008). Site-specific phosphorylation profiling of Arabidopsis proteins by mass spectrometry and peptide chip analysis. J. Proteome Res. 7, 2458–2470.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Diella, F., Gould, C. M., Chica, C., Via, A., and Gibson, T. J. (2008). Phospho.ELM: a database of phosphorylation sites – update 2008. Nucleic Acids Res. 36, D240–D244.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Dinkel, H., Chica, C., Via, A., Gould, C. M., Jensen, L. J., Gibson, T. J., and Diella, F. (2011). Phospho.ELM: a database of phosphorylation sites – update 2011. Nucleic Acids Res. 39, D261–D267.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Durek, P., Schmidt, R., Heazlewood, J. L., Jones, A., Maclean, D., Nagel, A., Kersten, B., and Schulze, W. X. (2010). PhosPhAt: the Arabidopsis thaliana phosphorylation site database. An update. Nucleic Acids Res 38, D828–D834.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Engelsberger, W. R., and Schulze, W. X. (2012). Nitrate and ammonium lead to distinct global dynamic phosphorylation patterns when resupplied to nitrogen-starved Arabidopsis seedlings. Plant J. 69, 978–995.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Farriol-Mathis, N., Garavelli, J. S., Boeckmann, B., Duvaud, S., Gasteiger, E., Gateau, A., Veuthey, A. L., and Bairoch, A. (2004). Annotation of post-translational modifications in the Swiss-Prot knowledge base. Proteomics 4, 1537–1550.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Ficarro, S. B., Mccleland, M. L., Stukenberg, P. T., Burke, D. J., Ross, M. M., Shabanowitz, J., Hunt, D. F., and White, F. M. (2002). Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiae. Nat. Biotechnol. 20, 301–305.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gao, J., Agrawal, G. K., Thelen, J. J., and Xu, D. (2009). P3DB: a plant protein phosphorylation database. Nucleic Acids Res. 37, D960–D962.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gao, J., Thelen, J. J., Dunker, A. K., and Xu, D. (2010). Musite, a tool for global prediction of general and kinase-specific phosphorylation sites. Mol. Cell. Proteomics 9, 2586–2600.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gnad, F., Gunawardena, J., and Mann, M. (2011). PHOSIDA 2011: the posttranslational modification database. Nucleic Acids Res. 39, D253–D260.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Grimsrud, P. A., Den Os, D., Wenger, C. D., Swaney, D. L., Schwartz, D., Sussman, M. R., Ane, J. M., and Coon, J. J. (2010). Large-scale phosphoprotein analysis in Medicago truncatula roots provides insight into in vivo kinase activity in legumes. Plant Physiol. 152, 19–28.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Gruhler, A., Olsen, J. V., Mohammed, S., Mortensen, P., Faergeman, N. J., Mann, M., and Jensen, O. N. (2005). Quantitative phosphoproteomics applied to the yeast pheromone signaling pathway. Mol. Cell. Proteomics 4, 310–327.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Heazlewood, J. L., Durek, P., Hummel, J., Selbig, J., Weckwerth, W., Walther, D., and Schulze, W. X. (2008). PhosPhAt: a database of phosphorylation sites in Arabidopsis thaliana and a plant-specific phosphorylation site predictor. Nucleic Acids Res. 36, D1015–D1021.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Hornbeck, P. V., Kornhauser, J. M., Tkachev, S., Zhang, B., Skrzypek, E., Murray, B., Latham, V., and Sullivan, M. (2012). PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse. Nucleic Acids Res. 40, D261–D270.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Hsu, J. L., Wang, L. Y., Wang, S. Y., Lin, C. H., Ho, K. C., Shi, F. K., and Chang, I. F. (2009). Functional phosphoproteomic profiling of phosphorylation sites in membrane fractions of salt-stressed Arabidopsis thaliana. Proteome Sci. 7, 42.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Iakoucheva, L. M., Radivojac, P., Brown, C. J., O’Connor, T. R., Sikes, J. G., Obradovic, Z., and Dunker, A. K. (2004). The importance of intrinsic disorder for protein phosphorylation. Nucleic Acids Res. 32, 1037–1049.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Ito, J., Taylor, N. L., Castleden, I., Weckwerth, W., Millar, A. H., and Heazlewood, J. L. (2009). A survey of the Arabidopsis thaliana mitochondrial phosphoproteome. Proteomics 9, 4229–4240.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Jones, A. M., Maclean, D., Studholme, D. J., Serna-Sanz, A., Andreasson, E., Rathjen, J. P., and Peck, S. C. (2009). Phosphoproteomic analysis of nuclei-enriched fractions from Arabidopsis thaliana. J. Proteomics 72, 439–451.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Kline, K. G., Barrett-Wilt, G. A., and Sussman, M. R. (2010). In planta changes in protein phosphorylation induced by the plant hormone abscisic acid. Proc. Natl. Acad. Sci. U.S.A. 107, 15986–15991.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Li, H., Wong, W. S., Zhu, L., Guo, H. W., Ecker, J., and Li, N. (2009). Phosphoproteomic analysis of ethylene-regulated protein phosphorylation in etiolated seedlings of Arabidopsis mutant ein2 using two-dimensional separations coupled with a hybrid quadrupole time-of-flight mass spectrometer. Proteomics 9, 1646–1661.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Meyer, L. J., Gao, J., Xu, D., and Thelen, J. J. (2012). Phosphoproteomic analysis of seed maturation in Arabidopsis, rapeseed, and soybean. Plant Physiol. 159, 517–528.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Nakagami, H., Sugiyama, N., Mochida, K., Daudi, A., Yoshida, Y., Toyoda, T., Tomita, M., Ishihama, Y., and Shirasu, K. (2010). Large-scale comparative phosphoproteomics identifies conserved phosphorylation sites in plants. Plant Physiol. 153, 1161–1174.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Neubauer, G., and Mann, M. (1999). Mapping of phosphorylation sites of gel-isolated proteins by nanoelectrospray tandem mass spectrometry: potentials and limitations. Anal. Chem. 71, 235–242.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Nuhse, T. S., Bottrill, A. R., Jones, A. M., and Peck, S. C. (2007). Quantitative phosphoproteomic analysis of plasma membrane proteins reveals regulatory mechanisms of plant innate immune responses. Plant J. 51, 931–940.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Nuhse, T. S., Stensballe, A., Jensen, O. N., and Peck, S. C. (2004). Phosphoproteomics of the Arabidopsis plasma membrane and a new phosphorylation site database. Plant Cell 16, 2394–2405.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Obradovic, Z., Peng, K., Vucetic, S., Radivojac, P., and Dunker, A. K. (2005). Exploiting heterogeneous sequence properties improves prediction of protein disorder. Proteins 61(Suppl. 7), 176–182.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Prasad, T. S., Kandasamy, K., and Pandey, A. (2009). Human Protein Reference Database and Human Proteinpedia as discovery tools for systems biology. Methods Mol. Biol. 577, 67–79.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Reiland, S., Messerli, G., Baerenfaller, K., Gerrits, B., Endler, A., Grossmann, J., Gruissem, W., and Baginsky, S. (2009). Large-scale Arabidopsis phosphoproteome profiling reveals novel chloroplast kinase substrates and phosphorylation networks. Plant Physiol. 150, 889–903.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Sugiyama, N., Nakagami, H., Mochida, K., Daudi, A., Tomita, M., Shirasu, K., and Ishihama, Y. (2008). Large-scale phosphorylation mapping reveals the extent of tyrosine phosphorylation in Arabidopsis. Mol. Syst. Biol. 4, 193.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Tchorzewski, M. (2002). The acidic ribosomal P proteins. Int. J. Biochem. Cell Biol. 34, 911–915.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wang, H., Chevalier, D., Larue, C., Ki Cho, S., and Walker, J. C. (2007). The Protein phosphatases and protein kinases of Arabidopsis thaliana. Arabidopsis Book 5, e0106.

Pubmed Abstract | Pubmed Full Text

Wang, Z., Dong, G., Singh, S., Steen, H., and Li, J. (2009). A simple and effective method for detecting phosphopeptides for phosphoproteomic analysis. J. Proteomics 72, 831–835.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Whiteman, S. A., Serazetdinova, L., Jones, A. M., Sanders, D., Rathjen, J., Peck, S. C., and Maathuis, F. J. (2008). Identification of novel proteins and phosphorylation sites in a tonoplast enriched membrane fraction of Arabidopsis thaliana. Proteomics 8, 3536–3547.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wolschin, F., and Weckwerth, W. (2005). Combining metal oxide affinity chromatography (MOAC) and selective mass spectrometry for robust identification of in vivo protein phosphorylation sites. Plant Methods 1, 9.

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Keywords: protein phosphorylation, P³DB, mass spectrometry, plants, data repository, phosphoproteomics

Citation: Yao Q, Bollinger C, Gao J, Xu D and Thelen JJ (2012) P³DB: an integrated database for plant protein phosphorylation. Front. Plant Sci. 3:206. doi: 10.3389/fpls.2012.00206

Received: 30 May 2012; Accepted: 14 August 2012;
Published online: 07 September 2012.

Edited by:

Joshua L. Heazlewood, Lawrence Berkeley National Laboratory, USA

Reviewed by:

Dirk Walther, Max Planck Institute for Molecular Plant Physiology, Germany
Borjana Arsova, Heinrich-Heine University, Germany

Copyright: © 2012 Yao, Bollinger, Gao, Xu and Thelen. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: Jay J. Thelen, Department of Biochemistry, Christopher S. Bond Life Sciences Center, 271G Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA. e-mail: thelenj@missouri.edu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.