Previous Article | Next Article 
Molecular and Cellular Biology, March 1999, p. 1720-1730, Vol. 19, No. 3
0270-7306/99/$04.00+0
Copyright © 1999, American Society for Microbiology. All rights reserved.
Correlation between Protein and mRNA Abundance
in Yeast
Steven P.
Gygi,
Yvan
Rochon,
B. Robert
Franza, and
Ruedi
Aebersold*
Department of Molecular Biotechnology,
University of Washington, Seattle, Washington 98195-7730
Received 5 October 1998/Returned for modification 11 November
1998/Accepted 2 December 1998
 |
ABSTRACT |
We have determined the relationship between mRNA and protein
expression levels for selected genes expressed in the yeast
Saccharomyces cerevisiae growing at mid-log phase. The
proteins contained in total yeast cell lysate were separated by
high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots were excised and identified by capillary liquid
chromatography-tandem mass spectrometry (LC-MS/MS). Protein spots were
quantified by metabolic labeling and scintillation counting.
Corresponding mRNA levels were calculated from serial analysis of gene
expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 88:243-251, 1997).
We found that the correlation between mRNA and protein levels was
insufficient to predict protein expression levels from quantitative
mRNA data. Indeed, for some genes, while the mRNA levels were of the
same value the protein levels varied by more than 20-fold. Conversely,
invariant steady-state levels of certain proteins were observed with
respective mRNA transcript levels that varied by as much as 30-fold.
Another interesting observation is that codon bias is not a predictor
of either protein or mRNA levels. Our results clearly delineate the
technical boundaries of current approaches for quantitative analysis of
protein expression and reveal that simple deduction from mRNA
transcript analysis is insufficient.
 |
INTRODUCTION |
The description of the state of a
biological system by the quantitative measurement of the system
constituents is an essential but largely unexplored area of biology.
With recent technical advances including the development of
differential display-PCR (21), of cDNA microarray and DNA
chip technology (20, 27), and of serial analysis of gene
expression (SAGE) (34, 35), it is now feasible to establish
global and quantitative mRNA expression profiles of cells and tissues
in species for which the sequence of all the genes is known. However,
there is emerging evidence which suggests that mRNA expression patterns
are necessary but are by themselves insufficient for the quantitative
description of biological systems. This evidence includes discoveries
of posttranscriptional mechanisms controlling the protein translation
rate (15), the half-lives of specific proteins or mRNAs
(33), and the intracellular location and molecular
association of the protein products of expressed genes (32).
Proteome analysis, defined as the analysis of the protein complement
expressed by a genome (26), has been suggested as an approach to the quantitative description of the state of a biological system by the quantitative analysis of protein expression profiles (36). Proteome analysis is conceptually attractive because
of its potential to determine properties of biological systems that are
not apparent by DNA or mRNA sequence analysis alone. Such properties
include the quantity of protein expression, the subcellular location,
the state of modification, and the association with ligands, as well as
the rate of change with time of such properties. In contrast to the
genomes of a number of microorganisms (for a review, see reference
11) and the transcriptome of Saccharomyces cerevisiae (35), which have been entirely determined,
no proteome map has been completed to date.
The most common implementation of proteome analysis is the combination
of two-dimensional gel electrophoresis (2DE) (isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacrylamide gel
electrophoresis) for the separation and quantitation of proteins with
analytical methods for their identification. 2DE permits the
separation, visualization, and quantitation of thousands of proteins
reproducibly on a single gel (18, 24). By itself, 2DE is
strictly a descriptive technique. The combination of 2DE with protein
analytical techniques has added the possibility of establishing the
identities of separated proteins (1, 2) and thus, in
combination with quantitative mRNA analysis, of correlating
quantitative protein and mRNA expression measurements of selected genes.
The recent introduction of mass spectrometric protein analysis
techniques has dramatically enhanced the throughput and sensitivity of
protein identification to a level which now permits the large-scale analysis of proteins separated by 2DE. The techniques have reached a
level of sensitivity that permits the identification of essentially any
protein that is detectable in the gels by conventional protein staining
(9, 29). Current protein analytical technology is based on
the mass spectrometric generation of peptide fragment patterns that are
idiotypic for the sequence of a protein. Protein identity is
established by correlating such fragment patterns with sequence
databases (10, 22, 37). Sophisticated computer software
(8) has automated the entire process such that proteins are
routinely identified with no human interpretation of peptide fragment patterns.
In this study, we have analyzed the mRNA and protein levels of a group
of genes expressed in exponentially growing cells of the yeast S. cerevisiae. Protein expression levels were quantified by metabolic
labeling of the yeast proteins to a steady state, followed by 2DE and
liquid scintillation counting of the selected, separated protein
species. Separated proteins were identified by in-gel tryptic digestion
of spots with subsequent analysis by microspray liquid
chromatography-tandem mass spectrometry (LC-MS/MS) and sequence
database searching. The corresponding mRNA transcript levels were
calculated from SAGE frequency tables (35).
This study, for the first time, explores a quantitative comparison of
mRNA transcript and protein expression levels for a relatively large
number of genes expressed in the same metabolic state. The resultant
correlation is insufficient for prediction of protein levels from mRNA
transcript levels. We have also compared the relative amounts of
protein and mRNA with the respective codon bias values for the
corresponding genes. This comparison indicates that codon bias by
itself is insufficient to accurately predict either the mRNA or the
protein expression levels of a gene. In addition, the results
demonstrate that only highly expressed proteins are detectable by 2DE
separation of total cell lysates and that therefore the construction of
complete proteome maps with current technology will be very
challenging, irrespective of the type of organism.
 |
MATERIALS AND METHODS |
Yeast strain and growth conditions.
The source of protein
and message transcripts for all experiments was YPH499
(MATa ura3-52 lys2-801 ade2-101 leu2-
1 his3-
200
trp1-
63) (30). Logarithmically growing cells were obtained by growing yeast cells to early log phase (3 × 106 cells/ml) in YPD rich medium (YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) at 30°C
(35). Metabolic labeling of protein was accomplished in YPD
medium exactly as described elsewhere (4) with the exception
that 1 ml of cells was labeled with 3 mCi to offset methionine present
in YPD medium. Protein was harvested as described by Garrels and
coworkers (12). Harvested protein was lyophilized,
resuspended in isoelectric focusing gel rehydration solution, and
stored at
80°C.
2DE.
Soluble proteins were run in the first dimension by
using a commercial flatbed electrophoresis system (Multiphor II;
Pharmacia Biotech). Immobilized polyacrylamide gel (IPG) dry strips
with nonlinear pH 3.0 to 10.0 gradients (Amersham-Pharmacia Biotech) were used for the first-dimension separation. Forty micrograms of
protein from whole-cell lysates was mixed with IPG strip rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 to
380 µl of solution was added to individual lanes of an IPG strip
rehydration tray (Amersham-Pharmacia Biotech). The strips were allowed
to rehydrate at room temperature for 1 h. The samples were run at
300 V-10 mA-5 W for 2 h, then ramped to 3,500 V-10 mA-5 W over
a period of 3 h, and then kept at 3,500 V-10 mA-5 W for 15 to
19 h. At the end of the first-dimension run (60 to 70 kV · h), the IPG strips were reequilibrated for 8 min in 2% (wt/vol)
dithiothreitol in 2% (wt/vol) SDS-6 M urea-30% (wt/vol) glycerol-0.05 M Tris HCl (pH 6.8) and for 4 min in 2.5% iodoacetamide in 2% (wt/vol) SDS-6 M urea-30% (wt/vol) glycerol-0.05 M Tris HCl
(pH 6.8). Following reequilibration, the strips were transferred and
apposed to 10% polyacrylamide second-dimension gels. Polyacrylamide gels were poured in a casting stand with 10% acrylamide-2.67% piperazine diacrylamide-0.375 M Tris base-HCl (pH 8.8)-0.1%
(wt/vol) SDS-0.05% (wt/vol) ammonium persulfate-0.05% TEMED
(N,N,N',N'-tetramethylethylenediamine) in Milli-Q water. The apparatus used to run second-dimension gels was a
noncommercial apparatus from Oxford Glycosciences, Inc. Once the IPG
strips were apposed to the second-dimension gels, they were immediately
run at 50 mA (constant)-500 V-85 W for 20 min, followed by 200 mA
(constant)-500 V-85 W until the buffer front line was 10 to 15 mm
from the bottom of the gel. Gels were removed and silver stained
according to the procedure of Shevchenko et al. (29).
Protein identification.
Gels were exposed to X-ray film
overnight, and then the silver staining and film were used to excise
156 spots of varying intensities, molecular weights, and isoelectric
focusing points. In order to increase the detection limit by mass
spectrometry, spots were cut out and pooled from up to four identical
cold, silver-stained gels. In-gel tryptic digests of pooled spots were performed as described previously (29). Tryptic peptides
were analyzed by microcapillary LC-MS with automated switching to MS/MS mode for peptide fragmentation. Spectra were searched against the
composite OWL protein sequence database (version 30.2; 250,514 protein
sequences) (24a) by using the computer program Sequest (8), which matches theoretical and acquired tandem mass
spectra. A protein match was determined by comparing the number of
peptides identified and their respective cross-correlation scores. All protein identifications were verified by comparison with theoretical molecular weights and isoelectric points.
mRNA quantitation.
Velculescu and coworkers have previously
generated frequency tables for yeast mRNA transcripts from the same
strain grown under the same stated conditions as described herein
(35). The SAGE technology is based on two main principles.
First, a short sequence tag (15 bp) that contains sufficient
information uniquely to identify a transcript is generated. A single
tag is usually generated from each mRNA transcript in the cell which
corresponds to 15 bp at the 3'-most cutting site for NlaIII.
Second, many transcript tags can be concatenated into a single molecule
and then sequenced, revealing the identity of multiple tags
simultaneously. Over 20,000 transcripts were sequenced from yeast
strain YPH499 growing at mid-log phase on glucose. Assuming the
previously derived estimate of 15,000 mRNA molecules per cell
(16), this would represent a 1.3-fold coverage even for mRNA
molecules present at a single copy per cell and would provide a 72%
probability of detecting such transcripts. Computer software which took
for input the gene detected, examined the nucleotide sequence, and performed the calculation as described by Velculescu and coworkers (35) was written. In practice, we found that for 21 of 128 (16%) genes examined viable mRNA levels from SAGE data could not be calculated. This was because (i) no CATG site was found in the open
reading frame (ORF), (ii) a CATG site was found but the corresponding 10-bp putative SAGE tag was not found in the frequency tables, or (iii)
identical putative SAGE tags were present for multiple genes (e.g.,
TDH2_YEAST and TDH3_YEAST).
Protein quantitation.
[35S]methionine-labeled
gels were exposed to X-ray film overnight, and then the silver stain
and film were used to excise 156 spots of varying intensities,
molecular weights, and pIs. The excised spots were placed in 0.6-ml
microcentrifuge tubes, and scintillation cocktail (100 µl) was added.
The samples were vortexed and counted. In addition, two parallel gels
were electroblotted to polyvinylidene difluoride membranes. The
membranes were exposed to X-ray film, and four intense single spots
were excised from each membrane and subjected to amino acid analysis.
For these four spots, a mean of 209 ± 4 cpm/pmol of
protein/methionine was found. This number was used to quantitate all
remaining spots in conjunction with the number of methionines present
in the protein.
To ensure that proteins were labeled to equilibrium, parallel 2D gels
were prepared and run on yeast metabolically labeled for 1, 2, 6, or
18 h. The corresponding 156 spots were excised from each gel, and
radioactivity was measured by liquid scintillation counting for each
spot. Calculated protein levels were highly reproducible for all time
points measured after 1 h.
Calculation of codon bias and predicted half-life.
Codon
bias values were extracted from the YPD spreadsheet (17).
Protein half-lives were calculated based on the N-end rule (33). When the N-terminal processing was not known
experimentally, it was predicted based on the affinity of methionine
aminopeptidase (31).
 |
RESULTS |
Characteristics of proteome approach.
Nearly every facet of
proteome analysis hinges on the unambiguous identification of large
numbers of expressed proteins in cells. Several techniques have been
described previously for the identification of proteins separated by
2DE, including N-terminal and internal sequencing (1, 2),
amino acid analysis (38), and more recently mass
spectrometry (25). We utilized techniques based on mass
spectrometry because they afford the highest levels of sensitivity and
provide unambiguous identification. The specific procedure used is
schematically illustrated in Fig. 1 and
is based on three principles. First, proteins are removed from the gel by proteolytic in-gel digestion, and the resulting peptides are separated by on-line capillary high-performance liquid
chromatography. Second, the eluting peptides are ionized and
detected, and the specific peptide ions are selected and
fragmented by the mass spectrometer. To achieve this, the mass
spectrometer switches between the MS mode (for peptide mass
identification) and the MS/MS mode (for peptide characterization and
sequencing). Selected peptides are fragmented by a process called
collision-induced dissociation (CID) to generate a tandem mass spectrum
(MS/MS spectrum) that contains the peptide sequence information. Third,
individual CID mass spectra are then compared by computer algorithms to
predicted spectra from a sequence database. This results in the
identification of the peptide and, by association, the protein(s) in
the spot. Unambiguous protein identification is attained in a single
analysis by the detection of multiple peptides derived from the same
protein.

View larger version (42K):
[in this window]
[in a new window]
|
FIG. 1.
Schematic illustration of proteome analysis by 2DE and
mass spectrometry. In part I, proteins are separated by 2DE, stained
spots are excised and subjected to in-gel digestion with trypsin, and
the resulting peptides are separated by on-line capillary
high-performance liquid chromatography. In part II, a peptide is shown
eluting from the column in part I. The peptide is ionized by
electrospray ionization and enters the mass spectrometer. The mass of
the ionized peptide is detected, and the first quadrupole mass filter
allows only the specific mass-to-charge ratio of the selected peptide
ion to pass into the collision cell. In the collision cell, the
energized, ionized peptides collide with neutral argon gas molecules.
Fragmentation of the peptide is essentially random but occurs mainly at
the peptide bonds, resulting in smaller peptides of differing lengths
(masses). These peptide fragments are detected as a tandem mass (MS/MS)
spectrum in the third quadrupole mass filter where two ion series are
recorded simultaneously, one each from sequencing inward from the N and
C termini of the peptide, respectively. In part III, the MS/MS spectrum
from the selected, ionized peptide is compared to predicted tandem mass
spectra computer generated from a sequence database. Provided that the
peptide sequence exists in the database, the peptide and, by
association, the protein from which the peptide was derived can be
identified. Unambiguous protein identification is attained in a single
analysis because multiple peptides are identified as being derived from
the same protein.
|
|
Protein identification.
Yeast total cell protein lysate (40 µg), metabolically labeled with [35S]methionine, was
electrophoretically separated by isoelectric focusing in the first
dimension and by SDS-10% polyacrylamide gel electrophoresis in the
second dimension. Proteins were visualized by silver staining and by
autoradiography. Of the more than 1,000 proteins visible by silver
staining, 156 spots were excised from the gel and subjected to in-gel
tryptic digestion, and the resulting peptides were analyzed and
identified by microspray LC-MS/MS techniques as described above. The
proteins in this study were all identified automatically by computer
software with no human interpretation of mass spectra. They are
indicated in Fig. 2 and detailed in Table
1.

View larger version (111K):
[in this window]
[in a new window]
|
FIG. 2.
2D silver-stained gel of the proteins in yeast total
cell lysate. Proteins were separated in the first dimension
(horizontal) by isoelectric focusing and then in the second dimension
(vertical) by molecular weight sieving. Protein spots (156) were chosen
to include the entire range of molecular weights, isoelectric focusing
points, and staining intensities. Spots were excised, and the
corresponding protein was identified by mass spectrometry and database
searching. The spots are labeled on the gel and correspond to the data
presented in Table 1. Molecular weights are given in thousands.
|
|
The CID spectra shown in Fig. 3 indicate
that the quality of the identification data generated was suitable for
unambiguous protein identification. The spectra represent the amino
acid sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) and
FAVGAFTDSLR (Fig. 3B). Both peptides were derived from protein S57593
(hypothetical protein YMR226C), which migrated to spot 114 (molecular
weight, 29,156; pI, 6.59) in the 2D gel in Fig. 2. Five other peptides from the same analysis were also computer matched to the same protein
sequence.


View larger version (41K):
[in this window]
[in a new window]
|
FIG. 3.
Tandem mass (MS/MS) spectra resulting from analysis of a
single spot on a 2D gel. The first quadrupole selected a single
mass-to-charge ratio (m/z) of 687.2 (A) or 592.6 (B), while
the collision cell was filled with argon gas, and a voltage which
caused the peptide to undergo fragmentation by CID was applied. The
third quadrupole scanned the mass range from 50 to 1,400 m/z. The computer program Sequest (8) was
utilized to match MS/MS spectra to amino acid sequence by database
searching. Both spectra matched peptides from the same protein, S57593
(yeast hypothetical protein YMR226C). Five other peptides from the same
analysis were matched to the same protein.
|
|
Protein and mRNA quantitation.
For the 156 genes investigated,
the protein expression levels ranged from 2,200 (PGM2) to 863,000 (TDH2/TDH3) copies/cell. The levels of mRNA for each of the genes
identified were calculated from SAGE frequency tables (35).
These tables contain the mRNA levels for 4,665 genes in yeast strain
YPH499 grown to mid-log phase in YPD medium on glucose as a carbon
source. In some instances, the mRNA levels could not be calculated for
reasons stated in Materials and Methods. For the proteins analyzed in
this study, mean transcript levels varied from 0.7 to 473 copies/cell.
Selection of the sample population for mRNA-protein expression
level correlation.
The protein spots selected for identification
were selected from spots visible by silver staining in the 2D gel. An
attempt was made not to include spots where overlap with other spots
was readily apparent. The number of proteins identified was 156 (Table 1). Some proteins migrated to more than one spot (presumably due to
differential protein processing or modifications), and protein levels
from these spots were calculated by integrating the intensities of the
different spots. The 156 protein spots analyzed represented the
products of 128 different genes. Genes were excluded from the
correlation analysis only if part of the data set was missing; i.e.,
genes were excluded if (i) no mRNA expression data were available for
the protein or putative SAGE tags were ambiguous, (ii) the amino acid
sequence did not contain methionine, (iii) more than a single protein
was conclusively identified as migrating to the same gel spot, or (iv)
the theoretical and observed pIs and molecular weights could not be
reconciled. After these criteria were applied, the number of genes used
in the correlation analysis was 106.
Codon bias and predicted half-lives.
Codon bias is thought to
be an indicator of protein expression, with highly expressed proteins
having large codon bias values. The codon bias distribution for the
entire set of more than 6,000 predicted yeast gene ORFs is presented in
Fig. 4A. The interval with the largest
frequency of genes is between the codon bias values of 0.0 and 0.1. This segment contains more than 2,500 genes. The distribution of the
codon bias values of the 128 different genes found in this study (all
protein spots from Fig. 2) is shown in Fig. 4B, and protein half-lives
(predicted from applying the N-end rule [33] to the
experimentally determined or predicted protein N termini) are shown in
Fig. 4C. No genes were identified with codon bias values less than 0.1 even though thousands of genes exist in this category. In addition,
nearly all of the proteins identified had long predicted half-lives
(greater than 30 h).

View larger version (18K):
[in this window]
[in a new window]
|
FIG. 4.
Current proteome analysis technology utilizing 2DE
without preenrichment samples mainly highly expressed and long-lived
proteins. Genes encoding highly expressed proteins generally have large
codon bias values. (A) Distribution of the yeast genome (more than
6,000 genes) based on codon bias. The interval with the largest
frequency of genes is 0.0 to 0.1, with more than 2,500 genes. (B)
Distribution of the genes from identified proteins in this study based
on codon bias. No genes with codon bias values less than 0.1 were
detected in this study. (C) Distribution of identified proteins in this
study based on predicted half-life (estimated by N-end rule).
|
|
Correlation of mRNA and protein expression levels.
The
correlation between mRNA and protein levels of the genes selected as
described above is shown in Fig. 5. For
the entire group (106 genes) for which a complete data set was
generated, there was a general trend of increased protein levels
resulting from increased mRNA levels. The Pearson product moment
correlation coefficient for the whole data set (106 genes) was 0.935. This number is highly biased by a small number of genes with very large protein and message levels. A more representative subset of the data is
shown in the inset of Fig. 5. It shows genes for which the message
level was below 10 copies/cell and includes 69% (73 of 106 genes) of
the data used in the study. The Pearson product moment correlation
coefficient for this data set was only 0.356. We also found that levels
of protein expression coded for by mRNA with comparable abundance
varied by as much as 30-fold and that the mRNA levels coding for
proteins with comparable expression levels varied by as much as
20-fold.

View larger version (10K):
[in this window]
[in a new window]
|
FIG. 5.
Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source.
mRNA and protein levels were calculated as described in Materials and
Methods. The data represent a population of genes with protein
expression levels visible by silver staining on a 2D gel chosen to
include the entire range of molecular weights, isoelectric focusing
points, and staining intensities. The inset shows the low-end portion
of the main figure. It contains 69% of the original data set. The
Pearson product moment correlation for the entire data set was 0.935. The correlation for the inset containing 73 proteins (69%) was only
0.356.
|
|
The distortion of the correlation value induced by the uneven
distribution of the data points along the x axis is further demonstrated by the analysis in Fig. 6.
The 106 samples included in the study were ranked by protein abundance,
and the Pearson product moment correlation coefficient was repeatedly
calculated after including progressively more, and higher-abundance,
proteins in each calculation. The correlation values remained
relatively stable in the range of 0.1 to 0.4 if the lowest-expressed 40 to 95 proteins used in this study were included. However, the
correlation value steadily climbed by the inclusion of each of the 11 very highly expressed proteins.

View larger version (13K):
[in this window]
[in a new window]
|
FIG. 6.
Effect of highly abundant proteins on Pearson product
moment correlation coefficient for mRNA and protein abundance in yeast.
The set of 106 genes was ranked according to protein abundance, and the
correlation value was calculated by including the 40 lowest-abundance
genes and then progressively including the remaining 66 genes in order
of abundance. The correlation value climbs as the final 11 highly
abundant proteins are included.
|
|
Correlation of protein and mRNA expression levels with codon
bias.
Codon bias is the propensity for a gene to utilize the same
codon to encode an amino acid even though other codons would insert the
identical amino acid in the growing polypeptide sequence. It is further
thought that highly expressed proteins have large codon biases
(3). To assess the value of codon bias for predicting mRNA
and protein levels in exponentially growing yeast cells, we plotted the
two experimental sets of data versus the codon bias (Fig.
7). The distribution patterns for both
mRNA and protein levels with respect to codon bias were highly similar.
There was high variability in the data within the codon bias range of
0.8 to 1.0. Although a large codon bias generally resulted in higher protein and message expression levels, codon bias did not appear to be
predictive of either protein levels or mRNA levels in the cell.

View larger version (12K):
[in this window]
[in a new window]
|
FIG. 7.
Relationship between codon bias and protein and mRNA
levels in this study. Yeast mRNA and protein expression levels were
calculated as described in Materials and Methods. The data represent
the same 106 genes as in Fig. 5.
|
|
 |
DISCUSSION |
The desired end point for the description of a biological system
is not the analysis of mRNA transcript levels alone but also the
accurate measurement of protein expression levels and their respective
activities. Quantitative analysis of global mRNA levels currently is a
preferred method for the analysis of the state of cells and tissues
(11). Several methods which either provide absolute mRNA
abundance (34, 35) or relative mRNA levels in comparative
analyses (20, 27) have been described elsewhere. The
techniques are fast and exquisitely sensitive and can provide mRNA
abundance for potentially any expressed gene. Measured mRNA levels are
often implicitly or explicitly extrapolated to indicate the levels of
activity of the corresponding protein in the cell. Quantitative
analysis of protein expression levels (proteome analysis) is much more
time-consuming because proteins are analyzed sequentially one by one
and is not general because analyses are limited to the relatively
highly expressed proteins. Proteome analysis does, however, provide
types of data that are of critical importance for the description of
the state of a biological system and that are not readily apparent from
the sequence and the level of expression of the mRNA transcript. This
study attempts to examine the relationship between mRNA and protein
expression levels for a large number of expressed genes in cells
representing the same state.
Limits in the sensitivity of current protein analysis technology
precluded a completely random sampling of yeast proteins. We therefore
based the study on those proteins visible by silver staining on a 2D
gel. Of the more than 1,000 visible spots, 156 were chosen to include
the entire range of molecular weights, isoelectric focusing points, and
staining intensities displayed on the 2D protein pattern. The genes
identified in this study shared a number of properties. First, all of
the proteins in this study had a codon bias of greater than 0.1 and
93% were greater than 0.2 (Fig. 4B). Second, with few exceptions, the
proteins in this study had long predicted half-lives according to the
N-end rule (Fig. 4C). Third, low-abundance proteins with regulatory functions such as transcription factors or protein kinases were not identified.
Because the population of proteins used in this study appears to be
fairly homogeneous with respect to predicted half-life and codon bias,
it might be expected that the correlation of the mRNA and protein
expression levels would be stronger for this population than for a
random sample of yeast proteins. We tested this assumption by
evaluating the correlation value if different subsets of the available
data were included in the calculation. The 106 proteins were ranked
from lowest to highest protein expression level, and the trend in the
correlation value was evaluated by progressively including more of the
higher-abundance proteins in the calculation (Fig. 6). The correlation
value when only the lower-abundance 40 to 93 proteins were examined was
consistently between 0.1 and 0.4. If the 11 most abundant proteins were
included, the correlation steadily increased to 0.94. We therefore
expect that the correlation for all yeast proteins or for a random
selection would be less than 0.4. The observed level of correlation
between mRNA and protein expression levels suggests the importance of posttranslational mechanisms controlling gene expression. Such mechanisms include translational control (15) and control of protein half-life (33). Since these mechanisms are also
active in higher eukaryotic cells, we speculate that there is no
predictive correlation between steady-state levels of mRNA and those of
protein in mammalian cells.
Like other large-scale analyses, the present study has several
potential sources of error related to the methods used to determine mRNA and protein expression levels. The mRNA levels were calculated from frequency tables of SAGE data. This method is highly quantitative because it is based on actual sequencing of unique tags from each gene,
and the number of times that a tag is represented is proportional to
the number of mRNA molecules for a specific gene. This method has some
limitations including the following: (i) the magnitude of the error in
the measurement of mRNA levels is inversely proportional to the mRNA
levels, (ii) SAGE tags from highly similar genes may not be
distinguished and therefore are summed, (iii) some SAGE tags are from
sequences in the 3' untranslated region of the transcript, (iv)
incomplete cleavage at the SAGE tag site by the restriction enzyme can
result in two tags representing one mRNA, and (v) some transcripts
actually do not generate a SAGE tag (34, 35).
For the SAGE method, the error associated with a value increases with a
decreasing number of transcripts per cell. The conclusions drawn from
this study are dependent on the quality of the mRNA levels from
previously published data (35). Since more than 65% of the
mRNA levels included in this study were calculated to 10 copies/cell or
less (40% were less than 4 copies/cell), the error associated with
these values may be quite large. The mRNA levels were calculated from
more than 20,000 transcripts. Assuming that the estimate of 15,000 mRNA
molecules per cell is correct (16), this would mean that
mRNA transcripts present at only a single copy per cell would be
detected 72% of the time (35). The mRNA levels for each
gene were carefully scrutinized, and only mRNA levels for which a high
degree of confidence existed were included in the correlation value.
Protein abundance was determined by metabolic radiolabeling with
[35S]methionine. The calculation required knowledge of
three variables: the number of methionines in the mature protein, the
radioactivity contained in the protein, and the specific activity of
the radiolabel normalized per methionine. The number of methionines per
protein was determined from the amino acid sequence of the proteins
identified by tandem mass spectrometry. For some proteins, it was not
known whether the methionine of the nascent polypeptide was processed away. The N termini of those proteins were predicted based on the
specificity of methionine aminopeptidase (31). If the
N-terminal processing did not conform to the predicted specificity of
processing enzymes, the calculation of the number of methionines would
be affected. This discrepancy would affect most the quantitation of a
protein with a very low number of methionines. The average number of
calculated methionines per protein in this study was 7.2. We therefore
expect the potential for erroneous protein quantitation due to unusual
N-terminal processing to be small.
The amount of radioactivity contained in a single spot might be the sum
of the radioactivity of comigrating proteins. Because protein
identification was based on tandem mass spectrometric techniques,
comigrating proteins could be identified. However, comigrating proteins
were rarely detected in this study, most likely because relatively
small amounts of total protein (40 µg) were initially loaded onto the
gels, which resulted in highly focused spots containing generally 1 to
25 ng of protein. Because of the relatively small amount loaded, the
concentrations of any potentially comigrating protein would likely be
below the limit of detection of the mass spectrometry technique used in
this study (1 to 5 ng) and below the limit of visualization by silver
staining (1 to 5 ng). In the overwhelming majority of the samples
analyzed, numerous peptides from a single protein were detected. It is
assumed that any comigrating proteins were at levels too low to be
detected and that their influence in the calculation would be small.
The specific activity of the radiolabel was determined by relating the
precise amount of protein present in selected spots of a parallel gel,
as determined by quantitative amino acid composition analysis, to the
number of methionines present in the sequence of those proteins and the
radioactivity determined by liquid scintillation counting. It is
possible that the resulting number might be influenced by unavoidable
losses inherent in the amino acid analysis procedure applied. Because
four different proteins were utilized in the calculation and the
experiment was done in duplicate, the specific activity calculated is
thought to be highly accurate. Indeed, the specific activities
calculated for each of the four proteins varied by less than 10%. Any
inconsistencies in the calculation of the specific activity would
result in differences in the absolute levels calculated but not in the
relative numbers and would therefore not influence the correlation
value determined.
The protein quantitative method used eliminates a number of potential
errors inherent in previous methods for the quantitation of proteins
separated by 2DE, such as preferential protein staining and bias caused
by inequalities in the number of radiolabeled residues per protein. Any
2D gel-based method of quantitation is complicated by the fact that in
some cases the translation products of the same mRNA migrated to
different spots. One major reason is posttranslational modification or
processing of the protein. Also, artifactual proteolysis during cell
lysis and sample preparation can lead to multiple resolved forms of the
protein. In such cases, the protein levels of spots coded for by the
same mRNA were pooled. In addition, the existence of other spots coded for by the same mRNA that were not analyzed by mass spectrometry or
that were below the limit of detection for silver staining cannot be
ruled out. However, since this study is based on a class of highly
expressed proteins, the presence of undetected minor spots below silver
staining sensitivity corresponding to a protein analyzed in the study
would generally cause a relatively small error in protein quantitation.
Codon bias is a measure of the propensity of an organism to selectively
utilize certain codons which result in the incorporation of the same
amino acid residue in a growing polypeptide chain. There are 61 possible codons that code for 20 amino acids. The larger the codon bias
value, the smaller the number of codons that are used to encode the
protein (19). It is thought that codon bias is a measure of
protein abundance because highly expressed proteins generally have
large codon bias values (3, 13).
Nearly all of the most highly expressed proteins had codon bias values
of greater than 0.8. However, we detected a number of genes with high
codon bias and relative low protein abundance (Fig. 7). For example,
the expressed gene with both the second largest protein and mRNA levels
in the study was ENO2_YEAST (775,000 and 289.1 copies/cell,
respectively). ENO1_YEAST was also present in the gel at much lower
protein and mRNA levels (44,200 and 0.7 copies/cell, respectively). The
codon bias values for ENO2 and ENO1 are similar (0.96 and 0.93, respectively), but the expression of the two genes is differentially
regulated. Specifically, ENO1_YEAST is glucose repressed (6)
and was therefore present in low abundance under the conditions used.
Other genes with large codon bias values that were not of high protein
abundance in the gel include EFT1, TIF1, HXK2, GSP1, EGD2, SHM2, and
TAL1. We conclude that merely determining the codon bias of a gene is
not sufficient to predict its protein expression level.
Interestingly, codon bias appears to be an excellent indicator of the
boundaries of current 2D gel proteome analysis technology. There are
thousands of genes with expressed mRNA and likely expressed protein
with codon bias values less than 0.1 (Fig. 4A). In this study, we
detected none of them, and only a very small percentage of the genes
detected in this study had codon bias values between 0.1 and 0.2 (Fig.
4B). Indeed, in every examined yeast proteome study (5, 7, 13,
28) where the combined total number of identified proteins is 300 to 400, this same observation is true. It is expected that for the more
complex cells of higher eukaryotic organisms the detection of
low-abundance proteins would be even more challenging than for yeast.
This indicates that highly abundant, long-lived proteins are
overwhelmingly detected in proteome studies. If proteome analysis is to
provide truly meaningful information about cellular processes, it must
be able to penetrate to the level of regulatory proteins, including
transcription factors and protein kinases. A promising approach is the
use of narrow-range focusing gels with immobilized pH gradients (IPG)
(23). This would allow for the loading of significantly more
protein per pH unit covered and also provide increased resolution of
proteins with similar electrophoretic mobilities. A standard pH
gradient in an isoelectric focusing gel covers a 7-pH-unit range (pH 3 to 10) over 18 cm. A narrow-range focusing gel might expand the range
to 0.5 pH units over 18 cm or more. This could potentially increase by
more than 10-fold the number of proteins that can be detected. Clearly,
current proteome technology is incapable of analyzing low-abundance
regulatory proteins without employing an enrichment method for
relatively low-abundance proteins. In conclusion, this study examined
the relationship between yeast protein and message levels and revealed
that transcript levels provide little predictive value with respect to
the extent of protein expression.
 |
ACKNOWLEDGMENTS |
This work was supported by the National Science Foundation
Science and Technology Center for Molecular Biotechnology, NIH grant
T32HG00035-3, and a grant from Oxford Glycosciences.
We thank Jimmy Eng for expert computer programming, Garry Corthals and
John R. Yates III for critical discussion, and Siavash Mohandesi for
expert technical help.
 |
FOOTNOTES |
*
Corresponding author. Mailing address: Department of
Molecular Biotechnology, Box 357730, University of Washington, Seattle, WA 98195-7730. Phone: (206) 221-4196. Fax: (206) 685-7301. E-mail: ruedi{at}u.washington.edu.
 |
REFERENCES |
| 1.
|
Aebersold, R. H.,
J. Leavitt,
R. A. Saavedra,
L. E. Hood, and S. B. Kent.
1987.
Internal amino acid sequence analysis of proteins separated by one- or two-dimensional gel electrophoresis after in situ protease digestion on nitrocellulose.
Proc. Natl. Acad. Sci. USA
84:6970-6974[Abstract/Free Full Text].
|
| 2.
|
Aebersold, R. H.,
D. B. Teplow,
L. E. Hood, and S. B. Kent.
1986.
Electroblotting onto activated glass. High efficiency preparation of proteins from analytical sodium dodecyl sulfate-polyacrylamide gels for direct sequence analysis.
Eur. J. Biochem.
261:4229-4238.
|
| 3.
|
Bennetzen, J. L., and B. D. Hall.
1982.
Codon selection in yeast.
J. Biol. Chem.
257:3026-3031[Abstract/Free Full Text].
|
| 4.
|
Boucherie, H.,
G. Dujardin,
M. Kermorgant,
C. Monribot,
P. Slonimski, and M. Perrot.
1995.
Two-dimensional protein map of Saccharomyces cerevisiae: construction of a gene-protein index.
Yeast
11:601-613[Medline].
|
| 5.
|
Boucherie, H.,
F. Sagliocco,
R. Joubert,
I. Maillet,
J. Labarre, and M. Perrot.
1996.
Two-dimensional gel protein database of Saccharomyces cerevisiae.
Electrophoresis
17:1683-1699[Medline].
|
| 6.
|
Carmen, A. A.,
P. K. Brindle,
C. S. Park, and M. J. Holland.
1995.
Transcriptional regulation by an upstream repression sequence from the yeast enolase gene ENO1.
Yeast
11:1031-1043[Medline].
|
| 7.
|
Ducret, A.,
I. VanOostveen,
J. K. Eng,
J. R. Yates, and R. Aebersold.
1998.
High throughput protein characterization by automated reverse-phase chromatography/electrospray tandem mass spectrometry.
Protein Sci.
7:706-719[Abstract].
|
| 8.
|
Eng, J.,
A. McCormack, and J. R. Yates.
1994.
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.
J. Am. Soc. Mass Spectrom.
5:976-989.
|
| 9.
|
Figeys, D.,
A. Ducret,
J. R. Yates, and R. Aebersold.
1996.
Protein identification by solid phase microextraction-capillary zone electrophoresis-microelectrospray-tandem mass spectrometry.
Nat. Biotechnol.
14:1579-1583[Medline].
|
| 10.
|
Figeys, D.,
I. VanOostveen,
A. Ducret, and R. Aebersold.
1996.
Protein identification by capillary zone electrophoresis/microelectrospray ionization-tandem mass spectrometry at the subfemtomole level.
Anal. Chem.
68:1822-1828[Medline].
|
| 11.
|
Fraser, C. M., and R. D. Fleischmann.
1997.
Strategies for whole microbial genome sequencing and analysis.
Electrophoresis
18:1207-1216[Medline].
|
| 12.
|
Garrels, J. I.,
B. Futcher,
R. Kobayashi,
G. I. Latter,
B. Schwender,
T. Volpe,
J. R. Warner, and C. S. McLaughlin.
1994.
Protein identifications for a Saccharomyces cerevisiae protein database.
Electrophoresis
15:1466-1486[Medline].
|
| 13.
|
Garrels, J. I.,
C. S. McLaughlin,
J. R. Warner,
B. Futcher,
G. I. Latter,
R. Kobayashi,
B. Schwender,
T. Volpe,
D. S. Anderson,
F. Mesquita-Fuentes, and W. E. Payne.
1997.
Proteome studies of Saccharomyces cerevisiae: identification and characterization of abundant proteins.
Electrophoresis
18:1347-1360[Medline].
|
| 14.
|
Gygi, S. P., and R. Aebersold.
1998.
Absolute quantitation of 2-DE protein spots, p. 417-421.
In
A. J. Link (ed.), 2-D protocols for proteome analysis. Humana Press, Totowa, N.J.
|
| 15.
|
Harford, J. B., and D. R. Morris.
1997.
Post-transcriptional gene regulation.
Wiley-Liss, Inc., New York, N.Y.
|
| 16.
|
Hereford, L. M., and M. Rosbash.
1977.
Number and distribution of polyadenylated RNA sequences in yeast.
Cell
10:453-462[Medline].
|
| 17.
|
Hodges, P. E.,
W. E. Payne, and J. I. Garrels.
1998.
The Yeast Protein Database (YPD): a curated proteome database for Saccharomyces cerevisiae.
Nucleic Acids Res.
26:68-72[Abstract/Free Full Text].
|
| 18.
|
Klose, J., and U. Kobalz.
1995.
Two-dimensional electrophoresis of proteins: an updated protocol and implications for a functional analysis of the genome.
Electrophoresis
16:1034-1059[Medline].
|
| 19.
|
Kurland, C. G.
1991.
Codon bias and gene expression.
FEBS Lett.
285:165-169[Medline].
|
| 20.
|
Lashkari, D. A.,
J. L. DeRisi,
J. H. McCusker,
A. F. Namath,
C. Gentile,
S. Y. Hwang,
P. O. Brown, and R. W. Davis.
1997.
Yeast microarrays for genome wide parallel genetic and gene expression analysis.
Proc. Natl. Acad. Sci. USA
94:13057-13062[Abstract/Free Full Text].
|
| 21.
|
Liang, P., and A. B. Pardee.
1992.
Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction.
Science
257:967-971[Abstract/Free Full Text].
|
| 22.
|
Link, A. J.,
L. G. Hays,
E. B. Carmack, and J. R. Yates, III.
1997.
Identifying the major proteome components of Haemophilus influenzae type-strain NCTC 8143.
Electrophoresis
18:1314-1334[Medline].
|
| 23.
|
Nawrocki, A.,
M. R. Larsen,
A. V. Podtelejnikov,
O. N. Jensen,
M. Mann,
P. Roepstorff,
A. Gorg,
S. J. Fey, and P. M. Larsen.
1998.
Correlation of acidic and basic carrier ampholyte and immobilized pH gradient two-dimensional gel electrophoresis patterns based on mass spectrometric protein identification.
Electrophoresis
19:1024-1035[Medline].
|
| 24.
|
O'Farrell, P. H.
1975.
High resolution two-dimensional electrophoresis of proteins.
J. Biol. Chem.
250:4007-4021[Abstract/Free Full Text].
|
| 24a.
| OWL Protein Sequence Database. 2 August 1998, posting date. [Online.]
http://bmbsgi11.leeds.ac.uk/bmb5dp/owl.html. [8 January 1999, last
date accessed.]
|
| 25.
|
Patterson, S. D., and R. Aebersold.
1995.
Mass spectrometric approaches for the identification of gel-separated proteins.
Electrophoresis
16:1791-1814[Medline].
|
| 26.
|
Pennington, S. R.,
M. R. Wilkins,
D. F. Hochstrasser, and M. J. Dunn.
1997.
Proteome analysis: from protein characterization to biological function.
Trends Cell Biol.
7:168-173.
|
| 27.
|
Shalon, D.,
S. J. Smith, and P. O. Brown.
1996.
A DNA microarray system for analyzing complex DNA samples using two-color fluorescent probe hybridization.
Genome Res.
6:639-645[Abstract/Free Full Text].
|
| 28.
|
Shevchenko, A.,
O. N. Jensen,
A. V. Podtelejnikov,
F. Sagliocco,
M. Wilm,
O. Vorm,
P. Mortensen,
H. Boucherie, and M. Mann.
1996.
Linking genome and proteome by mass spectrometry: large-scale identification of yeast proteins from two dimensional gels.
Proc. Natl. Acad. Sci. USA
93:14440-14445[Abstract/Free Full Text].
|
| 29.
|
Shevchenko, A.,
M. Wilm,
O. Vorm, and M. Mann.
1996.
Mass spectrometric sequencing of proteins from silver-stained polyacrylamide gels.
Anal. Chem.
68:850-858[Medline].
|
| 30.
|
Sikorski, R. S., and P. Hieter.
1989.
A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae.
Genetics
122:19-27[Abstract/Free Full Text].
|
| 31.
|
Tsunasawa, S.,
J. W. Stewart, and F. Sherman.
1985.
Amino-terminal processing of mutant forms of yeast iso-1-cytochrome c. The specificities of methionine aminopeptidase and acetyltransferase.
J. Biol. Chem.
260:5382-5391[Abstract/Free Full Text].
|
| 32.
|
Urlinger, S.,
K. Kuchler,
T. H. Meyer,
S. Uebel, and R. Tamp'e.
1997.
Intracellular location, complex formation, and function of the transporter associated with antigen processing in yeast.
Eur. J. Biochem.
245:266-272[Medline].
|
| 33.
|
Varshavsky, A.
1996.
The N-end rule: functions, mysteries, uses.
Proc. Natl. Acad. Sci. USA
93:12142-12149[Abstract/Free Full Text].
|
| 34.
|
Velculescu, V. E.,
L. Zhang,
B. Vogelstein, and K. W. Kinzler.
1995.
Serial analysis of gene expression.
Science
270:484-487[Abstract/Free Full Text].
|
| 35.
|
Velculescu, V. E.,
L. Zhang,
W. Zhou,
J. Vogelstein,
M. A. Basrai,
D. E. Bassett, Jr.,
P. Hieter,
B. Vogelstein, and K. W. Kinzler.
1997.
Characterization of the yeast transcriptome.
Cell
88:243-251[Medline].
|
| 36.
|
Wilkins, M. R.,
K. L. Williams,
R. D. Appel, and D. F. Hochstrasser.
1997.
Proteome research: new frontiers in functional genomics.
Springer-Verlag, Berlin, Germany.
|
| 37.
|
Wilm, M.,
A. Shevchenko,
T. Houthaeve,
S. Breit,
L. Schweigerer,
T. Fotsis, and M. Mann.
1996.
Femtomole sequencing of proteins from polyacrylamide gels by nano-electrospray mass spectrometry.
Nature
379:466-469[Medline].
|
| 38.
|
Yan, J. X.,
M. R. Wilkins,
K. Ou,
A. A. Gooley,
K. L. Williams,
J. C. Sanchez,
O. Golaz,
C. Pasquali, and D. F. Hochstrasser.
1996.
Large-scale amino-acid analysis for proteome studies.
J. Chromatogr. A
736:291-302[Medline].
|
| 39.
| YPD Website. 6 March 1998, revision date.
[Online.] Proteome, Inc. http://www.proteome.com/YPDhome.html.
[8 January 1999, last date accessed.]
|
Molecular and Cellular Biology, March 1999, p. 1720-1730, Vol. 19, No. 3
0270-7306/99/$04.00+0
Copyright © 1999, American Society for Microbiology. All rights reserved.
This article has been cited by other articles:
-
Lee, Y. H., Kim, J. H., Bang, I. S., Park, Y. K.
(2008). The Membrane-Bound Transcriptional Regulator CadC Is Activated by Proteolytic Cleavage in Response to Acid Stress. J. Bacteriol.
190: 5120-5126
[Abstract]
[Full Text]
-
Smith, S. C., Smith, E. C., Gilman, M. L., Anderson, J. L., Taylor, R. L. Jr.
(2008). Differentially Expressed Soluble Proteins in Aortic Cells from Atherosclerosis-Susceptible and Resistant Pigeons. Poult. Sci.
87: 1328-1334
[Abstract]
[Full Text]
-
Lee, C. P., Eubel, H., O'Toole, N., Millar, A. H.
(2008). Heterogeneity of the Mitochondrial Proteome for Photosynthetic and Non-photosynthetic Arabidopsis Metabolism. Mol. Cell. Proteomics
7: 1297-1316
[Abstract]
[Full Text]
-
Hou, Q., Wu, Y. H., Grabsch, H., Zhu, Y., Leong, S. H., Ganesan, K., Cross, D., Tan, L. K., Tao, J., Gopalakrishnan, V., Tang, B. L., Kon, O. L., Tan, P.
(2008). Integrative Genomics Identifies RAB23 as an Invasion Mediator Gene in Diffuse-Type Gastric Cancer. Cancer Res.
68: 4623-4630
[Abstract]
[Full Text]
-
Kiss, N B, Geli, J, Lundberg, F, Avci, C, Velazquez-Fernandez, D, Hashemi, J, Weber, G, Hoog, A, Ekstrom, T J, Backdahl, M, Larsson, C
(2008). Methylation of the p16INK4A promoter is associated with malignant behavior in abdominal extra-adrenal paragangliomas but not pheochromocytomas. Endocr Relat Cancer
15: 609-621
[Abstract]
[Full Text]
-
Hughes, J. L., Nolder, C. L., Nowalk, A. J., Clifton, D. R., Howison, R. R., Schmit, V. L., Gilmore, R. D. Jr., Carroll, J. A.
(2008). Borrelia burgdorferi Surface-Localized Proteins Expressed during Persistent Murine Infection Are Conserved among Diverse Borrelia spp.. Infect. Immun.
76: 2498-2511
[Abstract]
[Full Text]
-
Kumaraswamy, S., Chinnaiyan, P., Shankavaram, U. T., Lu, X., Camphausen, K., Tofilon, P. J.
(2008). Radiation-Induced Gene Translation Profiles Reveal Tumor Type and Cancer-Specific Components. Cancer Res.
68: 3819-3826
[Abstract]
[Full Text]
-
Cornish, E. J., Hurtgen, B. J., McInnerney, K., Burritt, N. L., Taylor, R. M., Jarvis, J. N., Wang, S. Y., Burritt, J. B.
(2008). Reduced Nicotinamide Adenine Dinucleotide Phosphate Oxidase-Independent Resistance to Aspergillus fumigatus in Alveolar Macrophages. J. Immunol.
180: 6854-6867
[Abstract]
[Full Text]
-
Cnops, L., Hu, T.-T., Burnat, K., Arckens, L.
(2008). Influence of Binocular Competition on the Expression Profiles of CRMP2, CRMP4, Dyn I, and Syt I in Developing Cat Visual Cortex. Cereb Cortex
18: 1221-1231
[Abstract]
[Full Text]
-
Bhati, R., Patterson, C., Livasy, C. A., Fan, C., Ketelsen, D., Hu, Z., Reynolds, E., Tanner, C., Moore, D. T., Gabrielli, F., Perou, C. M., Klauber-DeMore, N.
(2008). Molecular Characterization of Human Breast Tumor Vascular Cells. Am. J. Pathol.
172: 1381-1390
[Abstract]
[Full Text]
-
Weinzierl, A. O., Maurer, D., Altenberend, F., Schneiderhan-Marra, N., Klingel, K., Schoor, O., Wernet, D., Joos, T., Rammensee, H.-G., Stevanovic, S.
(2008). A Cryptic Vascular Endothelial Growth Factor T-Cell Epitope: Identification and Characterization by Mass Spectrometry and T-Cell Assays. Cancer Res.
68: 2447-2454
[Abstract]
[Full Text]
-
Zheng, X., Hong, L., Shi, L., Guo, J., Sun, Z., Zhou, J.
(2008). Proteomics Analysis of Host Cells Infected with Infectious Bursal Disease Virus. Mol. Cell. Proteomics
7: 612-625
[Abstract]
[Full Text]
-
Kassie, F., Anderson, L. B., Higgins, L., Pan, Y., Matise, I., Negia, M., Upadhyaya, P., Wang, M., Hecht, S. S.
(2008). Chemopreventive agents modulate the protein expression profile of 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone plus benzo[a]pyrene-induced lung tumors in A/J mice. Carcinogenesis
29: 610-619
[Abstract]
[Full Text]
-
Stylianou, I. M., Affourtit, J. P., Shockley, K. R., Wilpan, R. Y., Abdi, F. A., Bhardwaj, S., Rollins, J., Churchill, G. A, Paigen, B.
(2008). Applying Gene Expression, Proteomics and Single-Nucleotide Polymorphism Analysis for Complex Trait Gene Identification. Genetics
178: 1795-1805
[Abstract]
[Full Text]
-
Querin, L., Sanvito, R., Magni, F., Busti, S., Van Dorsselaer, A., Alberghina, L., Vanoni, M.
(2008). Proteomic Analysis of a Nutritional Shift-up in Saccharomyces cerevisiae Identifies Gvp36 as a BAR-containing Protein Involved in Vesicular Traffic and Nutritional Adaptation. J. Biol. Chem.
283: 4730-4743
[Abstract]
[Full Text]
-
Morris, J. S., Clark, B. N., Gutstein, H. B.
(2008). Pinnacle: a fast, automatic and accurate method for detecting and quantifying protein spots in 2-dimensional gel electrophoresis data. Bioinformatics
24: 529-536
[Abstract]
[Full Text]
-
Nelson, M. M., Jones, A. R., Carmen, J. C., Sinai, A. P., Burchmore, R., Wastling, J. M.
(2008). Modulation of the Host Cell Proteome by the Intracellular Apicomplexan Parasite Toxoplasma gondii. Infect. Immun.
76: 828-844
[Abstract]
[Full Text]
-
Hojlund, K., Yi, Z., Hwang, H., Bowen, B., Lefort, N., Flynn, C. R., Langlais, P., Weintraub, S. T., Mandarino, L. J.
(2008). Characterization of the Human Skeletal Muscle Proteome by One-dimensional Gel Electrophoresis and HPLC-ESI-MS/MS. Mol. Cell. Proteomics
7: 257-267
[Abstract]
[Full Text]
-
Shahpiri, A., Svensson, B., Finnie, C.
(2008). The NADPH-Dependent Thioredoxin Reductase/Thioredoxin System in Germinating Barley Seeds: Gene Expression, Protein Profiles, and Interactions between Isoforms of Thioredoxin h and Thioredoxin Reductase. Plant Physiol.
146: 789-799
[Abstract]
[Full Text]
-
Ortega-Paino, E., Fransson, J., Ek, S., Borrebaeck, C. A. K.
(2008). Functionally associated targets in mantle cell lymphoma as defined by DNA microarrays and RNA interference. Blood
111: 1617-1624
[Abstract]
[Full Text]
-
Pandey, A., Chakraborty, S., Datta, A., Chakraborty, N.
(2008). Proteomics Approach to Identify Dehydration Responsive Nuclear Proteins from Chickpea (Cicer arietinum L.). Mol. Cell. Proteomics
7: 88-107
[Abstract]
[Full Text]
-
Qi, T, Han, J, Cui, Y, Zong, M, Liu, X, Zhu, B
(2008). Comparative proteomic analysis for the detection of biomarkers in pancreatic ductal adenocarcinomas. J. Clin. Pathol.
61: 49-58
[Abstract]
[Full Text]
-
MARGOLIN, A. A., CALIFANO, A.
(2007). Theory and Limitations of Genetic Network Inference from Microarray Data. Ann. N. Y. Acad. Sci.
1115: 51-72
[Abstract]
[Full Text]
-
OLAUSSON, P., JENTSCH, J. D., KRUEGER, D. D., TRONSON, N. C., NAIRN, A. C., TAYLOR, J. R.
(2007). Orbitofrontal Cortex and Cognitive-Motivational Impairments in Psychostimulant Addiction: Evidence from Experiments in the Non-human Primate. Ann. N. Y. Acad. Sci.
1121: 610-638
[Abstract]
[Full Text]
-
D'Hertog, W., Overbergh, L., Lage, K., Ferreira, G. B., Maris, M., Gysemans, C., Flamez, D., Cardozo, A. K., Van den Bergh, G., Schoofs, L., Arckens, L., Moreau, Y., Hansen, D. A., Eizirik, D. L., Waelkens, E., Mathieu, C.
(2007). Proteomics Analysis of Cytokine-induced Dysfunction and Death in Insulin-producing INS-1E Cells: New Insights into the Pathways Involved. Mol. Cell. Proteomics
6: 2180-2199
[Abstract]
[Full Text]
-
Gilchrist, M. A.
(2007). Combining Models of Protein Translation and Population Genetics to Predict Protein Production Rates from Codon Usage Patterns. Mol Biol Evol
24: 2362-2372
[Abstract]
[Full Text]
-
Jiang, Y., Yang, B., Harris, N. S., Deyholos, M. K.
(2007). Comparative proteomic analysis of NaCl stress-responsive proteins in Arabidopsis roots. J Exp Bot
0: erm207v1-erm207
[Abstract]
[Full Text]
-
Pigazzi, M., Ricotti, E., Germano, G., Faggian, D., Arico, M., Basso, G.
(2007). cAMP response element binding protein (CREB) overexpression CREB has been described as critical for leukemia progression. haematol
92: 1435-1437
[Abstract]
[Full Text]
-
Schmid, A. K., Reiss, D. J., Kaur, A., Pan, M., King, N., Van, P. T., Hohmann, L., Martin, D. B., Baliga, N. S.
(2007). The anatomy of microbial cell state transitions in response to oxygen. Genome Res
17: 1399-1413
[Abstract]
[Full Text]
-
Kong, L., Yu, X.-P., Bai, X.-H., Zhang, W.-F., Zhang, Y., Zhao, W.-M., Jia, J.-H., Tang, W., Zhou, Y.-B., Liu, C.-j.
(2007). RbAp48 Is a Critical Mediator Controlling the Transforming Activity of Human Papillomavirus Type 16 in Cervical Cancer. J. Biol. Chem.
282: 26381-26391
[Abstract]
[Full Text]
-
Wu, L., Hwang, S.-I., Rezaul, K., Lu, L. J., Mayya, V., Gerstein, M., Eng, J. K., Lundgren, D. H., Han, D. K.
(2007). Global Survey of Human T Leukemic Cells by Integrating Proteomics and Transcriptomics Profiling. Mol. Cell. Proteomics
6: 1343-1353
[Abstract]
[Full Text]
-
Kasper, G., Glaeser, J. D., Geissler, S., Ode, A., Tuischer, J., Matziolis, G., Perka, C., Duda, G. N.
(2007). Matrix Metalloprotease Activity Is an Essential Link Between Mechanical Stimulus and Mesenchymal Stem Cell Behavior. Stem Cells
25: 1985-1994
[Abstract]
[Full Text]
-
Maxwell, K. L., Frappier, L.
(2007). Viral Proteomics. Microbiol. Mol. Biol. Rev.
71: 398-411
[Abstract]
[Full Text]
-
Lane, C. S., Wang, Y., Betts, R., Griffiths, W. J., Patterson, L. H.
(2007). Comparative Cytochrome P450 Proteomics in the Livers of Immunodeficient Mice Using 18O Stable Isotope Labeling. Mol. Cell. Proteomics
6: 953-962
[Abstract]
[Full Text]
-
Luo, Q., Siconolfi-Baez, L., Annamaneni, P., Bielawski, M. T., Novikoff, P. M., Angeletti, R. H.
(2007). Altered protein expression at early-stage rat hepatic neoplasia. Am. J. Physiol. Gastrointest. Liver Physiol.
292: G1272-G1282
[Abstract]
[Full Text]
-
O'Hara, P., Slabas, A. R., Fawcett, T.
(2007). Antisense Expression of 3-Oxoacyl-ACP Reductase Affects Whole Plant Productivity and Causes Collateral Changes in Activity of Fatty Acid Synthase Components. Plant Cell Physiol
48: 736-744
[Abstract]
[Full Text]
-
Clerk, A., Kemp, T. J., Zoumpoulidou, G., Sugden, P. H.
(2007). Cardiac myocyte gene expression profiling during H2O2-induced apoptosis. Physiol. Genomics
29: 118-127
[Abstract]
[Full Text]
-
Komori, N., Takemori, N., Kim, H. K., Singh, A., Hwang, S.-H., Foreman, R. D., Chung, K., Chung, J. M., Matsumoto, H.
(2007). Proteomics study of neuropathic and nonneuropathic dorsal root ganglia: altered protein regulation following segmental spinal nerve ligation injury. Physiol. Genomics
29: 215-230
[Abstract]
[Full Text]
-
Facciotti, M. T., Reiss, D. J., Pan, M., Kaur, A., Vuthoori, M., Bonneau, R., Shannon, P., Srivastava, A., Donohoe, S. M., Hood, L. E., Baliga, N. S.
(2007). General transcription factor specified global gene regulation in archaea. Proc. Natl. Acad. Sci. USA
104: 4630-4635
[Abstract]
[Full Text]
-
Del Prete, M. J., Vernal, R., Dolznig, H., Mullner, E. W., Garcia-Sanz, J. A.
(2007). Isolation of polysome-bound mRNA from solid tissues amenable for RT-PCR and profiling experiments. RNA
13: 414-421
[Abstract]
[Full Text]
-
Johri, A. K., Margarit, I., Broenstrup, M., Brettoni, C., Hua, L., Gygi, S. P., Telford, J. L., Grandi, G., Paoletti, L. C.
(2007). Transcriptional and Proteomic Profiles of Group B Streptococcus Type V Reveal Potential Adherence Proteins Associated with High-Level Invasion. Infect. Immun.
75: 1473-1483
[Abstract]
[Full Text]
-
LeBaron, M. J., Ahonen, T. J., Nevalainen, M. T., Rui, H.
(2007). In Vivo Response-Based Identification of Direct Hormone Target Cell Populations Using High-Density Tissue Arrays. Endocrinology
148: 989-1008
[Abstract]
[Full Text]
-
Fernandez-Arenas, E., Cabezon, V., Bermejo, C., Arroyo, J., Nombela, C., Diez-Orejas, R., Gil, C.
(2007). Integrated Proteomics and Genomics Strategies Bring New Insight into Candida albicans Response upon Macrophage Interaction. Mol. Cell. Proteomics
6: 460-478
[Abstract]
[Full Text]
-
Mechin, V., Thevenot, C., Le Guilloux, M., Prioul, J.-L., Damerval, C.
(2007). Developmental Analysis of Maize Endosperm Proteome Suggests a Pivotal Role for Pyruvate Orthophosphate Dikinase. Plant Physiol.
143: 1203-1219
[Abstract]
[Full Text]
-
Frey, I. M., Rubio-Aliaga, I., Siewert, A., Sailer, D., Drobyshev, A., Beckers, J., de Angelis, M. H., Aubert, J., Hen, A. B., Fiehn, O., Eichinger, H. M., Daniel, H.
(2007). Profiling at mRNA, protein, and metabolite levels reveals alterations in renal amino acid handling and glutathione metabolism in kidney tissue of Pept2-/- mice. Physiol. Genomics
28: 301-310
[Abstract]
[Full Text]
-
Okano, T., Kondo, T., Fujii, K., Nishimura, T., Takano, T., Ohe, Y., Tsuta, K., Matsuno, Y., Gemma, A., Kato, H., Kudoh, S., Hirohashi, S.
(2007). Proteomic Signature Corresponding to the Response to Gefitinib (Iressa, ZD1839), an Epidermal Growth Factor Receptor Tyrosine Kinase Inhibitor in Lung Adenocarcinoma. Clin. Cancer Res.
13: 799-805
[Abstract]
[Full Text]
-
Janech, M. G., Raymond, J. R., Arthur, J. M.
(2007). Proteomics in renal research. Am. J. Physiol. Renal Physiol.
292: F501-F512
[Abstract]
[Full Text]
-
Bogeat-Triboulot, M.-B., Brosche, M., Renaut, J., Jouve, L., Le Thiec, D., Fayyaz, P., Vinocur, B., Witters, E., Laukens, K., Teichmann, T., Altman, A., Hausman, J.-F., Polle, A., Kangasjarvi, J., Dreyer, E.
(2007). Gradual Soil Water Depletion Results in Reversible Changes of Gene Expression, Protein Profiles, Ecophysiology, and Growth Performance in Populus euphratica, a Poplar Growing in Arid Regions. Plant Physiol.
143: 876-892
[Abstract]
[Full Text]
-
McCarthy, F. M., Bridges, S. M., Wang, N., Magee, G. B., Williams, W. P., Luthe, D. S., Burgess, S. C.
(2007). AgBase: a unified resource for functional analysis in agriculture. Nucleic Acids Res
35: D599-D603
[Abstract]
[Full Text]
-
Weinzierl, A. O., Lemmel, C., Schoor, O., Muller, M., Kruger, T., Wernet, D., Hennenlotter, J., Stenzl, A., Klingel, K., Rammensee, H.-G., Stevanovic, S.
(2007). Distorted Relation between mRNA Copy Number and Corresponding Major Histocompatibility Complex Ligand Density on the Cell Surface. Mol. Cell. Proteomics
6: 102-113
[Abstract]
[Full Text]
-
Sun, B., Ranish, J. A., Utleg, A. G., White, J. T., Yan, X., Lin, B., Hood, L.
(2007). Shotgun Glycopeptide Capture Approach Coupled with Mass Spectrometry for Comprehensive Glycoproteomics. Mol. Cell. Proteomics
6: 141-149
[Abstract]
[Full Text]
-
Gnatenko, D. V., Perrotta, P. L., Bahou, W. F.
(2006). Proteomic approaches to dissect platelet function: half the story. Blood
108: 3983-3991
[Abstract]
[Full Text]
-
Waters, K. M., Pounds, J. G., Thrall, B. D.
(2006). Data merging for integrated microarray and proteomic analysis. Brief Funct Genomic Proteomic
5: 261-272
[Abstract]
[Full Text]
-
Kasinath, B. S., Mariappan, M. M., Sataranatarajan, K., Lee, M. J., Feliers, D.
(2006). mRNA Translation: Unexplored Territory in Renal Science. J. Am. Soc. Nephrol.
17: 3281-3292
[Abstract]
[Full Text]
-
Nie, L., Wu, G., Zhang, W.
(2006). Correlation of mRNA Expression and Protein Abundance Affected by Multiple Sequence Features Related to Translational Efficiency in Desulfovibrio vulgaris: A Quantitative Analysis. Genetics
174: 2229-2243
[Abstract]