- Research Article
- Open Access
Limited Dynamic Range of Immune Response Gene Expression Observed in Healthy Blood Donors Using RT-PCR
Molecular Medicine volume 12, pages 185–195 (2006)
The use of quantitative gene expression analysis for the diagnosis, prognosis, and monitoring of disease requires the ability to distinguish pathophysiological changes from natural variations. To characterize these variations in apparently healthy subjects, quantitative real-time PCR was used to measure various immune response genes in whole blood collected from blood bank donors. In a single-time-point study of 131 donors, of 48 target genes, 43 were consistently expressed and 34 followed approximately log-normal distribution. Most transcripts showed a limited dynamic range of expression across subjects. Specifically, 36 genes had standard deviations (SDs) of 0.44 to 0.79 cycle threshold (CT) units, corresponding to less than a 3-fold variation in expression. Separately, a longitudinal study of 8 healthy individuals demonstrated a total dynamic range (> 2 standard error units) of 2- to 4-fold in most genes. In contrast, a study of whole blood gene expression in 6 volunteers injected with LPS showed 15 genes changing in expression 10- to 90-fold within 2 to 5 h and returning to within normal range within 21 hours. This work demonstrates that (1) the dynamic range of expression of many immune response genes is limited among healthy subjects; (2) expression levels for most genes analyzed are approximately log-normally distributed; and (3) individuals exposed to an infusion of bacterial endotoxin (lipopolysaccharide), show gene expression profiles that can be readily distinguished from those of a healthy population. These results suggest that normal reference ranges can be established for gene expression assays, providing critical standards for the diagnosis and management of disease.
Recent developments in gene and protein expression analysis technology have suggested that gene expression is a key indicator of an individual’s pathophysiologic status (1–4). Consequently, clinical application of gene expression technology will vastly improve on the current approaches for monitoring health and disease. Compelling associations between gene expression and disease have been demonstrated in many studies ranging from inflammatory disease to cancer. For instance, studies have pointed to abnormal gene expression in peripheral blood mononuclear cells in lupus patients compared with healthy controls (5,6). Other studies have found differences in gene expression patterns between cancerous liver or pancreatic tissue and nontumor liver and pancreatic tissues (7,8). Additionally, gene expression profiling of breast tumor biopsy tissue correlated with therapeutic response to treatment (9). Results from these studies demonstrate that measurements of gene expression can be used in the diagnosis and monitoring of disease. However, a key requirement for clinical application of gene expression technology is distinguishing between natural variations in gene expression among healthy subjects and changes associated with a disease condition. The establishment of a normal range of expression for a particular population is required as a “reference range” (10).
Immune function is controlled by a network of molecular and cellular pathways. It is well recognized that suppressed immune responses (for example, immunosuppressive therapies and AIDS) or excessive responses (for example, acute respiratory distress syndrome and autoimmunity) can contribute to disease. Thus, homeostatic control and tight regulation of responses are fundamental characteristics of the immune system. For example, in the absence of disease, body temperature remains relatively constant within an individual, suggesting that the body strives to hold its temperature close to a defended set point. During a response to infection, the inflammatory cytokines interleukin-1, interleukin-6, and tumor necrosis factor are released into the blood and bind with receptors in the hypothalamus, resulting in fever (11). However, immune cells also manufacture and release factors, such as interleukin-1 receptor antagonist and interleukin-10, that counteract the effects of pro-inflammatory cytokines and reduce body temperature (12,13). As a result, body temperature rises only moderately during many fever episodes, and returns to its previous set point upon clearance of the infection. This and other evidence (14) imply that inflammatory/immune genes may be tightly regulated. It is further hypothesized that immune system homeostasis would be reflected in a narrow range of expression levels or set points for key molecules in these pathways among healthy subjects.
In certain gene expression studies, reproducible patterns in subsets of genes have been noted in normal tissues (15–18). The majority of these studies have used microarrays to explore the patterns of expression in isolated blood cell fractions (15,18) or other target tissues, including retina (16) and skin (17). Some studies (16,19) have used replicate arrays to assess the relative contributions of technical and biological factors to the overall variation in measurement values. The results show interindividual variation for gene expression, as well as variation over time within an individual. In addition, gene expression can be sensitive to sources of technical variability, such as time after phlebotomy and method of RNA isolation (20–23). Even within a platform, such as microarray, considerable divergence is reported (24).
In recent years, quantitative real-time (QRT) PCR has emerged as an effective and reproducible tool for transcript analysis (25). It measures relative abundances through PCR-based synthesis of target gene amplicons and activation of target-specific fluorescent probes. The amount of fluorescence generated during the exponential amplification phase provides robust comparative abundance measurements for different amplicons in the same or different wells (25). Whole blood contains representative populations of all the mature cells of the immune system as well as secretory proteins associated with cellular communications (26). The earliest observable changes of cellular immune activity are altered levels of gene expression within the various immune cell types (27). Therefore, QRT-PCR can be an effective technology for reproducibly quantifying gene expression in whole blood.
In studies reported here, we explored the variation among apparently healthy blood bank donors in the expression of a set of genes involved in immune responses. QRT-PCR was used to measure immune-related gene expression in whole blood samples, using procedures designed to sustain a high level of precision (repeatability and reproducibility). We tested the observed distribution of values to determine if it was consistent with sampling from a log-normal distribution, as has been asserted for many genes (28,29), and computed maximum likelihood estimates for the parameters of this distribution. We used statistical models to estimate the contributions of gender, age, and ethnicity to the overall differences in expression among subjects. By performing replicate measurements on longitudinal samples from a group of 8 donors, we computed relative proportions of variance arising from technical, temporal, and intersubject variability. Finally, to obtain limits for the dynamic range of expression achievable with a strong inflammatory stimulus, we performed time-course measurements for several immune response genes in a group of healthy volunteers challenged with an infusion of the bacterial endotoxin lipopolysaccharide (LPS).
Materials and Methods
Single-time-point blood samples from 131 blood donors satisfying American Red Cross blood bank standards (30) were obtained from 3 individual donor centers operated by Bonfils Blood Center, Denver, CO, USA. The samples were drawn on 3 different days over a 3-month period. Subject ages ranged from 22 to 69 years, with a median age of 44 years; age was not recorded for 61 subjects. Women (n = 64) and men (n = 67) were represented in about equal numbers. Ethnicity was reported as white/non-Hispanic for 109 subjects, Hispanic for 19, African-American for 2, and Asian/Pacific Islander for 1. No subjects in this study showed overt signs of disease that would make them ineligible to donate blood under American Red Cross standards. Because we cannot rule out undetected disease in the subjects, however, we refer to them as “apparently healthy” (18).
In addition, longitudinal samples were drawn from 8 volunteers (3 women, 5 men, age range 23 to 50 years) from the Denver area. Samples were collected from these donors approximately once per month for 6 to 8 months, yielding a total of 58 samples.
Samples from the blood donor subjects were collected under Western Institutional Review Board Study No. 20010324. The studies were also reviewed by the Lawrence Livermore National Laboratory Institutional Review Board. Written informed consent was obtained from all volunteers.
In a separate study, 6 healthy volunteers were injected intravenously over 1 min with a single dose (30 units/kg) of Gram-negative bacterial LPS, according to an approved protocol at Guys Hospital, London, UK. Blood samples were drawn and assayed before the LPS injection (0 h) and 2 and 5 h after LPS injection. Additional blood samples from 3 of 6 subjects (adult male volunteers who signed an informed consent form) were drawn and assayed 21 h after LPS injection. Medical history, physical examination, routine laboratory examination, and electrocardiogram were all normal. Subjects did not use any medication or have any significant illness within 8 weeks of the study.
Sample Handling, Purification of RNA, and Preparation of cDNA
Blood was collected from study subjects by standard phlebotomy methods using a 21-gauge butterfly needle and PAXgene Blood RNA Tubes (no. 762115; Qiagen, Valencia, CA, USA) to stabilize messenger RNA (mRNA) against degradation and prevent induction of new mRNA expression (23). Samples were gently mixed by inversion and sat at room temperature for 2 to 24 h to ensure complete nucleic acid stabilization. Samples were then frozen at −70 °C and batch-shipped on dry ice in compliance with International Air Transport Association (IATA) shipping regulations.
Total RNA from PAXgene Blood RNA samples was extracted within 30 days of collection using the PAXgene Blood RNA Kit (no. 762134; Qiagen). RNA samples were treated with RNase-free DNase I (no. 79254; Qiagen) for digestion of contaminating genomic DNA, using manufacturer-recommended protocols during the purification process. Purified RNA samples were placed at −80 °C for long-term storage.
First-strand cDNA was synthesized with random hexamer primers using TaqMan Reverse Transcription reagents (N808-0234; Applied Biosystems, Foster City, CA, USA). Approximately 250 ng RNA was added to a prepared reverse-transcription reagent mixture consisting of PCR Buffer II, 1×; MgCl2, 5.5 mM; random hexamers, 2.5 µM; dNTP blend, 2 mM; RNase inhibitor, 40 units; and MultiScribe Reverse Transcriptase, 125 units. Samples were incubated at ambient temperature for 10 min with subsequent incubation at 37 °C for 60 min. After the 37 °C incubation, samples were incubated at 90 °C for 10 min and immediately chilled on ice. Newly synthesized cDNA samples were then placed at −80 °C for storage. Prior to QRT-PCR analysis, each cDNA sample was quality control tested for RNA quantity and quality of target genes using quantitative PCR analysis (QPCR; ABI Prism 7700 Sequence Detection System, Applied Biosystems, Foster City, CA, USA) of the 18S rRNA and β-actin.
QRT-PCR Analysis of Target Genes
Primer/probe reagents were custom-designed to achieve 3 performance criteria: (1) single-gene specificity of amplification as tested by gel electrophoresis, (2) dilutional linearity of amplification performance over 2 orders of magnitude, and (3) optimal amplification efficiency of 100 ± 6%, to yield a 2-fold change in transcript per CT unit (31). Primer/probe sets were designed to span 90 to 120 base pairs, optimized for robust amplification and specificity, minimization of secondary hybridization, and consistent performance. Quality control testing of reagents and manufactured plates ensured that amplification specificity and efficiency remained within established metrics during storage and new synthesis of nucleotides.
Amplification specificity was tested by QRT-PCR with a custom cDNA standard template of induced whole blood and cell lines. Specificity was determined by the size, number, and DNA sequence of the amplified product. The size and number of amplified products was determined by agarose gel electrophoresis. Amplified products were electrophoresed on a 4% agarose gel to visualize the number of DNA bands present. The molecular weight of each band was determined by comparison to known molecular weight markers (no. PR-G1741; Fisher Scientific, Hampton, NH, USA). The presence of a single DNA band of the correct size suggested specific amplification of the intended gene sequence. In certain cases, the amplified product DNA sequence was compared with the published sequence. Primer/probe amplification of genomic DNA was investigated using purified genomic DNA rather than cDNA as the template for QRT-PCR. The formation of primer dimers and spurious amplification was also investigated using DEPC water as a “no template” control for the QRT-PCR assay.
Amplification efficiency of a primer/probe set was determined by a dilutional linearity assay, using 5 serial dilutions of the standard cDNA template and running PCR reactions on each dilution in replicates of 4. Two or more versions of each target gene primer/probe set were designed and tested to select for both amplification efficiency and specificity. Similarly, each new primer/probe reagent lot was monitored to ensure matched amplification specificity and efficiency to previous primer/probe reagent lots.
Target gene transcripts were analyzed by QRT-PCR for each cDNA preparation using 2× TaqMan Universal PCR Master Mix (no. 4305719; Applied Biosystems) and Source MDx’s proprietary primerprobe sets. Reactions were run in sets of 4 replicates per gene (24 gene targets in a 96-well plate) on an ABI Prism 7700 Sequence Detection System. Each well also contained the specific primers and probe set to measure 18S rRNA as an internal control. The amount of cDNA template added to each reaction was held to a relatively narrow range, as determined by the cDNA quality control measurement of 18S RNA.
The difference between the fluorescence CT for the target gene and the endogenous control (18S rRNA) is presented as a ΔCT value (CT of target − CT of control]. For reference, a ΔCT of 2 is approximately equivalent to a 4-fold change in the amount of the transcript. For example, at baseline, TGFβ may have a ΔCT value of 16; after treatment, that ΔCT value may increase to 18. This change represents a 2 ΔCT difference or a decrease of 75% (1/4). The CT reporting system and estimation of relative gene expression are well described in the literature (32).
CT values above 37 were not used in the analysis, because they correspond to gene expression levels below the linear range of the assay. Values over this threshold were obtained for varying proportions of samples, depending on the gene and the study population examined. For the single-time-point samples, the mean and SD of the underlying ΔCT distribution were inferred by maximum likelihood estimation (MLE), under the assumption of a normal distribution, for genes having up to 50% of their CT values over the threshold. Distribution parameters and dynamic ranges were not computed for genes with more than 50% of CT values greater than 37.
Tests for Normality
Because ΔCT values are roughly proportional to the logarithm of the corresponding mRNA abundances, we used a combination of analytical methods to test ΔCT values for each gene for departures from normality.
The Anderson-Darling and Shapiro-Wilk tests were used to test the data against the null hypothesis that the observed values were sampled from a normal distribution, parameterized by the observed mean and standard error. These tests differ in their sensitivity to outliers and in the weight given to central versus outlying values. Smaller P values from these tests indicate rejection of the null hypothesis, i.e., deviation from normality.
We also generated plots of the quantiles of each gene’s ΔCT values against the corresponding quantiles of a standard normal distribution (Q-Q normal plots), together with histograms and normal density curves, to graphically characterize their deviations from normality.
Linear Mixed-Effect Model Analysis
Previous reports on longitudinal gene expression data sets (16,19) suggest that, for many genes, expression levels in repeated samples from the same subject are relatively stable compared with interindividual differences, even when the repeat samples are separated by time periods of several weeks. To quantify the relative magnitudes of intersubject versus temporal and technical variability in apparently healthy, untreated subjects, we fitted a linear mixed-effects (LME) model to the longitudinal study data. In this data set, each ΔCT measurement was associated with a gene g, subject i, sample index j, and replicate k. An LME model for these data is described by equation 1:
where αg is an intercept term dependent on the gene only, ugi is a random effect due to intersubject variability, βgj is a fixed effect due to systematic variations in processing affecting all samples drawn at the same time point, vgij is a random effect representing variability among samples from the same subject, and εgijk is an error term encompassing all residual sources of variability between replicates. The random effects u gi, vingij, and εgijk are assumed to be normally distributed with mean zero and variances σ2S, σ2T and σ2R respectively. A restricted maximum likelihood (REML) algorithm (33) was used to fit the model parameters αg, βgj, σ2S, σ2T, and σ2R to the data.
In addition, it is useful to quantify the contributions to intersubject variability arising from subject characteristics such as sex, age, and ethnicity. All 3 of these parameters were recorded for 68 subjects in the single-time-point study. Expression data for these subjects was fitted to the LME model described by equation 2:
where αg is an intercept term dependent on the gene only, Gi, Ai, and Ei are the sex, age, and ethnicity of subject i, βg(G,E) is a gene-specific offset for the given sex and ethnicity, ζg(G,E) is the slope of a linear age effect depending on both sex and ethnicity, ugi is a random effect due to intersubject variability not explained by age, sex, or ethnicity, and εgik is an error term encompassing all residual sources of variability between replicate PCR reactions for a given sample. After fitting this model, the percentage contribution of sex, age, and ethnicity effects to the intersubject variance for gene g was estimated by equation 3:
where N is the total number of measurements for gene g, σ2S is the variance parameter estimated for the distribution of the random subject effects, predicted ΔCT is the value predicted from the fixed effects portion of equation 2, and mean ΔCT is computed over all measurements for gene g.
Most Genes Exhibit Limited Dynamic Range of Expression Across Subjects in Single-Time-Point Measurements
A series of studies were undertaken to examine the expression of immune-related gene transcripts in whole blood of apparently healthy subjects. In the largest single-time-point study, blood was collected from 131 blood donors following the American Red Cross donor standards and analyzed for the expression of 48 inflammation- and immune-related gene transcripts. These transcripts encode cell surface molecules, such as CD4, CD14, CD19, and ICAM-1; signaling molecules, such as PTGS2 (COX2), PLA2G7, and NF-κB; cytokines, such as IL-1B and TGFβ; proteinases, such as ELA2; and proteinase inhibitors (see Table 1). The overall range of CT values for the 48 genes studied is plotted in Figure 1. The bars in the plot encompass the central 90% of the observed values (i.e., they extend from the 5th to the 95th percentiles), whereas the whiskers on either end of the bar extend to the extreme values. For genes with expression levels sampled from a log-normal distribution, the ends of the bars would correspond to 1.64 SD on either side of the mean CT.
Of the 48 genes profiled in this study, 2 important signals of inflammation, IL6 and CXCL2, lacked detectable expression in most of the apparently healthy subjects, and their CT values were at or greater than 37. Dynamic ranges and variance components were not computed for these genes. For the remaining 46 genes, the estimated SD of the ΔCT values ranged from 0.44 to 1.46 and were below 0.792 for 36 of the 46 genes, as shown in Table 1. Thus, the dynamic range of expression extending 2 SD in either direction from the geometric mean was less than 22 * °.792 or a 3-fold change (32). For normally distributed ΔCT values, this range covers 95.4% of the sample measurements. The distribution of dynamic ranges corresponding to a ±2 SD span is shown in Figure 2. The highest dynamic range observed was 7.53-fold change units for IL-8. The SDs of ΔCT values were independent of the mean ΔCT, indicating that the dynamic ranges did not depend on a gene’s expression level.
The Majority of Genes Have Expression Values following Log-Normal Distributions
Commonly used parametric tests for differential gene expression between groups of samples, such as t tests and analysis of variance, are based partly on the assumption that the values being compared are sampled from normal distributions. Although it is commonly asserted that transcription levels of many genes are log-normally distributed (28,29), it is important to test this assumption to use such tests for disease diagnosis and detection. The majority of expressed transcripts followed approximately log-normal distributions, according to the Anderson-Darling and Shapiro-Wilk tests (Table 1, Figure 3). The gene most closely following a normal distribution of ΔCT values was IL1R1 (Figure 3A), with an AndersonDarling P value of 0.945. Among the 46 genes tested, 34 had P values greater than 0.001. All genes had unimodal distributions; the deviations from normality involved moderate degrees of left or right skewness, and/or heavy or light tails. Although these departures were not dramatic, they will need to be incorporated into the predicted error rates for diagnostic tests based on expression of these genes.
Of the 48 genes shown in Table 1, the gene deviating most from a normal distribution of ΔCT values was TNFSF5 (CD40 ligand, Figure 3B), with an Anderson-Darling P value of 1.52 × 10−10. The observed distribution is characterized by a heavy tail and large ΔCT, suggesting the presence of a subpopulation with an unusually low expression level of this gene.
Minor Variations in Expression May Be Based on Sex, Ethnicity, and Age
Table 2 shows the contributions of sex, age, and ethnicity on interindividual variation estimated by the LME model (equation 2). For the 43 genes examined, the observed effects of sex, ethnicity, and age were small. Only 10 genes had contributions from these effects, explaining more than 20% of the intersubject variance; the maximum contribution was only 27.9% for NFKB1. For most genes, sex effects accounted for most of this contribution. Fifteen genes showed significant sex differences (unadjusted P value < 0.05), but the largest fold change from women to men was only 1.62 for TNFSF6. Likewise, only moderate ethnicity effects were observed. Five genes (MPO, MYC, TNFSF6, ELA2, and HMGB1) showed significant differential expression between white (non-Hispanic) and Hispanic subjects, with the largest change being a 2.5-fold overexpression of ELA2 in Hispanic women relative to white women.
Age effects were difficult to measure in this data set, due to the markedly different age distributions between the female and male blood donors. Male blood donors had a median age of 53 years, compared with 43 years for females. Therefore, sex and age effects are potentially confounded. The LME model defined in equation 2 addresses the confounding factors by fitting the ΔCT versus age data to different slopes for each sex/ethnicity combination. According to the LME model, 3 genes (IL18, ELA2, and C1QA) had significant age effects for at least 1 sex/ethnicity combination. For all 3 of these genes, the fitted slopes were markedly different between sexes. For example, age had virtually no effect on IL18 expression in white men, whereas in white women the slope corresponded to a 2-fold increase from age 23 to age 69. Similarly, the fitted slopes suggest dramatic differences in age effects among ethnicities. Overall, the size of the sample is too small to reliably estimate ethnic differences.
Variation of Expression within Subjects Over Time Is Limited
To compare the contributions of intersubject, temporal, and technical components to the overall variation in gene expression, we fitted the LME model (equation 1) to the longitudinal set of measurements described in “Materials and Methods.” For this data set, we fitted the model for each of 29 genes with detectable expression in at least 90% of the samples to obtain, for each gene, a set of variance parameters σ2S, σ2T, and σ2R. These are approximate estimates of the contributions to the total variance from intersubject variation, variation among samples taken at different times from each subject, and residual variation between replicate reactions, respectively.
The results of the initial LME model analysis are summarized in Figure 4, which shows the fitted standard error parameters σ2S, σ2T, and σ2R for each gene. For 6 of the 29 genes examined (CD19, TNFSF13B, HMOX1, C1QA, CD8A, and CD4), intersubject variation comprised more than 50% of the total variance of ΔCT values. For the remaining 23 genes, variation between samples taken at different times was the largest component. However, the magnitude of the temporal variation was limited; the parameter σT ranged from 0.36 ΔCT units for the gene PTPRC to 0.72 ΔCT units for MMP9. The dynamic ranges corresponding to 2σT ranged from 1.66- to 2.72-fold change units. Because measurements from samples taken over a period of 8 months may be subject to several sources of technical variation (for example, instrument calibration, reagent lots, and variations in sample handling), these ranges can be considered upper limits on the true temporal variation of expression for the genes analyzed.
LPS Stimulation Induces Transient Gene Expression Changes in Excess of Natural Variation
To demonstrate that changes marked beyond the normal reference range occur, gene expression was measured in blood collected from healthy subjects injected with LPS. Healthy subjects who receive an injection of LPS experience mild fever and flu-like symptoms that subside within 24 h (35). Figure 5 shows the expression of a subset of genes with significant changes at any time point after LPS injection. Reference ranges (mean ± 2 SD) for healthy subjects are indicated by dashed lines. The plotted ΔΔCT values are computed relative to the mean ΔCT for the apparently healthy blood donors. Individual time courses are shown for each subject. Twenty-seven genes had significant changes in expression in LPS-injected subjects at any time postinfusion relative to apparently healthy blood donors, with adjusted false discovery rates of less than 5%. Each of these genes had pre-injection expression levels within the normal reference range for apparently healthy blood donors; each showed increased or decreased expression at 2 and/or 5 h postinfusion; and most returned to the normal expression range by 21 h after infusion. Fifteen genes increased or decreased expression by a factor greater than 10-fold, and 2 (MMP9 and IL1RN) increased more than 90-fold (Figure 5). Because the innate immune system’s immediate response to LPS infusion is the production of inflammatory mediators by monocytes, it is not surprising that the genes showing substantial increases in expression include cytokines and chemokines associated with the monocyte/macrophage lineage, such as TNF, IL1B, CXCL1, and IL18. Key cell-surface markers (ICAM1, CD14) and signaling molecules (PTGS2/COX-2) also respond. Interestingly, the anti-inflammatory regulator IL1RN, which blocks the binding of IL1 to its receptor, was 1 of the 2 most overexpressed genes. This fits with the premise that inflammatory processes are tightly regulated by coordinated expression of pro-inflammatory and anti-inflammatory factors. These include genes with significant decreases in expression such as PLA2G7 and TNFSF5 (CD40 ligand) (see Figure 5).
The studies reported here are an initial step toward establishing normal reference ranges for the expression of genes related to inflammation and immunity. Several key observations emerged. First, the dynamic range of expression of most immune response genes is relatively limited among apparently healthy subjects. Second, expression levels for most genes analyzed are approximately log-normally distributed. Third, individuals exposed to bacterial endotoxin have gene expression profiles that are easily (albeit transiently) distinguished from those of an apparently healthy population. In developing the methods for these studies, it was also observed that multiple technical factors, including sample handling procedures, PCR reagents, and instrument calibration, contribute to the overall variation, which must be carefully controlled. Taken together, these observations support both the usefulness and practicality of establishing normal reference ranges for gene expression assays related to immune system function.
A variety of biological factors may contribute to the variation of expression observed in apparently healthy subjects (18). In general, these factors can be divided into intrinsic (for example, age, sex, genetics) and extrinsic (for example, inflammatory, autoimmune disease, cancer, infections, and metabolism) factors. The apparently healthy blood donor population studied here may have included individuals with subacute illnesses or chronic conditions that contributed to the variability in expression of some immune response genes. Many chronic inflammatory and atopic diseases, such as arthritis, asthma, ulcers, gastritis, and allergies, are highly prevalent in the U.S. adult population, with frequencies ranging from 7% to 27% (36). Nonetheless, individuals with these conditions are deemed “healthy” and permitted to donate blood, provided these “chronic conditions are bring treated and the condition is under control,” and they “feel well and are able to perform normal activities” (30).
Atherosclerosis is another highly prevalent condition which develops over several years and is asymptomatic in its early or even late stages. Several studies have demonstrated an elevation of C-reactive protein and other markers of inflammation in early stages of cardiovascular disease (37,38). Chronic infections with viruses (cytomegalovirus, Epstein-Barr virus, genital herpes, and human papillomavirus), bacteria (Helicobacter pylori), and protozoans (Toxoplasma gondii) also are common in the U.S. population, but do not consistently produce symptoms in immunocompetent persons. Periodic reactivation and suppression of these infections may account for some of the background variation in immune response gene expression. Dietary influences on immune system gene expression may include consumption of omega-3 fatty acids, arginine, and other nutrients as well as vegetarian diets (39,40).
Age, sex, and ethnicity also may contribute to the intersubject variation observed for several transcripts. However, the contributions of these factors appeared to be modest in the present study. Variations associated with age and sex have been previously reported (18,41,42), with some sex differences being directly attributable to differences in sex chromosomes (18). Several studies (18,42) have observed individual differences in interferon-responsive genes among individuals, suggesting further stratification in an apparently normal healthy subject group. Larger studies specifically targeting some of these factors are needed to elucidate the effects so that populations can be stratified for more precise diagnostic resolution.
Intrinsic and extrinsic factors can also alter the proportions of blood cell types such as neutrophils, monocytes, and lymphocytes, as well as the relative expression of individual transcripts within each cell type. These effects combine to produce the observed variation in transcript abundances in whole blood. The individual contributions of cell populations and gene regulation within cell types could be examined using flow cytometry combined with QRT-PCR, and deserve further study.
Given the variety of factors that can affect the expression of immune response genes in a blood donor population, it is remarkable that the overall dynamic range of expression is not wider than observed in the present study, whereas larger, up to 90-fold, but transient changes can be induced by the severe acute inflammatory stimulus LPS. In other diseases, such as rheumatoid arthritis and lupus, differences in gene expression from apparently healthy normals are more modest, 2- to 5-fold (43). These observations support the view that expression of these genes is maintained within limits by regulatory mechanisms, possibly to reduce the danger of tissue damage from constant activation of immune responses, while allowing appropriate responses to infectious threats. The limited dynamic range observed supports the development of expression-based diagnostics, allowing expression outside the normal reference range to indicate the presence of infections, cancer, or indolent autoimmune diseases.
Molecular diagnostics, including those based on gene expression, are increasingly being applied in the clinic (44,45). These tests have improved the selection of therapies, as well as dosage and treatment schedule. In addition, “treat-to-normal” strategies are routinely used in major diseases such as hypertension and diabetes. Assays based on precise, quantitative measurements of immune system gene expression offer the promise of effective clinical monitors in infection, autoimmune diseases, and other immune-related conditions, such as transplant rejection and drug- or virus-induced immunosuppression, as well as cancer. A better understanding of the relevant factors that contribute to the individuality of gene expression in the human will help to establish the most appropriate normal reference values in the clinic and will serve as an essential step in the development of effective molecular diagnostics for these and other inflammatory and immunologic diseases.
Bild AH et al. (2006) Oncogenic pathway signatures in human cancers as a guide to targeted therapies. Nature 439:274–5.
Gladkevich A, Nelemans SA, Kauffman HF, Korf J. (2005) Microarray profiling of lymphocytes in internal diseases with an altered immune response: potential and methodology. Mediators Inflamm. 2005:317–30.
Han D, Leith J, Alejandro R, Bolton W, Ricordi C, Kenyon NS. (2005) Peripheral blood cytotoxic lymphocyte gene transcript levels differ in patients with long-term type 1 diabetes compared to normal controls. Cell Transplant. 14:403–9.
Perez EA, Pusztai L, Van de Vijver M. (2004) Improving patient care through molecular diagnostics. Semin. Oncol. 31(5 Suppl 10):14–20.
Baechler EC et al. (2003) Interferon-inducible gene expression signature in peripheral blood cells of patients with severe lupus. Proc. Natl. Acad. Sci. U. S. A. 100:2610–5.
Rus V, Chen H, Zernetkina V, Magder LS, Mathai S, Hochberg MC, Via CS. (2004) Gene expression profiling in peripheral blood mononuclear cells from lupus patients with active and inactive disease. Clin. Immunol. 112:231–4.
Chen X et al. (2002) Gene expression patterns in human liver cancers. Mol. Biol. Cell 13:1929–39.
Gang J et al. (2005) Discovery and analysis of pancreatic adenocarcinoma genes using DNA microarrays. World J. Gastroenterol. 11:6543–8.
Chang JC et al. (2003) Gene expression profiling for the prediction of therapeutic response to docetaxel in patients with breast cancer. Lancet 362:362–9.
US patent no. 6,960,439: Identification, monitoring and treatment of disease and characterization of biological condition using gene expression profiles, covering the use of a Healthy Normals Reference Dataset, issued to Source MDx, Nov. 4, 2005.
Stitt JT (1979) Fever versus hyperthermia. Fed. Proc. 38:39–43
Conti B, Tabarean I, Andrei C, Bartfai T. (2004) Cytokines and fever. Front. Biosci. 9:1433–49.
Dinarello CA. (2004) Infection, fever, and exogenous and endogenous pyrogens: some concepts have changed. J. Endotoxin Res. 10:201–22.
Jiang H, Chess L (2004) An integrated view of suppressor T cell subsets in immunoregulation. J. Clin. Invest. 114:1198–1208.
Campbell C, Vernon SD, Karem KL, Nisenbaum R, Unger ER. (2002) Assessment of normal variability in peripheral blood gene expression. Dis. Markers 18:201–6.
Chowers I, Liu D, Farkas RH, et al. (2003) Gene expression variation in the adult human retina. Hum. Mol. Genet. 12:2881–93.
Cole J, Tsou R, Wallace K, Gibran N, Isik F. (2001) Comparison of normal human skin gene expression using cDNA microarrays. Wound Repair Regen. 9:77–85.
Whitney AR, Diehn M, Popper SJ, Alizadeh AA, Boldrick JC, Relman DA, Brown PO. (2003) Individuality and variation in gene expression patterns in whole blood. Proc. Natl. Acad. Sci. U. S. A. 100:1896–1901.
Cheung VG, Conlin LK, Weber TM, Arcaro M, Jen KY, Morley M, Spielman RS. (2003) Natural variation in human gene expression assessed in lymphoblastoid cells. Nat. Genet. 33:422–5
Baechler EC, Batliwalla FM, Karypis G, et al. (2004) Expression levels for many genes in human peripheral blood cells are highly sensitive to ex vivo incubation. Genes Immun. 5:347–53.
Debey S, Schoenbeck U, Hellmich M, Gathof BS, Pillai R, Zander T, Schultze JL. (2004) Comparison of different isolation techniques prior gene expression profiling of blood derived cells: impact on physiological responses, on overall expression and the role of different cell types. Pharmacogenomics J 4:193–207.
Han ES, Wu Y, McCarter R, Nelson JF, Richardson A, Hilsenbeck SG (2004) Reproducibility, sources of variability, pooling and sample size: important considerations for the design of high-density oligonucleotide array experiments. J. Gerontol. A Biol. Sci. Med. Sci. 59:306–15.
Rainen L, Oelmueller U, Jurgensen S, et al. (2002) Stabilization of mRNA expression in whole blood samples. Clin. Chem. 48:1883–90.
Tan PK, Downey TJ, Spitznagel EL, et al. (2003) Evaluation of gene expression measurements from commercial microarray platforms. Nucleic Acids Res. 31:5676–84.
Snider JV, Wechser MA, Lossos IS. (2001) Human disease characterization: real-time quantitative PCR analysis of gene expression. Drug Discov. Today 6:1062–7.
Liles WC, Van Voorhis WC. (1995). Nomenclature and biological significance of cytokines involved in inflammation and the host immune response. J. Infect. Dis. 172:1573–80.
Joyce DA, Steer JH, Beilharz MW, Stranger R. (1995). Asystem for assessment of monokine gene expression using human whole blood. Genet. Anal. 12:39–43.
Inoue M, Nishimura S, Hori G, Nakahara H, Saitom M, Yoshihara Y, Amari S. (2004) Improved parameter estimation for variance-stabilizing transformation of gene-expression microarray data. J. Bioinform. Comput. Biol. 2:669–79.
Naef F, Hacker CR, Patil N, Magnasco M. (2002) Empirical characterization of the expression ratio noise structure in high-density oligonucleotide arrays. Genome Biol. 3(4) Epub 2002 Mar 22
American Red Cross: www.redcross.org
Heid CA, Stevens J, Livak KJ, Williams PM (1996) Real time quantitative PCR. Genome Res. 6:986–94.
Livak KJ, Schmittgen TD. (2001) Analysis of relative gene expression data using real-time quantitative PCR and the 2−??CT method. Methods 25:402–8.
Pinheiro J, Bates DM. (2000) Mixed-Effects Models in S and S-PLUS. New York: Springer. 528 pp
R Development Core Team (2004) R Foundation for Statistical Computing, Vienna, Austria.
Martich G, Boujoukos A, Suffredini A (1993) Response of man to endotoxin. Immunobiology 187:403–16.
Schiller JS, Adams PF, Nelson ZC (2005) Summary health statistics for the US population: National health interview survey, 2003. Vital Health Stat. 10 Apr(224):1–104.
Koenig W et al. (1999) C-Reactive protein, a sensitive marker of inflammation, predicts future risk of coronary heart disease in initially healthy middle-aged men: results from the MONICA (Monitoring Trends and Determinants in Cardiovascular Disease) Augsburg Cohort Study, 1984 to 1992. Circulation 99:237–42.
Pearson TA et al. (2003) Markers of inflammation and cardiovascular disease: application to clinical and public health practice: A statement for healthcare professionals from the Centers for Disease Control and Prevention and the American Heart Association. Circulation 107:499–511.
Bistrian BR. (2004) Practical recommendations for immune-enhancing diets. J. Nutr. 134:2868S–72S.
Simopoulos AP. (2002) Omega-3 fatty acids in inflammation and autoimmune diseases. J. Am. Coll. Nutr. 21:495–505.
Eady J et al. (2005) Variation on gene expression profiles of peripheral blood mononuclear cells from healthy volunteers. Physiol. Genomics 22:402–11.
Radich J et al. (2004) Individual-specific variation of gene expression in peripheral blood leukocytes. Genomics 83:980–8.
Tryon V et al. High-precision gene expression analysis of rheumatoid arthritis and other inflammatory diseases. Int. Assoc. Inflammation Soc. Meeting Poster Presentation, Vancouver BC, August 2003.
Ross J et al. (2003) The HER-2/neu gene and protein in breast cancer 2003: biomarker and target of therapy. Oncologist 8:307–25.
Madhusudan S, Ganesan TS (2004) Tyrosine kinase inhibitors in cancer therapy. Clin. Biochem. 37:618–35.
The authors would like to thank C. Edwards, C. Dinarello, A. Rasley, D. Nelson, M. Ascher, and C.T. Rigl for helpful comments, review, and discussion. This work was performed under the auspices of the Lawrence Livermore National Laboratory and was supported with funds from the Laboratory Directed Research and Development (LDRD) Program.
About this article
Cite this article
McLoughlin, K., Turteltaub, K., Bankaitis-Davis, D. et al. Limited Dynamic Range of Immune Response Gene Expression Observed in Healthy Blood Donors Using RT-PCR. Mol Med 12, 185–195 (2006). https://doi.org/10.2119/2006-00018.McLoughlin
- Immune Response Genes
- Limited Dynamic Range
- Immune Response Gene Expression
- Normal Reference Range
- Standard Error Units