|
|
||||||||||
|
J Am Coll Cardiol, 2005; 45:1611-1619, doi:10.1016/j.jacc.2005.01.051 © 2005 by the American College of Cardiology Foundation |


* Sections of Cardiology and Atherosclerosis, Center for Preventive Cardiology, Department of Medicine, Baylor College of Medicine, Houston, Texas
Weill Medical College of Cornell University, New York, New York
University of Texas Health Science Center, Houston, Texas
Manuscript received October 14, 2004; revised manuscript received January 4, 2005, accepted January 11, 2005.
* Reprint requests and correspondence: Dr. Ali J. Marian, Baylor College of Medicine, Section of Cardiology, One Baylor Plaza, 519D, Houston, Texas 77030 (Email: amarian{at}bcm.tmc.edu).
| Abstract |
|---|
|
|
|---|
BACKGROUND: Mutations in PCSK9 cause autosomal-dominant hypercholesterolemia. We hypothesized that PCSK9 variants could affect plasma LDL-C in individuals with polygenic hypercholesterolemia.
METHODS: We sequenced all 12 exons and boundaries to detect novel polymorphisms, and genotyped 372 subjects in LCAS and 319 subjects in a second independent population for six polymorphisms, including novel leucine repeats, by fluorescently tagged markers. We reconstructed haplotypes using a Bayesian algorithm.
RESULTS: Permutation test results showed statistically significant differences in global haplotype distribution among the tertiles of LDL-C (odds ratio [OR]: 2.36, 95% confidence interval [CI]: 1.90 to 4.32, p = 0.005) and minimum lumen diameter of coronary lesions (OR: 1.83, 95% CI: 1.01 to 3.55, p = 0.045). Regression analysis identified haplotype 3 as an independent determinant of LDL-C levels (adjusted R2 = 2.2%, F = 9.37, p = 0.002). Haplotype structure analysis identified E670G as the determinant variant, exerting a dose effect (GG > EG > EE) and accounting for 3.5% of plasma LDL-C variability (F = 14.6, p < 0.001). Plasma total cholesterol, apolipoprotein B, and lipoprotein (a) levels were also associated with the E670G variant. Distributions of the E670G genotypes in an independent normolipidemic and the hyperlipidemic LCAS populations were significantly different (F = 7.2, p = 0.027). No significant treatment-by-genotype interactions were detected. The false positive report probability was between 2% and 8%.
CONCLUSIONS: Haplotype 3 encompassing the E670G variant is an independent determinant of plasma LDL-C levels and the severity of coronary atherosclerosis.
| |||||||||||||||||
We hypothesized that genetic variants of PCSK9 could also affect plasma levels of LDL-C in polygenic (non-Mendelian) hypercholesterolemia. To test this hypothesis, we screened the exons and exon-intron boundaries of PCSK9, identified novel polymorphisms, reconstructed haplotypes, and analyzed the association of the haplotypes and genotypes with plasma levels of lipids, severity of atherosclerosis, and their responses to fluvastatin therapy in a well-characterized population with high LDL-C levels. We then tested the main results for replication in an independent population with normal plasma LDL-C levels.
| Materials and methods |
|---|
|
|
|---|
Detection of novel single nucleotide polymorphisms (SNPs) in PCSK9. We screened all 12 exons of PCSK9 and exon-intron boundaries for the presence of polymorphisms in 50 healthy Caucasians by polymerase chain reaction (PCR) and direct sequencing (See Appendix for online supplementary Table 1). We compared the sequence among the individuals and with the published GenBank sequence (NT_032977).
Genotyping. Genotyping was performed by 5' nuclease assay (allelic discrimination assays) and capillary electrophoresis for 13,097C > T, 13,248A > G, 19,018A > G (I474V), 23,968A > G (E670G), and 24,609T > C SNPs using the Applied Biosystems Prism 7900HT Sequence Detection System (See Appendix for online supplementary Table 2). The short tandem repeat (STR) polymorphism was genotyped using fluorescent-labeled primers, PCR amplification of a 243-bp encompassing fragment, and capillary electrophoresis on an Applied Biosystems 3100 genetic analyzer and analyzed using the GeneScan/Genotyper software (version 3.7, 2001, Applied Biosystems Inc.). An investigator without knowledge of the angiographic and clinical data performed the genotyping.
In silico haplotyping and population frequency of the haplotypes. We used Phase 2.0 (10,11) to reconstruct haplotypes and estimate population haplotype frequencies, as published (12). Phase 2 implements a Bayesian statistical method to infer phase and to reconstruct haplotypes from population genetics by Markov Chain-Monte Carlo algorithm and coalescent theory. It has been shown to infer haplotypes more accurately than other Bayesian-based methods in real data sets (10). Other previously used algorithms, such as Partition-Ligation-Expectation-Maximization (PL-EM) and Haplotyper (13,14), did not permit haplotype reconstruction in the presence of three alleles of the STR marker. We determined the estimates of population haplotype frequencies for 20, 100, 500, 1,000, and 10,000 iterations. Pairwise linkage disequilibrium (LD) was calculated with known gametic phases, estimated by expectation-maximization algorithm, using Arlequin 2.0 (University of Geneva, Geneva, Switzerland) (15).
Replication study population (TexGen). To determine whether the main findings in the LCAS population could be replicated in an independent population with normal plasma LDL-C levels, we screened 1,010 individuals from our outpatient clinics and identified 319 subjects who had plasma LDL-C levels of <130 mg/dl. Demographic data were collected, and fasting plasma levels of total cholesterol, HDL-C, LDL-C, and TG were obtained. Subjects with advanced comorbid conditions, including malignancies, advanced heart failure, and valvular heart disease, were not included.
False positive report probability.
We determined the probability of no true association between the variant and phenotype given a statistically significant finding using a previously described method as follows: False positive report probability (FPRP) =
(1
)/[
(1
) + (1 ß)
], whereby
is the probability of a statistically significant finding, given that null hypothesis of no association is true; 1 ß is the statistical power, and
is the prior probability (16).
Statistical analysis. Mean and SD were used for analysis of the continuous variables unless otherwise specified. Permutation analysis (between 100 and 1,000 permutations and 10,000 iterations) was used to compare haplotype frequencies among the categorical phenotypes (Phase 2.0). Because Phase 2.0 does not permit a permutation test for the parametric phenotypes, they were discretized to categorical phenotypes according to the tertiles of the traits. Linear and logistic regression analyses were used to detect association of the parametric and nonparametric phenotypes with the haplotypes, respectively. Differences in parametric phenotypes between carriers and noncarriers of specific haplotypes or genotypes were compared by analysis of variance or Student t test and for nonparametric phenotypes by the Kruskal-Wallis test. Treatment-by-haplotype interactions were analyzed by permutation analysis as well as by analysis of variance (general linear model).
| Results |
|---|
|
|
|---|
|
Polymorphisms in PCSK9. The list and location of SNPs and STR markers, the reference number, and the minor allele frequencies (MAF) are shown in Figure 1. The STR polymorphism in exon 1 (CTG)n leads to expansion of amino acid leucine repeats Ln of 9, 10, and 11.
|
LD and frequency of haplotypes. Table 2 shows the D' and r2 values for pairwise LD. Increasing the number of iterations to more than 100 did not significantly affect the estimates of population haplotype frequencies (See Appendix for online supplementary Table 4). Thirty, including 11 relatively common, haplotypes (f > 0.02) were detected, with mean and median values of certainties of haplotypes assignment being 87% and 94%, respectively. Estimates of population haplotype frequencies (10,000 iterations) are shown in Table 3.
|
|
PCSK9 genotypes/haplotypes and baseline plasma levels of lipids. A permutation test showed a significant association between PCSK9 haplotypes and the tertiles of plasma LDL-C levels (p = 0.05), and linear regression analysis identified haplotype 3 as an independent predictor (adjusted R2 = 2.2%, F = 9.37, p = 0.002). Mean plasma LDL-C levels were 143.65 ± 19.7, 152.21 ± 20.06, and 172.25 ± 15.91 (mg/dl) in those with zero copy (N = 335), one copy (N = 35), and two copies (N = 2) of haplotype 3 (F = 4.95, p = 0.008). Similarly, LDL-C levels were higher in those with haplotype 3 as compared with the referent haplotype (Fig. 2). Logistic regression analysis showed an association between the tertiles of LDL-C and haplotype 3 (OR 2.36, 95% CI 1.90 to 4.32, p = 0.005). Multivariate linear regression analysis that included age, gender, body mass index, smoking status, alcohol consumption, diabetes, and all 11 common haplotypes identified haplotype 3 (regression coefficient: 10.4, T = 2.58, p = 0.010, N = 39), and age (regression coefficient: 0.2841, T = 1.97, p = 0.050) as independent predictors of plasma LDL-C levels. Reconstruction of the haplotypes after exclusion of the Ln polymorphism, which was in linkage equilibrium with the SNPs, reduced the number of haplotypes from 30 to 19. In accord with the original analysis, haplotype 3 remained an independent predictor of plasma LDL-C levels (regression coefficient: 10.2 ± 4.0, T = 2.6, p = 0.011). Plasma LDL-C levels were associated with haplotype 3 in a copy-dependent manner (0 copies: 143.7 ± 19.6, N = 343; 1 copy: 153.9 ± 22.0, N = 27; 2 copies: 172.3 ± 15.9, n = 2; F = 5.3, p = 0.006). Regarding other lipids and lipoproteins, haplotype 3 (regression coefficient: 12.8, T = 2.78, p = 0.006) and gender (regression coefficient: 22.3, T = 7.0, p < 0.001) were independent predictors of plasma levels of TC (adjusted R2: 12.3%, F = 26.4, p < 0.001). Subjects with haplotype 3 had higher plasma TC levels than those without (256.0 ± 12.0 vs. 226.0 ± 25.4 vs. 219.8 ± 24.0, p = 0.042). There were no associations between haplotypes and plasma levels of HDL-C and triglycerides. Finally, multivariate regression analysis in a Caucasian-only population showed a significant association between plasma LDL-C levels and haplotype 3 (F = 4.27, p = 0.015). Plasma levels of LDL-C (mg/dl) were higher in subjects with haplotype 3 (152.1 ± 20.1, N = 33) than those without (143.0 ± 19.7, N = 300, p = 0.018).
|
|
Pharmacogenetics. Treatment with fluvastatin reduced plasma LDL-C levels by 25.3 ± 17.5% and apoB by 16.0 ± 15.4%, and raised plasma levels of HDL-C and apoA-I by 9.3 ± 15.7% and 6.9 ± 20.1%, respectively (all p < 0.005 as compared with placebo). Plasma TG levels remained unchanged (0.8 ± 41%) in the fluvastatin group, but were increased by 9.8 ± 40% in the placebo group (p = 0.033).
In multivariate regression analysis, only treatment with fluvastatin was the primary predictor of changes in plasma LDL-C during the follow-up period. There were no significant haplotype- or genotype-by-treatment interactions regarding changes in TC, LDL-C, and HDL-C levels or change in MLD or development of new coronary lesions in response to treatment with fluvastatin (as compared with placebo).
Fifty-four clinical events occurred during the follow-up. There were no statistically significant interactions between SNPs or haplotypes and the incidence of new clinical events in the entire population.
FPRP.
Because PCSK9 mutations are known to affect LDL-C levels in ADH, the prior probability was considered moderate to high and multiple
values (0.1, 0.15, 0.20, and 0.25) were tested. The sample size of 37 subjects with haplotype 3 and 335 without provided >99% power to detect the observed 10-mg/dl differences in plasma LDL-C levels at an
value of 0.05 and SD of 20 mg/dl. Accordingly, FPRP was calculated for the observed p value of 0.008,
values ranging from 0.1 to 0.25, and ß values ranging from 0.05 to 0.2. Even at the most relaxed ß value of 0.2, the FPRP varied from 3% to 8% for
between 0.1 to 0.25. At the ß value of 0.01, which was the calculated value, FPRP was between 2% and 7% for the prior probabilities ranging from 0.1 to 0.25. Calculation for the observed association of E670G cSNP with plasma LDL-C levels also showed a low rate of false positive probability.
Results in the TexGen population. Because the E670G cSNP was the main SNP determining plasma LDL-C levels, we genotyped 319 individuals for the E670G cSNP by the 5' nuclease assay. The AA genotype was present in 291 individuals; AG in 28 subjects and GG in 0 subjects, which followed the HWE (expected numbers were AA: 292; AG: 27; and GG: 0; p = 0.888). Compared with the LCAS population, the frequency of the G allele was lower (0.043 vs. 0.074, chi-squared = 5.5, p = 0.019) and the GG genotype was absent in the TexGen population. The overall distribution of genotypes differed between the LCAS and the TexGen populations (chi squared = 7.2, p = 0.027). Plasma LDL-C levels were 88.1 ± 25.3 in those with the AA and 94.1 ± 21.4 in those with the AG genotype (p = 0.10). Plasma levels of TC, HDL-C, and TG were not significantly different between the AA and AG groups (data not shown).
| Discussion |
|---|
|
|
|---|
The strengths of the study are the prospective and placebo-controlled randomized design of LCAS and comprehensive phenotypic characterization, an essential component of LD studies. The study includes detection and analysis of novel polymorphisms including the Ln polymorphism, and reconstruction of haplotypes comprising information on the content of six polymorphisms that collectively span the PCSK9 locus. Statistical analyses used a permutation test, which is considered robust and less prone to spurious association. Cognizant of the relatively high rate of spurious results in genetic association studies, we also calculated the FPRP, which was less than 8%, even under the most relaxed conditions. Finally, the findings are in accord with recent genetic linkage mapping and detection of mutations in PCSK9 in families with ADH (2,3). Collectively, these findings suggest the presence of a significant association between PCSK9 E670G cSNP (haplotype 3) and plasma LDL-C and TC levels in non-Mendelian dyslipidemia.
Genetic linkage studies have established PCSK9 as a causal gene for familial ADH (2,3,17), however, its function and the mechanisms by which PCSK9 mutations affect plasma LDL-C levels are largely unknown (18). PCSK9 encodes neural apoptosis-regulated convertase (NARC-1 aka PCSK9), a novel 691-amino-acid proprotein convertase expressed predominantly in the liver and intestine (19,20). The protein is a member of subtilase subfamily with multiple domains (Fig. 1) (20) including signaling peptide (aa 130), pro-segment (aa 31147), catalytic (aa 148 -425), and cysteine-rich C-terminal (aa 526 to 691) domains. Human PCSK9 is synthesized as a zymogen and undergoes autocatalytic intramolecular processing in the endoplasmic reticulum, a step necessary for exiting the endoplasmic reticulum. The E670G cSNP is located in the cysteine-rich C-terminal domain, which is involved in regulation of autoprocessing, because deletion of this domain leads to accumulation of the processed PCSK9 (19). The biological role of PCSK9 in regulating plasma LDL-C levels also remains elusive. Recent studies suggest that PCSK9 negatively regulates expression of LDL-C receptors in the liver through a post-translational mechanism before the internalization and recycling of the receptor (21,22). Adenoviral-mediated overexpression of murine Pcsk9 results in near-complete depletion of the LDL-C receptor, whereas inactivation of the catalytic activity of Pcsk9 has no effect (22). Accordingly, PCSK9 mutations are expected to increase the activity of the enzyme (gain of function). Other effects of PCSK9 mutations comprise decreased zymogen processing of PCSK9, reduced LDL-C receptor density (23), and an increased production rate of apoB100 (24). Finally, PCSK9 could interfere with the ability of the LDL-C receptor to bind to LDL-C (25). Whether E670G cSNP impairs the effects of PCSK9 on LDL-C receptor abundance and/or activity or the production rate of apoB100 remains unknown.
Despite a strong association of the plasma LDL-C levels with haplotype 3 (E670G cSNP), MLD showed only a modest association. This is not surprising because the effect of gene variants is expected to be stronger on the immediate (gene products) and weaker on the remote phenotypes (such as atherosclerosis or death) because of the contribution of the competing factors to the distant phenotypes. A recent study in a Japanese population implicated an intronic SNP (C-161T) and a cSNP (I474V) in influencing plasma LDL-C levels but reported no association with MI (26). We genotyped the I474V SNP and detected an MAF of 0.15, which is considerably higher than that in the Japanese population (MAF = 0.03). The I474V cSNP was in partial LD with the E670G cSNP and was not an independent predictor of the phenotypes in the LCAS population. The MAF of the C(161)T SNP, determined in 50 subjects, was 0.059. Because of the relatively low frequency and its location in a noncoding region, we did not genotype the C(161)T SNP in the LCAS population.
Our approach for genetic studies of complex trait, pending the completion of the HapMap project and resolution of the superiority of the SNP-centric or haplotype-centric approach, is based on analysis of common (common disease-common variant hypothesis) putatively functional SNPs (pfSNPs) in the gene/locus of interest, complemented by haplotype reconstruction. Accordingly, we analyzed six common SNPs, including two cSNPs, one amino acid repeat polymorphism, and one regulatory (3' untranslated region) SNP. There are at least six additional SNPs in PCSK9, as shown in Figure 1, that were genotyped only in 50 subjects. The MAFs of these SNPs are shown in Figure 1. We did not genotype the entire LCAS population for these SNPs because they were in near-complete LD with those analyzed and/or because they were located in introns. Thus, we cannot exclude the possible presence of additional common haplotypes in the LCAS population. Finally, because we analyzed only the common SNPs, based on common disease-common variant hypothesis, we cannot exclude the possibility of uncommon alleles contributing to plasma LDL-C levels in the LCAS population.
The sample size of LCAS provided 80% power, at an
value of 0.05, to detect a 20% difference in the mean baseline plasma LDL-C levels for haplotypes that were present in at least 25 heterozygous subjects. Considering the relatively low frequency of the approximately half of the haplotypes, a larger sample size would be necessary to detect effects of the rare haplotypes or smaller effects of the common haplotypes. In addition, mean MLD values in the placebo and fluvastatin groups changed only by 0.11 and 0.04 mm, respectively, which are consistent with the results of other angiographic regression/progression studies (2730), but less than 10% of the baseline MLD. Thus, a much larger sample size and/or a longer duration of follow up may be necessary to detect the potential effects of genetic variants on MLD. Similarly, the number of clinical events could be too small to detect the potential impact of genotypes or haplotypes on the clinical events.
In conclusion, we have identified and analyzed novel and known polymorphisms in PCSK9 locus, reconstructed haplotypes encompassing the information content of six polymorphisms, and shown that haplotype 3, representative of the E670G cSNP, is an important determinant of plasma LDL-C and TC levels and is associated with the severity of coronary atherosclerosis in the LCAS population. Identification of molecular mechanism(s) by which PCSK9 variants affect plasma LDL-C levels could provide new insight into the pathogenesis of atherosclerosis and development of new drug targets.
| Appendix |
|---|
|
|
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
Related Article
This article has been cited by other articles:
![]() |
K. E. North, H. H. H. Goring, S. A. Cole, V. P. Diego, L. Almasy, S. Laston, T. Cantu, B. V. Howard, E. T. Lee, L. G. Best, et al. Linkage analysis of LDL cholesterol in American Indian populations: the Strong Heart Family Study J. Lipid Res., January 1, 2006; 47(1): 59 - 66. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J.P. Kastelein, S. W. Fouchier, and J. C. Defesche What Promise Does PCSK9 Hold? J. Am. Coll. Cardiol., May 17, 2005; 45(10): 1620 - 1621. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | SUBSCRIPTIONS | CURRENT ISSUE | PAST ISSUES | CARDIOSOURCE | SEARCH | HELP | FEEDBACK |