| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Osteoporosis Research Center (Y.-J.L., F.-H.X., H.S., Y.-Z.L., H.-Y.D., L-J.Z., Q.-Y.H., V.D., T.C., K.M.D., R.R.R., H.-W.D.) and Department of Biomedical Sciences (Y.-J.L., F.-H.X., H.S., Y.-Z.L., L.-J.Z., H.-W.D.), Creighton University, Omaha, Nebraska 68131; Laboratory of Molecular and Statistical Genetics (H.-W.D.), College of Life Sciences Hunan Normal University, Changsha, 410081 Hunan, China; and Center for Medical Informatics (J.-L.L.), School of Medicine, Yale University, New Haven, Connecticut 06520-8009
Address all correspondence and requests for reprints to: Hong-Wen Deng, Ph.D., Osteoporosis Research Center, Creighton University, 601 North 30th Street, Suite 6787, Omaha, Nebraska 68131. E-mail: deng{at}creighton.edu.
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
We have recently reported a whole-genome linkage scan for QTLs for obesity phenotypes on 53 extended Caucasian pedigrees that contained more than 10,000 relative pairs (including 1,249 sibling pairs) informative for linkage analyses (5). That study includes a large number of subjects with body composition [fat mass, percentage fat mass (PFM), and lean mass] assessed by dual x-ray absorptiometry (DXA), a contemporary technique that yields homogeneous measurements of body composition with excellent precision. Because PFM relates differentially to body mass index (BMI) in various ethnic groups (6) and the validity of BMI as an index of obesity remains uncertain and controversial (7), a study using different obesity-related phenotypes may offer potential advantages in detecting obesity genes.
In the present study, with the aim of confirming our previous whole-genome scan findings, we conducted a follow-up linkage study with denser markers spanning the previously localized regions. Within six regions (i.e. 1p36, 2q14, 4q12, 6q27, 12q14, and 20q13) that achieved maximum multipoint LOD scores > 1.5 in our initial genome scan, we genotyped a set of 31 microsatellite markers in an expanded sample of 79 pedigrees (with 53 pedigrees previously reported and 26 new pedigrees) that contained 1816 subjects, narrowing the average intermarker genomic distance from approximately 10 cM to approximately 5 cM. Based on two-point and multipoint linkage analyses, our studies support that chromosomes 1p36 and 2q14 may harbor QTLs for BMI variation, whereas chromosome 6q27 may harbor QTL for fat mass variation. However, linkage peaks at other loci were generally reduced or even disappeared, reflecting the difficulties and challenges that face the genetic mapping of complex traits.
| Subjects and Methods |
|---|
|
|
|---|
The study subjects came from an expanding database being created for studies to search for genes underlying the risk to osteoporosis and obesity in the Osteoporosis Research Center of Creighton University. All the study subjects were Caucasians of European origin. We adopted an exclusion criterion that had been elaborated before (5). In brief, subjects with diseases, treatments, or conditions that would be a nongenetic cause for low bone mass were excluded. Our initial genome scan (5) was based on 630 subjects from 53 multigenerational pedigrees. In the follow-up study, all the subjects in the initial genome scan were included in the analyses. In addition, 128 additional subjects who were members of the previous 53 pedigrees were included, resulting in a total of 758 subjects in the 53 pedigrees. We recruited 26 new pedigrees consisting of 1058 subjects. Thus, our total sample comprises 79 pedigrees with 1816 subjects. The pedigrees vary in size from four to 416 individuals, with a mean of 31.9 individuals (SD =; 48.9; Table 1
). Each pedigree was identified through a proband having bone mineral density (BMD) Z score of either
-1.28 or
+1.28 at the hip or spine. Among them, 50 pedigrees were ascertained through probands having extremely low BMD (Z score < -1.28 at the hip or spine), 25 pedigrees were ascertained through extremely high BMD (Z score > +1.28 at the hip or spine), and four pedigrees were ascertained without regard to BMD. BMD values expressed as Z scores adjust for age, gender, and ethnic difference in general healthy referent populations. Our ascertainment scheme through the proband of extremely low or high BMD may offer higher statistical power to identify QTLs for BMD than would be achieved through random sampling (8, 9). Because BMD and BMI are significantly correlated (5), the sampling scheme of our study pedigrees for detecting QTLs for BMD variation may also have a similar effect (if any) on BMI and related obesity phenotypes. Actually, the probands having extremely low BMD tend to have BMI that are lower than the average in their pedigrees, whereas the probands having extremely high BMD tend to have BMI that are higher than the average in their pedigrees (see Results).
|
The procedures of genotyping have been described previously (5). In this study, we genotyped 31 microsatellite markers (six markers from our previous study) flanking six regions (i.e. 1p36, 2q14, 4q12, 6q27, 12q14, and 20q13) in which multipoint LOD scores greater than 1.5 were attained in the initial genome scan, narrowing the average space from approximately 10 cM to approximately 5 cM. PCR cycling conditions followed those suggested in the ABI PRISM Linkage Mapping Sets Version 2.5 (Applied Biosystems, Foster City, CA). Genotyping was performed using ABI PRISM 3700 DNA Analyzer running the Genescan and GENOTYPER softwares (Applied Biosystems) for allele identification and sizing. Because the genotyping system has been upgraded from ABI PRISM 377 DNA Sequencer to ABI PRISM 3700 DNA Analyzer in the present study, we retyped the markers that were used in the initial genome scan for the previous sample (53 pedigrees), with a purpose to avoid potential problems introduced by binning adjustment due to systemic change and to keep the genotyping data consistent. A genetic database management system (GenoDB) (10) was used to manage the phenotype and genotype data, as well as allele binning. PedCheck (available at http://watson.hgen.pitt.edu/register/soft_doc.html) was used to check the conformation of Mendelian inheritance pattern at all the marker loci and the relationships of family members within pedigrees. The genotyping data set was 99.7% complete, with 0.3% genotypes missing due to PCR failure, unsolved PedCheck errors, and microsatellite mutation.
Measurement
BMI was recorded as body weight (in kilograms) divided by the square of height (in meters). Fat mass and lean mass were measured by DXA. As mentioned by Deng et al. (5), in our initial genome scan, only 289 subjects from 38 pedigrees had the data of fat mass, PFM, and lean mass. In the present study, all 1058 subjects from the 26 newly recruited pedigrees and 128 additional subjects who are members of the previous 53 pedigrees have the data on fat mass, PFM, and lean mass. The PFM is recorded as the ratio of fat mass to body weight, which is obtained from DXA (i.e. the sum of fat mass, lean mass, and bone mass).
Statistical analyses
The statistical analyses have been detailed previously (5). In brief, a variance component linkage analysis (11, 12) for quantitative traits was used. Using SOLAR program version 1.7 (12), we performed two- and multipoint linkage analyses in the expanded 53 pedigrees, the 26 new pedigrees, and the 79 total pedigrees, respectively. The pedigree-ascertainment scheme, based on the low or high BMD values of probands, leads to corresponding lower-than-average or higher-than-average BMI values (see Results). The ascertainment scheme is accounted for in the analyses by identifying the probands and their phenotypic values for each pedigree. The built-in modules of the SOLAR program will then be able to account for the ascertainment scheme by using conditional likelihood in LOD score computation. The updated version of SOLAR (version 2.0) has the function of evaluating genetic heterogeneity by incorporating the program HOMO. HOMO performs a test for heterogeneity using an admixture model (13) and is, in principle, quite similar to Jurg Otts program HOMOG (14). In this study, loci achieving LOD scores more than 2.0 in the analyses of the 79 total pedigrees were analyzed for linkage heterogeneity using the HOMO program.
Before linkage analyses, the effects of covariates (sex and age) on obesity phenotype were evaluated in polygenic models, with only the significant covariates (P
0.05) being retained in the model and included in the linkage analyses. SOLAR is a flexible tool that can accommodate the effect of a common household in the polygenic model. This may increase statistical power to detect linkage by decreasing the proportion of the residual phenotypic variation attributable to random environmental factors. However, because the information of household in our large pedigrees is incomplete, it was not included in the analyses. Because we have some large multigenerational pedigrees, generation may also be incorporated in the analyses as a covariate. Generally, the variance component analyses implemented in SOLAR are quite robust to deviation of normality (15). However, some types of nonnormality of the data (e.g. extreme leptokurtosis of two or more) may inflate the type 1 error rate (16). For our total sample (79 pedigrees), the values of skewness of BMI, fat mass, PFM, and lean mass are 0.68, 0.83, 0.08, and 0.41, respectively, and values of kurtosis of BMI, fat mass, PFM, and lean mass are 0.56, 1.00, -0.57, and -0.73, respectively. These values are quite similar to those of the original 53 pedigrees and 26 new pedigrees (data not shown).
In SOLAR, a multipoint approximation for pedigrees of unlimited size and complexity was developed that uses a linear function of identity by descent (IBD) values at genotyped markers to estimate IBD sharing at arbitrary chromosomal locations (12). The IBD estimation procedure has been shown to be efficient and compares favorably to other multipoint methods suitable for use in pedigrees (12). So far, there have been few software programs capable of handling large and complex pedigrees (>200 individuals in a pedigree) with regard to multipoint linkage analysis. SimWalk, a statistical genetics computer application that uses Markov chain Monte Carlo and simulated annealing algorithms to perform linkage analysis, is able to deal with pedigree of any size (17). To investigate the potential difference between the methods implemented in the two computer programs, we used SimWalk to perform linkage analyses and compared the results with the results from SOLAR.
| Results |
|---|
|
|
|---|
|
|
|
|
|
For fat mass, marginal evidence of linkage was observed at 6q27 in the 79 total pedigrees. A LOD of 1.24 was achieved near marker D6S11696 in two-point analysis, and an MLS of 0.92 was achieved at 6q27 in multipoint analysis. For PFM, linkage peaks diminished dramatically at 2q14, with LOD scores less than 1.0 in all of the three sample sets. A similar tendency was observed for lean mass.
Because the largest pedigree contained 416 subjects with six generations, we performed additional linkage analyses in this family but found no significant evidence of linkage for any of the obesity phenotypes. We also performed separate linkage analyses in the subsample that excludes the 416 subjects. We found that the results were in perfect agreement with that of the expanded 53 pedigrees (data not shown).
Incorporating generation as a covariate in the analyses slightly changed the results. To keep the results comparable with our previous whole-genome linkage scan, the results were presented without considering it in the analyses. We also performed linkage analysis using the program SimWalk and compared the results with that of SOLAR. We found no significant differences between them, and the linkage results demonstrated reasonable consistency (data not shown).
| Discussion |
|---|
|
|
|---|
It is known that the power and the robustness of the linkage results critically rely on the study sample sizes used, among several other factors. Compared with the commonly used sibling-pair linkage approach (20), the variance components method used in the present study provides a more powerful test, which can be expanded to handle pedigrees of arbitrary size and complexity (21). Generally, the larger and more complex the sample, the more powerful the method. Compared with our initial genome scan, the present study sample has been expanded greatly; and accordingly, the numbers of informative relative pairs have been enhanced greatly. This large sample allows us to perform extension studies in the expanded 53 pedigrees (758 subjects) and the 79 total pedigrees, as well as independent replication analyses in the 26 newly recruited pedigrees.
Our study sample was initially recruited based on BMD with the purpose of efficiently locating genes underlying BMD (9), and BMD and body weight are known to be correlated (22). Because bone mass is a part of body mass, BMD and BMI may also share some genes underlying their variation, although the significance of these shared genetic determinations is unknown. In linkage analyses, the ascertainment scheme has been accounted for in the analyses by identifying the probands and their phenotypic values for each pedigree. As we know, a significant and high phenotypic correlation does not necessarily imply or translate into a significant and high genetic correlation, which indexes the degree of shared genetic determination of two phenotypes. This has been demonstrated by our earlier whole-genome linkage scans for BMD (9) and obesity phenotypes (5), in which the significant genomic regions for BMD vs. BMI were largely nonoverlapped. However, when two correlated traits show linkage to the same region, it is necessary to differentiate between pleiotropic effects of a single locus influencing all the traits and separate tightly clustered loci each influencing a single trait. In this case, bivariate quantitative trait linkage analysis is proposed to be pursued (23).
Based on our analyses, chromosomes 1p36 and 2q14 are the most promising regions that are likely to harbor QTLs for BMI variation. In a whole-genome scan, Stone et al. (24) reported suggestive linkage to severe obesity in females at 1p36, with a multipoint heterogeneity LOD score of 2.5 achieved near marker D1S468. In another study in Pima Indians, Norman et al. (25) reported linkage to percentage body fat at 1p3121 (LOD = 2.8), a region adjacent to 1p36. The chromosome 1p36 contains an important candidate gene TNF
2, the allelic polymorphisms of which were associated with obesity, leptin levels, and insulin resistance (26). The leptin receptor gene, LEPR, another prominent candidate for obesity, maps to this region. Polymorphisms of the LEPR gene have been associated with variation in obesity phenotypes in various populations (27, 28, 29). The NR0B2 (nuclear receptor subfamily 0, group B, member 2) gene is also located at 1p36, and mutations of NR0B2 were related with early-onset mild obesity (30). In addition, evidence of linkage was revealed between the 6-phosphogluconate dehydrogenase locus at 1p36 and suprailiac skinfolds (31). Chromosome 2q14 seems to be a novel locus that has not been replicated by other linkage studies in humans. Nonetheless, a recent HERITAGE Family Study in black and white families reported linkage to abdominal visceral fat at 2q22.1, a region that is approximately 20 cM apart from 2q14 (32). Furthermore, the importance of this region to obesity is likely to be corroborated by several mouse studies, which suggested the QTLs influence body weight and fatness in regions homologous to human chromosome 2q (3).
Chromosome 6q27 appears to be a promising region that may harbor a QTL underlying variation of fat mass. A recent study was performed on eight insulin resistance syndrome-related phenotypes using phenotypic data from 261 nondiabetic subjects distributed across 27 Mexican-American extended families (33). Using principal component factor analyses, they found significant evidence of linkage for adiposity-insulin factor 1 (BMI, leptin, and fasting-specific insulin) to two regions on chromosome 6p near markers D6S403 (LOD = 4.2) and D6S264 (LOD = 4.9), which are close to 6q27. Moreover, several earlier QTL mapping studies from animal models suggested regions homologous to human chromosome 6q2527 (34). One of the strong candidate genes located within this region is estrogen receptor-
. Genetic polymorphism at the restriction enzyme PvuII site of the estrogen receptor-
gene was associated with BMI variation in a sample of postmenopausal Caucasian women (35). Other potential candidate genes within this region may include IGF2R and acetyl-coenzyme A acetyltransferase 2.
Several issues need to be addressed in the present study. It is noted that the strongest evidence of confirmation comes from the analyses in the expanded 53 pedigrees, whereas the linkage peaks reduce substantially in the analyses of the 26 new pedigrees. This seems to be unexpected, given the large amount of relationships and thus substantially increased statistical power of the 26 new pedigrees. There exists several potential explanations. First, our study subjects of European-origin Caucasians, although relatively homogeneous, may allow for potential population heterogeneity. In this context, it is possible that the QTLs identified in the previous pedigrees may not exist or only have minor effects in the new pedigrees and thus are hard to detect; or linkage findings detected in the initial genome scan may involve a weak effect, which turns out to be weaker in the follow-up study. Our simulations show that, assuming the total heritability of the phenotype to be 60%, the total sample (79 pedigrees) provides strong power (95%) to detect linkage (at LOD > 3.0) for a QTL accounting for 25% or more of the total trait variance. With this fairly high power, it is unlikely that the loss of linkage signals is simply due to statistical fluctuation. Instead, population heterogeneity is probably one of the potential reasons for failure of replication in the 26 new pedigrees. Second, compared with the previous sample (53 pedigrees), the newly recruited sample (26 pedigrees) comprises more complex families, in which more than 60% of them are made up by pedigrees with at least 85 family numbers. Among these, a large pedigree (with six generations) that contains 416 genotyped individuals may have significant impact on linkage results, as reflected by the separate analyses in this pedigree and the remaining subsample excluding the 416 subjects. This pedigree could share some nongenetic factors that could contribute to the phenotypes in consideration that have not been adjusted for in the statistical analysis. Third, several key individuals who are involved in a lot of pairs and have an extreme value in pedigrees may have a substantial impact on the results of linkage analysis. We found that the removal of these individuals from the analyses, to some extent, changed the LOD scores.
For BMI, the LOD score achieved on chromosome 1p36 in multipoint analysis (LOD = 1.34) is lower than the score achieved in two-point analysis (LOD = 2.34). This difference is also seen for some other genomic regions in this study, such as for fat mass at 6q27. Actually, this observation is not uncommon in linkage studies. Compared with two-point linkage analysis, multipoint linkage analysis, which combines marker information in the form of haplotypes, may increase the power to detect linkages and decrease the false-positive rate (12). When linkage is detected, multipoint analysis also allows support or confidence intervals to be determined for the location of a gene (12). However, several issues (e.g. the greater dependence on precise and accurate marker locations and more devastating effects of marker-locus genotyping errors, etc.) may compromise these advantages. In gene-mapping studies, genotyping errors may deflate the power and underestimation of the strength of correlation between trait- and marker-locus genotypes. In two-point analysis, these errors can be absorbed in an inflated recombination-fraction estimate, leaving the test statistic quite robust. In multipoint analysis, however, genotyping errors can easily result in false exclusion of the true location of a disease-predisposing gene (36, 37). In addition, in multipoint analysis, marker-locus maps must be assumed to be known accurate, although, in reality, fine-scale intermarker recombination fractions cannot be accurately estimated without thousands of informative meioses (36, 37). The position of markers on commonly used genetic maps (e.g. Marshfield) (38) are based on a limited number of meioses (39), which may lead to incorrect marker order and poor estimates of recombination fractions (40, 41). As a consequence, multipoint LOD scores could be lower than two-point LOD scores in the presence of linkage, despite the use of more information in the multipoint analysis.
In the follow-up study, the LOD scores obtained are generally lower than the scores of the initial genome scan. Theoretically, LOD scores can be added across studies but only when they are computed by the same method, with the same set of markers, and at the same map position (19). In the present study, the putative intervals are saturated with additional markers, which may affect the LOD scores. In addition, with more individuals (especially founders in the same family) genotyped, the IBD sharing of two relative pairs can be calculated with higher precision, which ultimately influences the LOD scores (36). Furthermore, genotyping errors may lead to the loss of power and underestimation of the strength of correlation between trait- and marker-locus genotypes (37). Despite the low genotyping error rate (0.3%) attained in our initial genome scan and follow-up study, the distributions of the errors might be altered, which unavoidably changes the LOD scores. The exact relevance of these issues to linkage analyses is one of our future interests in statistical genetics.
Location variation has been observed in this study. For example, in our initial scan, a peak MLS of 4.44 was achieved at 128 cM on chromosome 2 in multipoint analysis, whereas the peak MLS of 3.93 was attained at 115 cM in the follow-up study. This is not unusual in gene mapping of complex traits, as exemplified in linkage studies of type 1 diabetes, in which two studies reported evidence of a susceptibility locus on genomic region 14q, but the location estimates were approximately 33 cM apart (42, 43). Some studies indicate that, in independent replication studies, chance variation in the location estimate is substantial, even with relatively large numbers of multiplex families (44).
For PFM and lean mass, the present study did not lend strong support to our previous findings. As mentioned by Deng et al. (5), in our initial linkage scan, only 289 subjects from 38 pedigrees had the phenotypic data of fat mass, PFM, and lean mass. This modest sample size renders the linkage analysis of limited statistical power, which may result in false-positive findings or the missing of true linkages. Because the linkage approach adopted in this study is intended to detect linkage instead of linkage exclusion, the relevance of these regions to fat mass, PFM, and lean mass awaits future exclusion analyses and/or confirmation studies in larger samples. In gene-mapping studies of complex traits, due to polygenic inheritance, some QTLs may be detected in a whole-genome scan even if the power is low. However, to detect linkage of a specific QTL (e.g. to replicate a previous linkage finding) is a much harder task; even some well-designed, large-scale replication studies may not repeat prior true-positive findings (45, 46). It is likely that the robustness of linkage results and the power of linkage studies for complex traits generally require sample sizes far larger than those currently used. Our next effort of searching for QTLs underlying obesity will be performing a whole-genome linkage scan in our enlarged sample (the 79 total pedigrees, in which
400 microsatellite markers spaced at
10 cM on human genome will be genotyped for all the 1816 subjects). It would be interesting in the near future to compare the results to see whether the additional linkage peaks will be identified on the initial genome scan of the 53 pedigrees.
In summary, in a follow-up study, we confirmed some of our previous linkage findings for obesity phenotypes. In the meantime, the complexity of the genetic basis of obesity and the difficulties facing genetic mapping of complex traits are also implicated. Recently, topics such as "Genome-Wide Scans of Complex Human Diseases: True Linkage Is Hard to Find" (47) and "Limits of Fine-Mapping a Quantitative Trait" (48) have been receiving more attention. In the future, it is likely that endeavors from multidisciplinary approaches incorporating knowledge of pathophysiology, biochemistry, genetic epidemiology, genomics, expression profiling, and bio-informatics will eventually unravel the molecular determinants of obesity.
| Footnotes |
|---|
Abbreviations: BMD, Bone mineral density; BMI, body mass index; DXA, dual x-ray absorptiometry; IBD, identity by descent; MLS, maximum LOD score; PFM, percentage fat mass; QTL, quantitative trait locus.
Received April 30, 2003.
Accepted October 20, 2003.
| References |
|---|
|
|
|---|
receptor 2 gene is associated with obesity, leptin levels, and insulin resistance in young subjects and diet-treated type 2 diabetic patients. Diabetes Care 23:831837This article has been cited by other articles:
![]() |
W. Yang, T. Kelly, and J. He Genetic Epidemiology of Obesity Epidemiol. Rev., June 12, 2007; (2007) mxm004v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Pausova, D. Gaudet, F. Gossard, M. Bernard, M. L. Kaldunski, M. Jomphe, J. Tremblay, T. J. Hudson, G. Bouchard, T. A. Kotchen, et al. Genome-Wide Scan for Linkage to Obesity-Associated Hypertension in French Canadians Hypertension, December 1, 2005; 46(6): 1280 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Morris Dissecting Hypertension by Obesity Identifies a Locus at 1p36 Hypertension, December 1, 2005; 46(6): 1256 - 1258. [Full Text] [PDF] |
||||
![]() |
Y.-J. Liu, S. M.S. Rocha-Sanchez, P.-Y. Liu, J.-R. Long, Y. Lu, L. Elze, R. R. Recker, and H.-W. Deng Tests of linkage and/or association of the LEPR gene polymorphisms with obesity phenotypes in Caucasian nuclear families Physiol Genomics, April 13, 2004; 17(2): 101 - 106. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Endocrinology | Endocrine Reviews | J. Clin. End. & Metab. |
| Molecular Endocrinology | Recent Prog. Horm. Res. | All Endocrine Journals |