Aims

To support the free and open dissemination of research findings and information on alcoholism and alcohol-related problems. To encourage open access to peer-reviewed articles free for all to view.

For full versions of posted research articles readers are encouraged to email requests for "electronic reprints" (text file, PDF files, FAX copies) to the corresponding or lead author, who is highlighted in the posting.

___________________________________________

Monday, December 31, 2007

Examining the effect of linkage disequilibrium between markers on the Type I error rate and power of nonparametric multipoint linkage analysis of two-generation and multigenerational pedigrees in the presence of missing genotype data
Genetic Epidemiology Volume 32, Issue 1 , Pages 41 - 51

Because most multipoint linkage analysis programs currently assume linkage equilibrium between markers when inferring parental haplotypes, ignoring linkage disequilibrium (LD) may inflate the Type I error rate.

We investigated the effect of LD on the Type I error rate and power of nonparametric multipoint linkage analysis of two-generation and multigenerational multiplex families.

Using genome-wide single nucleotide polymorphism (SNP) data from the Collaborative Study of the Genetics of Alcoholism, we modified the original data set into 30 total data sets in order to consider six different patterns of missing data for five different levels of SNP density. To assess power, we designed simulated traits based on existing marker genotypes. For the Type I error rate, we simulated 1,000 qualitative traits from random distributions, unlinked to any of the marker data.

Overall, the different levels of SNP density examined here had only small effects on power (except sibpair data). Missing data had a substantial effect on power, with more completely genotyped pedigrees yielding the highest power (except sibpair data). Most of the missing data patterns did not cause large increases in the Type I error rate if the SNP markers were more than 0.3 cM apart.

However, in a dense 0.25-cM map, removing genotypes on founders and/or founders and parents in the middle generation caused substantial inflation of the Type I error rate, which corresponded to the increasing proportion of persons with missing data.

Results also showed that long high-LD blocks have severe effects on Type I error rate.


Read Full Abstract


Request Reprint E-Mail: hokim@snu.ac.kr or jebw@mail.nih.gov
_________________________________________________________________