TY - JOUR
T1 - Parametric model-based statistics for possible genotyping errors and sample stratification in sibling-pair SNP data
AU - Korostishevsky, Michael
AU - Malkin, Ida
AU - Spector, Tim
AU - Livshits, Gregory
PY - 2010/1
Y1 - 2010/1
N2 - The detection of genotyping errors, based on apparent Mendelian incompatibilities in a sample of sib-pairs, is a complicated problem. In the case of a single marker and unknown parental genotypes, all combinations of sib-pair genotypes are self-consistent. Moreover, the observed deviation from equilibrium genotype frequencies may result from genotyping errors as well as from the sample's stratification. This in turn, may profoundly affect the results of association and linkage analyses, and therefore an estimation of these factors should be done beforehand. Here we present several parametric models, and using likelihood ratio statistics, we suggest a method of combined analysis of genotyping errors and a sample stratification for randomly ascertained sib-pair single nucleotide polymorphism (SNP) data. Specifically, we implemented two models of genotyping errors in either heterozygotes or homozygotes, and two models of sample stratification resulting from either the presence of families of different ethnic origin (e.g., a population admixture) or from a different ethnic origin of the parents in the family (e.g., intermarriage). The power of this method was established by Monte Carlo data simulation. The results clearly suggest that the proposed method is most efficient for detecting genotyping errors in heterozygotes, a common error caused by incorrect SNP data interpretation. We also provide an example of its application to real data.
AB - The detection of genotyping errors, based on apparent Mendelian incompatibilities in a sample of sib-pairs, is a complicated problem. In the case of a single marker and unknown parental genotypes, all combinations of sib-pair genotypes are self-consistent. Moreover, the observed deviation from equilibrium genotype frequencies may result from genotyping errors as well as from the sample's stratification. This in turn, may profoundly affect the results of association and linkage analyses, and therefore an estimation of these factors should be done beforehand. Here we present several parametric models, and using likelihood ratio statistics, we suggest a method of combined analysis of genotyping errors and a sample stratification for randomly ascertained sib-pair single nucleotide polymorphism (SNP) data. Specifically, we implemented two models of genotyping errors in either heterozygotes or homozygotes, and two models of sample stratification resulting from either the presence of families of different ethnic origin (e.g., a population admixture) or from a different ethnic origin of the parents in the family (e.g., intermarriage). The power of this method was established by Monte Carlo data simulation. The results clearly suggest that the proposed method is most efficient for detecting genotyping errors in heterozygotes, a common error caused by incorrect SNP data interpretation. We also provide an example of its application to real data.
KW - Admixture
KW - Genotyping errors
KW - Hardy-Weinberg equilibrium test
KW - Intermarriage
KW - Monte Carlo simulations
KW - Sib-pair sample
KW - Single nucleotide polymorphism
UR - http://www.scopus.com/inward/record.url?scp=75649150286&partnerID=8YFLogxK
U2 - 10.1002/gepi.20431
DO - 10.1002/gepi.20431
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 19455577
AN - SCOPUS:75649150286
SN - 0741-0395
VL - 34
SP - 26
EP - 33
JO - Genetic Epidemiology
JF - Genetic Epidemiology
IS - 1
ER -