Statistical inference for hardy-weinberg proportions in the presence of missing genotype information

dc.contributor.authorGraffelman, Jan
dc.contributor.authorSánchez, Milagros
dc.contributor.authorCook, Samantha
dc.contributor.authorMoreno Aguado, Víctor
dc.date.accessioned2016-02-09T14:39:04Z
dc.date.available2016-02-09T14:39:04Z
dc.date.issued2013-12-31
dc.date.updated2016-02-09T14:39:04Z
dc.description.abstractIn genetic association studies, tests for Hardy-Weinberg proportions are often employed as a quality control checking procedure. Missing genotypes are typically discarded prior to testing. In this paper we show that inference for Hardy-Weinberg proportions can be biased when missing values are discarded. We propose to use multiple imputation of missing values in order to improve inference for Hardy-Weinberg proportions. For imputation we employ a multinomial logit model that uses information from allele intensities and/or neighbouring markers. Analysis of an empirical data set of single nucleotide polymorphisms possibly related to colon cancer reveals that missing genotypes are not missing completely at random. Deviation from Hardy-Weinberg proportions is mostly due to a lack of heterozygotes. Inbreeding coefficients estimated by multiple imputation of the missings are typically lowered with respect to inbreeding coefficients estimated by discarding the missings. Accounting for missings by multiple imputation qualitatively changed the results of 10 to 17% of the statistical tests performed. Estimates of inbreeding coefficients obtained by multiple imputation showed high correlation with estimates obtained by single imputation using an external reference panel. Our conclusion is that imputation of missing data leads to improved statistical inference for Hardy-Weinberg proportions.
dc.format.extent11 p.
dc.format.mimetypeapplication/pdf
dc.identifier.idgrec636500
dc.identifier.issn1932-6203
dc.identifier.pmid24391752
dc.identifier.urihttps://hdl.handle.net/2445/69341
dc.language.isoeng
dc.publisherPublic Library of Science (PLoS)
dc.relation.isformatofReproducció del document publicat a: http://dx.doi.org/10.1371/journal.pone.0083316
dc.relation.ispartofPLoS One, 2013, vol. 8, num. 12, p. e83316
dc.relation.urihttp://dx.doi.org/10.1371/journal.pone.0083316
dc.rightscc-by (c) Graffelman, J. et al., 2013
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es
dc.sourceArticles publicats en revistes (Ciències Clíniques)
dc.subject.classificationGenoma humà
dc.subject.classificationEstadística matemàtica
dc.subject.classificationMalalties hereditàries
dc.subject.otherHuman genome
dc.subject.otherMathematical statistics
dc.subject.otherGenetic diseases
dc.titleStatistical inference for hardy-weinberg proportions in the presence of missing genotype information
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
636500.pdf
Mida:
659.21 KB
Format:
Adobe Portable Document Format