Yang, YingWebb, Geoffrey I.Cerquides Bueno, JesúsKorb, Kevin B.Boughton, JaniceTing, Kai Ming2009-06-052009-06-0520071041-4347https://hdl.handle.net/2445/8523We conduct a large-scale comparative study on linearly combining superparent-one-dependence estimators (SPODEs), a popular family of seminaive Bayesian classifiers. Altogether, 16 model selection and weighing schemes, 58 benchmark data sets, and various statistical tests are employed. This paper's main contributions are threefold. First, it formally presents each scheme's definition, rationale, and time complexity and hence can serve as a comprehensive reference for researchers interested in ensemble learning. Second, it offers bias-variance analysis for each scheme's classification error performance. Third, it identifies effective schemes that meet various needs in practice. This leads to accurate and fast classification algorithms which have an immediate and significant impact on real-world applications. Another important feature of our study is using a variety of statistical tests to evaluate multiple learning methods across multiple data sets.14 p.application/pdfeng(c) IEEE, 2007Estadística bayesianaComplexitat computacionalTeoria de l'estimacióAprenentatge automàticReconeixement de formes (Informàtica)Bayes methodsComputational complexityEstimation theoryLearning (artificial intelligence)Pattern classificationStatistical testingTo select or to weigh: a comparative study of linear combination schemes for superparent-one-dependence estimatorsinfo:eu-repo/semantics/article556420info:eu-repo/semantics/openAccess