Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/116041
Title: Weighted distance based discriminant analysis: the R package WeDiBaDis
Author: Irigoien, Itziar
Mestres i Naval, Francesc
Arenas Solà, Concepción
Keywords: R (Llenguatge de programació)
Anàlisi discriminant
Ciències de la salut
R (Computer program language)
Discriminant analysis
Medical sciences
Issue Date: Dec-2016
Publisher: The R Foundation
Abstract: The WeDiBaDis package provides a user-friendly environment for performing discriminant analysis (supervised classification). WeDiBaDis is an easy-to-use package aimed at the biological and medical communities and, more generally, at researchers interested in applied studies. It is suitable when the user wants to construct a discriminant rule from distances between a relatively small number of instances or units of known, unbalanced class membership, measured on many (possibly thousands of) features of any type. This situation is common when analyzing genetic biomedical data. The discriminant rule can then be used both to explain differences among classes and for the important task of assigning class membership to new unlabeled units. Our package implements two discriminant analysis procedures in an R environment: the well-known distance-based discriminant analysis (DB-discriminant) and a weighted-distance-based discriminant (WDB-discriminant), a novel classifier rule that we introduce. This new procedure improves on the DB rule by taking into account the statistical depth of the units. This article presents both classification procedures and describes the implementation of each in detail. We illustrate the use of the package with an ecological example and a genetic experimental example. Finally, we illustrate the effectiveness of the newly proposed procedure (WDB) compared with DB. The comparison uses thirty-eight high-dimensional, class-unbalanced cancer data sets, three of which include clinical features.
Note: Reproduction of the document published at: https://journal.r-project.org/archive/2016/RJ-2016-057/index.html
It is part of: The R Journal, 2016, vol. 8, num. 2, p. 434-450
URI: http://hdl.handle.net/2445/116041
ISSN: 2073-4859
Appears in Collections: Articles publicats en revistes (Genètica, Microbiologia i Estadística)

Files in This Item:
File: 665732.pdf
Size: 448.52 kB
Format: Adobe PDF


This item is licensed under a Creative Commons License.