Naneva, LudmilaNedyalkova, MiroslavaMadurga Díez, SergioMas i Pujadas, FrancescSimeonov, Vasil2019-06-192019-06-192019-06-03https://hdl.handle.net/2445/135518As a result of increased healthcare requirements and the introduction of genetically modified foods, the problem of allergies is becoming a growing health problem. The concept of allergies has prompted the use of new methods such as genomics and proteomics to uncover the nature of allergies. In the present study, a selection of 1400 food proteins was analysed by PLS-DA (Partial Least Square-based Discriminant Analysis) after suitable transformation of structural parameters into uniform vectors. Then, the resulting strings of different length were converted into vectors with equal length by Auto and Cross-Covariance (ACC) analysis. Hierarchical and non-hierarchical (K-means) Cluster Analysis (CA) was also performed in order to reach a certain level of separation within a small training set of plant proteins (16 allergenic and 16 non-allergenic) using a new three-dimensional descriptor based on surface protein properties in combination with amino acid hydrophobicity scales. The novelty of the approach in protein differentiation into allergenic and non-allergenic classes is described in the article. The general goal of the present study was to show the effectiveness of a traditional chemometric method for classification (PLS-DA) and the options of Cluster Analysis (CA) to separate by multivariate statistical methods allergenic from non-allergenic proteins.7 p.application/pdfengcc-by (c) Naneva, L. et al., 2019http://creativecommons.org/licenses/by/3.0/esAminoàcidsAnàlisi de conglomeratsAmino acidsCluster analysisApplying discriminant and cluster analysis to separate allergenic from non-allergenic proteinsinfo:eu-repo/semantics/article6903732019-06-19info:eu-repo/semantics/openAccess