Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/138985
Title: Structuring chemical space: similarity-based characterization of the PubChem Database
Author: Cincilla, Giovanni
Thormann, Michael
Pons Vallès, Miquel
Keywords: Cribratge
Espai químic
Medical screening
Chemical space
Issue Date: 26-Jan-2010
Publisher: Wiley-VCH Verlag GmbH & Co. KGaA
Abstract: The ensemble of conceivable molecules is referred to as the Chemical Space. In this article we describe a hierarchical version of the Affinity Propagation (AP) clustering algorithm and apply it to analyze the LINGO‐based similarity matrix of a 500 000‐molecule subset of the PubChem database, which contains more than 19 million compounds. The combination of two highly efficient methods, namely the AP clustering algorithm and LINGO‐based molecular similarity calculations, allows the unbiased analysis of large databases. Hierarchical clustering generates a numerical diagonalization of the similarity matrix. The target‐independent, intrinsic structure of the database , derived without any previous information on the physical or biological properties of the compounds, maps together molecules experimentally shown to bind the same biological target or to have similar physical properties
Note: Versió postprint del document publicat a: https://doi.org/10.1002/minf.200900015
It is part of: Molecular Informatics, 2010, vol. 29, num. 1-2, p. 37-49
URI: http://hdl.handle.net/2445/138985
Related resource: https://doi.org/10.1002/minf.200900015
ISSN: 1868-1743
Appears in Collections:Articles publicats en revistes (Química Inorgànica i Orgànica)

Files in This Item:
File Description SizeFormat 
587082.pdf1.41 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.