Journal article, Molecular Diversity, Year: 2006

Managing, Profiling and Analyzing a Library of 2.6 Million Compounds Gathered from 32 Chemical Providers

Abstract

A total of 3.8 million compounds from the structural databases of 32 providers were gathered and stored in a single chemical database. Duplicates were removed using the IUPAC International Chemical Identifier (InChI), leaving 2.6 million compounds. Each provider database and the final merged database were studied in terms of uniqueness, diversity, frameworks, and 'drug-like' and 'lead-like' properties. The study shows that the database contains more than 87,000 frameworks and 2.1 million 'drug-like' molecules, of which more than one million are 'lead-like'. The study was carried out using 'ScreeningAssistant', a software package dedicated to managing chemical databases and generating screening sets. Compounds are stored in a MySQL database, and all operations on this database are carried out by Java code. Drug-likeness and lead-likeness are estimated with in-house scores built from functions that estimate the suitability of each property value; uniqueness is assessed using the InChI code, and diversity using molecular frameworks and fingerprints. The software was designed to make updating the database easy. 'ScreeningAssistant' is freely available under the GPL license.
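The duplicate-removal step described in the abstract can be illustrated with a small sketch. It is not the ScreeningAssistant implementation, which uses Java and MySQL. It is a Python illustration that assumes each record already carries an InChI string and treats identical strings as the same compound. The provider names, compound identifiers, and function names (`deduplicate`, `provider_uniqueness`) are hypothetical.

```python
from collections import defaultdict

# Hypothetical toy records: (provider, provider_compound_id, InChI string).
RECORDS = [
    ("provider_a", "A-001", "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3"),   # ethanol
    ("provider_a", "A-002", "InChI=1S/C6H6/c1-2-4-6-5-3-1/h1-6H"),  # benzene
    ("provider_b", "B-001", "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3"),   # ethanol again
    ("provider_b", "B-002", "InChI=1S/CH4/h1H4"),                   # methane
]


def deduplicate(records):
    """Group records by InChI string; each key is one distinct structure."""
    groups = defaultdict(list)
    for provider, compound_id, inchi in records:
        groups[inchi].append((provider, compound_id))
    return dict(groups)


def provider_uniqueness(groups):
    """Count, per provider, the structures that no other provider lists."""
    unique = defaultdict(int)
    for sources in groups.values():
        providers = {provider for provider, _ in sources}
        if len(providers) == 1:
            unique[providers.pop()] += 1
    return dict(unique)


groups = deduplicate(RECORDS)
unique_counts = provider_uniqueness(groups)
print(len(RECORDS), "records ->", len(groups), "distinct structures")
print(unique_counts)
```

With the figures quoted in the abstract, the same bookkeeping implies that roughly (3.8 − 2.6) / 3.8 ≈ 32% of the gathered records were duplicates of a structure already present.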

Dates and versions

hal-00079712 , version 1 (13-06-2006)

Cite

Aurélien Monge, Alban Arrault, Christophe Marot, Luc Morin-Allory. Managing, Profiling and Analyzing a Library of 2.6 Million Compounds Gathered from 32 Chemical Providers. Molecular Diversity, 2006, 10 (3), pp.389-403. ⟨10.1007/s11030-006-9033-5⟩. ⟨hal-00079712⟩