用户名: 密码: 验证码:
Cell-Integral-Diversity Criterion: A Proposal for Minimizing Cluster Artifact in Cell-Based Selections
详细信息    查看全文
文摘
Cell-based methods and the diversity integral criterion (a distance-based technique) are commonly usedapproaches for assessing the diversity of collections of compounds in terms of space coverage. The maindeficiency with cell-based methods is the arbitrariness of cell boundaries which leads to edge effects orcluster artifacts, i.e., situations in which similar molecules separated by a cell boundary yield a higher diversityscore than molecules falling within the same cell but which are less similar to each other. We describe astraightforward diversity metric based on quantifying the distance to the center of the bins resulting frompartitioning the descriptor space which aims at bypassing these artifacts. The mentioned criteria are comparedfor the diversity assessment of a set of selections carried out on three combinatorial libraries of differentcardinalities. For each method, the influence of its parameters (reference partition and number of points) ontheir efficacy is examined. Furthermore, the proposed diversity metric is also applied to designing diverselibraries for three test cases. We show that full arrays selected by minimizing the sum of distances to thecenter of the cells are formed by compounds spaced further apart than selections obtained by maximizingthe degree of cell occupancy.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700