Computationally Efficient Algorithm to Identify Matched Molecular Pairs (MMPs) in Large Data Sets



Authors: Jameed Hussain; Ceara Rea
Journal: Journal of Chemical Information and Modeling
Year: 2010
Publication date: March 22, 2010
Volume: 50
Issue: 3
Pages: 339-348
Full-text size: 311K
ISSN: 1549-960X

Abstract

Modern drug discovery organizations generate large volumes of SAR data. A promising methodology that can be used to mine this chemical data to identify novel structure-activity relationships is the matched molecular pair (MMP) methodology. However, before the full potential of the MMP methodology can be utilized, an MMP identification method that is capable of identifying all MMPs in large chemical data sets on modest computational hardware is required. In this paper we report an algorithm that is capable of systematically generating all MMPs in chemical data sets. Additionally, the algorithm is computationally efficient enough to be applied to large data sets. As an example, the algorithm was used to identify the MMPs in the 300k NIH MLSMR set. The algorithm identified 5.3 million matched molecular pairs in the set. These pairs cover 2.6 million unique molecular transformations.
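
The abstract does not describe the mechanism, so the following is only an illustrative sketch of one way matched pairs can be enumerated: index every fragmentation of every molecule by its unchanged (constant) part, then pair up molecules that share a constant part but differ in the variable part. The molecule identifiers, the hand-written fragment strings, and the function name `find_matched_pairs` are all assumptions made for this example, not data or code from the paper. In a real workflow the fragmentations would be produced by a cheminformatics toolkit rather than typed in by hand.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical, hand-written fragmentations (not from the paper):
# (molecule_id, constant_part, variable_part)
FRAGMENTATIONS = [
    ("mol_A", "[*:1]c1ccccc1", "[*:1]Cl"),
    ("mol_B", "[*:1]c1ccccc1", "[*:1]F"),
    ("mol_C", "[*:1]c1ccccc1", "[*:1]OC"),
    ("mol_D", "[*:1]C1CCCCC1", "[*:1]Cl"),
    ("mol_E", "[*:1]C1CCCCC1", "[*:1]F"),
]


def find_matched_pairs(fragmentations):
    """Return (id_a, id_b, transformation) for molecules sharing a constant part."""
    # Index step: group variable parts under the constant part they attach to.
    index = defaultdict(list)
    for mol_id, constant, variable in fragmentations:
        index[constant].append((mol_id, variable))

    # Pairing step: within one constant part, any two different variable
    # parts define a matched pair and a transformation between them.
    pairs = []
    for members in index.values():
        for (id_a, var_a), (id_b, var_b) in combinations(members, 2):
            if var_a != var_b:
                pairs.append((id_a, id_b, f"{var_a}>>{var_b}"))
    return pairs


pairs = find_matched_pairs(FRAGMENTATIONS)
transformations = {t for _, _, t in pairs}
print(len(pairs), "pairs,", len(transformations), "unique transformations")
```

In this toy input the chlorine-to-fluorine change occurs on two different scaffolds, so it yields two pairs but only one transformation. That is the same distinction the abstract draws between the number of matched pairs and the number of unique transformations.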
