MetMaxStruct: a Tversky-similarity-based strategy for analysing the (sub)structural similarities of drugs and endogenous metabolites

Background. Previous studies compared the molecular similarity of marketed drugs and endogenous human metabolites (endogenites), using a series of fingerprint-type encodings, variously ranked and clustered using the Tanimoto (Jaccard) similarity coefficient (TS). Because this gives equal weight to a...

Full description

Saved in:
Bibliographic Details
Main Authors: Steve O'Hagan (Author), Douglas Bruce Kell (Author)
Format: Book
Published: Frontiers Media S.A., 2016-08-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_9f1e6399a5d443a6b9d3562f1b3b8eb9
042 |a dc 
100 1 0 |a Steve O'Hagan  |e author 
700 1 0 |a Douglas Bruce Kell  |e author 
245 0 0 |a MetMaxStruct: a Tversky-similarity-based strategy for analysing the (sub)structural similarities of drugs and endogenous metabolites 
260 |b Frontiers Media S.A.,   |c 2016-08-01T00:00:00Z. 
500 |a 1663-9812 
500 |a 10.3389/fphar.2016.00266 
520 |a Background. Previous studies compared the molecular similarity of marketed drugs and endogenous human metabolites (endogenites), using a series of fingerprint-type encodings, variously ranked and clustered using the Tanimoto (Jaccard) similarity coefficient (TS). Because this gives equal weight to all parts of the encoding (thence to different substructures in the molecule) it may not be optimal, since in many cases not all parts of the molecule will bind to their macromolecular targets. Unsupervised methods cannot alone uncover this. We here explore the kinds of differences that may be observed when the TS is replaced - in a manner more equivalent to semi-supervised learning - by variants of the asymmetric Tversky (TV) similarity, that includes  and  parameters. Results. Dramatic differences are observed in (i) the drug-endogenite similarity heatmaps, (ii) the cumulative 'greatest similarity' curves, and (iii) the fraction of drugs with a Tversky similarity to a metabolite exceeding a given value when the Tversky  and  parameters are varied from their Tanimoto values. The same is true when the sum of the  and  parameters is varied. A clear trend towards increased endogenite-likeness of marketed drugs is observed when  or adopt values nearer the extremes of their range, and when their sum is smaller. The kinds of molecules exhibiting the greatest similarity to two interrogating drug molecules (chlorpromazine and clozapine) also vary in both nature and the values of their similarity as  and  are varied. The same is true for the converse, when drugs are interrogated with an endogenite. The fraction of drugs with a Tversky similarity to a molecule in a library exceeding a given value depends on the contents of that library, and  and  may be 'tuned' accordingly, in a semi-supervised manner. At some values of  and  drug discovery library candidates or natural products can look much more like (i.e. have a numerical similarity much closer to) drugs than do even endogenites. Conclusions. Overall, the Tversky similarity metrics provide a more useful range of examples of molecular similarity than does the simpler Tanimoto similarity, and help to draw attention 
546 |a EN 
690 |a Metabolomics 
690 |a Cheminformatics 
690 |a Drug transporter 
690 |a Tanimoto similarity 
690 |a endogenites 
690 |a Drug transporters - cheminformatics - Tversky similarity - endogenites - metabolomics 
690 |a Therapeutics. Pharmacology 
690 |a RM1-950 
655 7 |a article  |2 local 
786 0 |n Frontiers in Pharmacology, Vol 7 (2016) 
787 0 |n http://journal.frontiersin.org/Journal/10.3389/fphar.2016.00266/full 
787 0 |n https://doaj.org/toc/1663-9812 
856 4 1 |u https://doaj.org/article/9f1e6399a5d443a6b9d3562f1b3b8eb9  |z Connect to this object online.