Improvements to Daylight Clustering

Jack Delany, John Bradshaw

DAYLIGHT Chemical Information Systems, Inc., Mission Viejo, CA, USA


John MacCuish (Mesa Analytics), for helpful discussions on the ties issue.

Roger Sayle (OpenEye), for his initial work on the expression evaluator.


R. D. Brown, Y.C. Martin, Use of Structure-Activity Data to Compare Structure-Based Clustering Methods and Descriptors for Use in Compound Selection, J. Chem. Inf. Comput. Sci., 1996, 36, 572-584.

D. Butina, Unsupervised Database Clustering Based on Daylight's Fingerprint and Tanimoto Similarity: A Fast and Automated Way to Cluster Small and Large Datasets, J. Chem. Inf. Comput. Sci., 1999, 39, 747-750.

T.N. Doman, J.M. Cibulskis, M.J. Cibulskis, P.D. McCray, D.P. Spangler, Algorithm5: A Technique for Fuzzy Similarity Clustering of Chemical Inventories, J. Chem. Inf. Comput. Sci., 1996, 36, 1195-1204.

C.M.R. Ginn, P. Willett, J. Bradshaw, Combination of Molecular Similarity Measures Using Data Fusion, Perspectives in Drug Discovery and Design, 2001, pp. 1-16.

A. Gobbi, M-L Lee, DISE: Directed Sphere Exclusion, J. Chem. Inf. Comput. Sci., 2003, 43, 317-323.

J.D. Holliday, C-Y Hu, P. Willett, Grouping of Coefficients for the Calculation of Inter-Molecular Similarity and Dissimilarity Using 2D Fragment Bit-Strings, Combinatorial Chemistry & High Throughput Screening, 2002, 5, 155-166.

R.A. Jarvis, E.A. Patrick, Clustering Using a Similarity Measure Based on Shared Near Neighbors, IEEE Transactions on Computers, 1973, C22, 1025-1034.

J. MacCuish, C. Nicolaou, N. MacCuish, Ties in Proximity and Clustering Compounds, J. Chem. Inf. Comput. Sci., 2001, 41, 134-146.

N. MacCuish, J. Bradshaw, J. MacCuish, Structure Clustering using Asymmetry: A Matruska Approach, MUG03 Presentation