The Chemistry Cartridge adds the SIMILAR operator to SQL. When applied to a SMILES-containing column, rows containing structures which are similar to a given structure are selected. The binary Tanimoto coefficient is used for similarity determination: a value of 1.0 means "identical", 0.0 means "completely dissimliar". The level of similarity to be required is user-adjustable. At high levels (~0.8 or more), this is a fast, robust search type. At low levels (~0.65 or less), the results are not meaningful and result in large answer sets. Other examples are also available.

TO_CHAR(SYSDATE,'MM:
--------------------
10:21:1998, 04:43:29
Elapsed: 00:00:00.01
old 2: where Similar(smiles,'&1',&2,1)=1
new 2: where Similar(smiles,'NCCc1ccc(O)c(O)c1',0.9,1)=1
ID SMILES
------- ----------------------------
8779 NCCc1ccc(O)c(O)c1
29783 CNCCc1ccc(O)c(O)c1
33028 NCCc1cc(O)c(O)c(O)c1
33812 C[N+](C)(C)CCc1ccc(O)c(O)c1
102401 NCCc1ccc(O)cc1
110045 NCCc1cc(O)c(O)cc1O
6 rows selected.
Elapsed: 00:00:00.95
TO_CHAR(SYSDATE,'MM:
--------------------
10:21:1998, 04:43:30