Fingerprint definition #4

Open
hkmztrk opened this issue Mar 12, 2018 · 4 comments

Comments


hkmztrk commented Mar 12, 2018

Hello,

I'd like to ask whether there is a paper reference for the ExtendedFingerprinter that explains how the fingerprint bits are determined.
(https://cdk.github.io/cdk/2.0/docs/api/org/openscience/cdk/fingerprint/ExtendedFingerprinter.html)

Thanks!

Member

johnmay commented Mar 12, 2018

Not exactly, but it's your bog-standard path-based fingerprint (aka Daylight fingerprint): http://www.daylight.com/dayhtml/doc/theory/theory.finger.html. It does set some extra bits for ring sizes, but I don't know whether these were tested when it was originally added, or how much they help. For example, you can actually capture rings while you traverse the paths: if you can reach back to the start point, you know it's a ring and you can hash it differently.
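
To make the path-walking idea concrete, here is a minimal sketch in plain Python. It is not CDK's implementation: the function name `path_fingerprint`, the `path:`/`ring:` label strings, the CRC32 hash and the `max_len`/`n_bits` defaults are all assumptions chosen for illustration, and bond orders and aromaticity are ignored.

```python
import zlib


def path_fingerprint(atoms, bonds, max_len=4, n_bits=1024):
    """Toy path-based fingerprint; returns the set of 'on' bit positions.

    atoms: list of element symbols, bonds: list of (i, j) atom index pairs.
    """
    neighbours = {i: set() for i in range(len(atoms))}
    for i, j in bonds:
        neighbours[i].add(j)
        neighbours[j].add(i)

    bits = set()

    def set_bit(label):
        # Hypothetical hashing scheme: CRC32 of the label, folded to n_bits.
        bits.add(zlib.crc32(label.encode()) % n_bits)

    def walk(path):
        symbols = [atoms[i] for i in path]
        # A path and its reverse are the same fragment, so hash one canonical form.
        set_bit("path:" + min("-".join(symbols), "-".join(reversed(symbols))))
        for nxt in neighbours[path[-1]]:
            if nxt == path[0] and len(path) > 2:
                # We got back to the start atom: this is a ring, hash it differently.
                set_bit("ring:%d" % len(path))
            elif nxt not in path and len(path) < max_len:
                walk(path + [nxt])

    for start in range(len(atoms)):
        walk([start])
    return bits
```

In this sketch, cyclopropane (`bonds=[(0, 1), (1, 2), (0, 2)]`) sets every bit that propane sets, plus one for the `ring:3` label. Rings larger than `max_len` atoms are not detected.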

BTW You should only be using it for substructure searching :-).
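
The reason path fingerprints fit substructure searching is that every path in a query is also a path in any molecule containing it, so the query's bits must be a subset of the target's bits. A minimal sketch of that screening step, with fingerprints represented as Python sets of bit positions (the names `may_contain` and `screen` are made up for this example):

```python
def may_contain(query_bits, target_bits):
    """Screen: every bit set by the query must also be set in the target."""
    return query_bits <= target_bits


def screen(query_bits, library):
    """Return the names of targets that pass the screen.

    Hash collisions can let non-matches through, so survivors still need a
    real substructure match. A target that fails can be discarded safely.
    """
    return [name for name, bits in library.items() if may_contain(query_bits, bits)]
```

For example, `screen({1, 9}, {"a": {1, 5, 9}, "b": {1, 9}, "c": {2, 5}})` keeps `"a"` and `"b"` and drops `"c"`.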

@johnmay johnmay closed this as completed Mar 12, 2018
@johnmay johnmay reopened this Mar 12, 2018
Member

johnmay commented Mar 12, 2018

Will leave it open in case there is actually something extra needed...

Author

hkmztrk commented Mar 13, 2018

Thank you.

> BTW You should only be using it for substructure searching :-).

I was using it for ligand representation and to compute similarities.

Member

johnmay commented Mar 13, 2018

Use CircularFingerprinter (ECFP4); it performs much better for similarity. It is also known as the Morgan fingerprint in RDKit. This is an old poster, but it shows performance on the well-known Briem-Lessel benchmark: https://chemaxon.com/app/uploads/2011/05/NextMovePoster3.pdf
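
For contrast with the path-based approach, here is a rough sketch of the circular (Morgan-style) idea together with a Tanimoto similarity. It is a toy in plain Python, not the CDK or RDKit implementation: real ECFP uses richer atom invariants, bond orders and duplicate removal, and the function names, the CRC32 hash and the `n_bits` default here are assumptions. `radius=2` corresponds to the diameter of 4 in "ECFP4".

```python
import zlib


def circular_fingerprint(atoms, bonds, radius=2, n_bits=1024):
    """Toy circular fingerprint; returns the set of 'on' bit positions."""
    neighbours = {i: [] for i in range(len(atoms))}
    for i, j in bonds:
        neighbours[i].append(j)
        neighbours[j].append(i)

    def h(text):
        return zlib.crc32(text.encode())

    # Iteration 0: each atom's identifier comes from its element and degree.
    ids = [h("%s/%d" % (atoms[i], len(neighbours[i]))) for i in range(len(atoms))]
    bits = {x % n_bits for x in ids}
    for _ in range(radius):
        # Fold the sorted neighbour identifiers into each atom's identifier,
        # so each round describes an environment one bond larger.
        ids = [
            h("%d|%s" % (ids[i], ",".join(str(x) for x in sorted(ids[j] for j in neighbours[i]))))
            for i in range(len(atoms))
        ]
        bits.update(x % n_bits for x in ids)
    return bits


def tanimoto(a, b):
    """Tanimoto coefficient of two bit sets: |a & b| / |a | b|."""
    if not a and not b:
        return 0.0  # convention chosen for this sketch
    return len(a & b) / len(a | b)
```

Two ligands would then be compared with `tanimoto(circular_fingerprint(...), circular_fingerprint(...))`, which gives a value between 0 and 1.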
