-
-
Notifications
You must be signed in to change notification settings - Fork 132
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BUG in construct_graph
#418
Comments
Hi @kamurani thanks for flagging. I could reproduce the error. I think I've tracked down the source. The issue is the indexing here: graphein/graphein/protein/edges/distance.py Line 967 in 46b935b
For some reason the dataframe index is shifted. I believe it happens in this function, likely at this sort operation: graphein/graphein/protein/graphs.py Line 460 in 46b935b
I suspect I didn't catch this when switching from biopandas to CPDB for parsing. I see two options. 1. add an index reset in |
I will have a chance today to work on it, will have a go and submit. |
I have found quite a severe bug in
construct_graph
when using theadd_distance_threshold
edge construction function.Essentially, spurious edges are added to the graph that should not appear, based on the provided threshold distance in Angstroms.
I have decided to make an issue here ASAP before I can diagnose this but I intend to track down why this is happening.
This is potentially a very serious bug as using
threshold_distance
edges is a common protein graph construction technique and may result in people being unaware of the true structure of their data.Version
Steps to reproduce bug
Download AlphaFold predicted structure here
Note that this edge mysteriously appears, despite the node-node distance being well over the threshold distance specified!
The text was updated successfully, but these errors were encountered: