-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add 3di encoding to biotite.structure
#665
Conversation
Thanks for providing your great code and the initial refactoring! As indicated a will further refactor it. |
I pushed some intermediate refactored code. No worries, I am not finished yet 😉 . |
Did you create the reference sequence in |
Hi @padix-key, you can generate the 3di sequences for arbitrary PDB files using the |
ff3831d
to
ecb1200
Compare
CodSpeed Performance ReportMerging #665 will not alter performanceComparing Summary
|
Todo-list as reminder to myself:
|
From my side the code is ready, have a look if you like. Thanks again for providing the code from your package! Of course you can also add yourself to One thing I do not fully understand yet, is the invalid state
|
bd6bf58
to
5f34d28
Compare
Ping @althonos |
I will merge this PR now, as I plan to include this new feature in the upcoming Biotite 1.1 release. However, if you see any room for improvement or like to be added to |
@padix-key : Sorry, only seeing this now! The invalid state being 2 is actually from the original code (see https://github.com/steineggerlab/foldseek/blob/d2d09b588f50d5f8e2fd7a958377a33b2f725415/lib/3di/structureto3di.h#L9); it can also be used for valid states, as it seems to correspond to a coil state. |
Thanks for the explanation. I also just merged your |
Hi Patrick,
As discussed here is the encoder as taken (and slightly updated) from
mini3di
.I am not familiar with the biotite API that much but I tried doing some changes to make it fit more: I added a
StructureSequence
inheritingSequence
with the 3di alphabet and added support for extracting the coordinates from anAtomArray
. This can probably be refactored a bit.