Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

constructing a protein map code #5

Open
zhaolongNCU opened this issue Jan 8, 2024 · 15 comments
Open

constructing a protein map code #5

zhaolongNCU opened this issue Jan 8, 2024 · 15 comments

Comments

@zhaolongNCU
Copy link

Thank you very much for your important contribution in this field, please ask you about whether you can give the code for constructing a protein map, I would like to utilize this protein map construction method of yours to process another batch of data, thank you very much!

@vtarasv
Copy link
Owner

vtarasv commented Jan 10, 2024

The concrete code for data processing and feature engineering is currently a trade secret used by the RECEPTOR.AI company. Therefore, I cannot directly share it. However, you can find useful information in this issue.

@shrimonmuke0202
Copy link

Hi, I want to how you do domain annotations from UniProt16 to determine the ligand binding sites for a particular protein?

@vtarasv
Copy link
Owner

vtarasv commented Feb 13, 2024

Hi, I want to how you do domain annotations from UniProt16 to determine the ligand binding sites for a particular protein?

You can find the answer here: #3 (comment)

@shrimonmuke0202
Copy link

shrimonmuke0202 commented Feb 13, 2024

Thank you for your response. I am interested in generating residue graphs from PDB files and would appreciate if you could share the source code or method you used for this purpose.

Additionally, I have a question regarding the treatment of mutated proteins within the DAVIS dataset, especially in relation to PDB files generated by AlphaFold. For example, in the file named "ABL1(E255K)-phosphorylated.pdb", the amino acid residues are numbered from 1 to 251, yet the mutation occurs at position 255 (from E to K). If the sequence is truncated to 251 residues before using AlphaFold, the mutation at position 255 would not be included. Could you explain how you addressed this issue?

@vtarasv
Copy link
Owner

vtarasv commented Feb 14, 2024

Thank you for your response. I am interested in generating residue graphs from PDB files and would appreciate if you could share the source code or method you used for this purpose.

Additionally, I have a question regarding the treatment of mutated proteins within the DAVIS dataset, especially in relation to PDB files generated by AlphaFold. For example, in the file named "ABL1(E255K)-phosphorylated.pdb", the amino acid residues are numbered from 1 to 251, yet the mutation occurs at position 255 (from E to K). If the sequence is truncated to 251 residues before using AlphaFold, the mutation at position 255 would not be included. Could you explain how you addressed this issue?

I cannot provide the source code, but you can find the hints for generating protein graphs here:
#2 (comment)

Regarding your question, you are right, the mutations are not always included in the generated structures. We didn't address those issues in the work as it didn't concern the vast majority of proteins.

@shrimonmuke0202
Copy link

Thanks, but you did not mentioned that how you generate residue graph, could you give some hint how to get residue graph?

@vtarasv
Copy link
Owner

vtarasv commented Feb 14, 2024

Thanks, but you did not mentioned that how you generate residue graph, could you give some hint how to get residue graph?

The concrete code for data processing and feature engineering is currently a trade secret used by the RECEPTOR.AI.
Thus, I cannot provide the source code, but you can find the hints for generating residue graphs here:
#2 (comment)

@shrimonmuke0202
Copy link

Can you give hint only the graph creation part of the data preprocessing it will help me very much

@vtarasv
Copy link
Owner

vtarasv commented Feb 14, 2024

Can you give hint only the graph creation part of the data preprocessing it will help me very much

You may use scalar graph features proposed by TankBind, it should work just fine:
https://github.com/luwei0917/TankBind/blob/ff85f511db11d7a3e648d2e01cd6fdb4f9823483/tankbind/feature_utils.py#L201

@shrimonmuke0202
Copy link

Thanks for your prompt reply, I will look into it. If there is any problem I will contact you.

@shrimonmuke0202
Copy link

How you calculate Perpendicular pi-stacking and Parallel pi-stacking, to generate the edge between residue.

@vtarasv
Copy link
Owner

vtarasv commented Feb 16, 2024

How you calculate Perpendicular pi-stacking and Parallel pi-stacking, to generate the edge between residue.

I used Open Drug Discovery Toolkit by considering each residue as separate molecule.
https://oddt.readthedocs.io/en/latest/rst/oddt.html#module-oddt.interactions

@shrimonmuke0202
Copy link

How you consider each residue as separate molecule? actually I am new to Open Drug Discovery Toolkit

@vtarasv
Copy link
Owner

vtarasv commented Feb 19, 2024

How you consider each residue as separate molecule? actually I am new to Open Drug Discovery Toolkit

You may split a PDB file by residue ID and treat each resulting file as a separate molecule in PDB format.

@shrimonmuke0202
Copy link

Can you give an example how can I apply this methods, it will be very help for me

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants