Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Protein fragment range format? #9

Open
abargnesi opened this issue Apr 4, 2016 · 3 comments
Open

Protein fragment range format? #9

abargnesi opened this issue Apr 4, 2016 · 3 comments

Comments

@abargnesi
Copy link
Member

The fragment range description specifies the following formats:

  • 5_20
    • amino-terminal fragment of unknown length
  • 1_?
    • carboxyl-terminal fragment of unknown length
  • ?_*
    • fragment with unknown start/stop
  • ?
    fragment with unknown start/stop and a descriptor

Are these examples comprehensive? Is there an lower and upper bound when you know the start/stop positions?

/cc @ncatlett

abargnesi pushed a commit to OpenBEL/bel_parser that referenced this issue Apr 4, 2016
@ncatlett
Copy link
Member

ncatlett commented Apr 5, 2016

I think you are missing part of the examples in the description. The examples are intended to be comprehensive:

  • fragment with known start/stop p(HGNC:YFG, frag(5_20))
  • amino-terminal fragment of unknown length p(HGNC:YFG, frag(1_?))
  • carboxyl-terminal fragment of unknown length p(HGNC:YFG, frag(?_*))
  • fragment with unknown start/stop p(HGNC:YFG, frag(?))
    The lower bound should be the start codon (position 1), and the upper bound is the length of the referenced protein (stop codon is represented as *)

@abargnesi
Copy link
Member Author

Could we also have:

p(HGNC:YFG, frag(5_*))
(start code at position 5; unknown stop codon)

and

p(HGNC:YFG, frag(?_20))
(unknown start codon; stop codon at position 20)

abargnesi pushed a commit to OpenBEL/bel_parser that referenced this issue Apr 5, 2016
@ncatlett
Copy link
Member

ncatlett commented Apr 5, 2016

yes -

  • c-terminal fragment (stop at end of protein sequence) with known start - p(HGNC:YFG,frag(5_*))
  • fragment with unknown start and known stop - p(HGNC:YFG, frag(?_20))

in general:

  • 1 is the amino terminal end of the protein
  • * is the carboxyl terminal end
  • ? is an unknown/unspecified position
  • any number between 1 and the length of the referenced protein can be used to specify a fragment start/stop position.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants