GFF3::attributes #14

lpantano · 2017-06-15T14:02:18Z

cc: @lpantano @gurgese @ThomasDesvignes @mhalushka @mlhack @keilbeck @BastianFromm @ivlachos @TJU-CMC

I'd like to discuss the last columns since probably would need more time, and before everybody go in holidays, trips, conferences ... etc :) I'd like to have a chance to get your thoughts.

ID: unique ID based on sequence like mintmap has for tRNA: prefix-22-BZBZOS4Y1 (https://github.com/TJU-CMC-Org/MINTmap/tree/master/MINTplates). good way to use it as cross-mapper ID between different naming or future changes.
Name: miRNA name used in the database
Parent: hairpin precursor name
Alias or Dbxref: get names from other databases miRBase or miRgeneDB
Expression: raw counts separated by ,
Normalized_expression: normalization by the tool if any. Same format than before
Filter: PASS or REJECT (this allow to keep all the data and select the one you really want to consider as valid features)
Variant: string character similar to CIGAR to show the difference with the ref_miRNA
Target: to add other genomic positions where the sequences map as well?
Seed: just to have the 2-8 nt of the sequence

Any other attribute you normally use or would like to have?

The text was updated successfully, but these errors were encountered:

ivlachos · 2017-06-15T14:07:37Z

I might have opted for the database version or the actual reference sequence, since these might change in time.
I would've avoided the "target" keyword for this since for we're talking about miRNAs and it might be a confusing term.
What do you think?

lpantano · 2017-06-15T14:59:06Z

Thanks for the ideas! I think Target is confusing as well, I was trying to follow the current attributes meaning, we can modify slightly? maybe: precursors?, or we can add multiple names in the Parent attribute, although it would be good to have the exact multiple genomic positions. About the first suggestion to use the database version, this is for some attribute in column 9 or you mean the source column? I think is good idea to have the version, although maybe having in the header once is enough??

…

On Jun 15, 2017, at 10:07 AM, ivlachos ***@***.***> wrote: I might have opted for the database version or the actual reference sequence, since these might change in time. I would've avoided the "target" keyword for this since for we're talking about miRNAs and it might be a confusing term. What do you think? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#14 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABi_HJ-3ybUl-UmN1xhn45Ize_VP8R0oks5sETqpgaJpZM4N7Q1T>.

Bastami · 2017-06-23T11:48:45Z

Hi!
I think Dbxref is important, but I noticed that miRBase or miRgeneDB are not listed in the authoritative list of databases (ftp://ftp.geneontology.org/pub/go/doc/GO.xrf_abbs) which contains DBTAGs and the URL transformation rules that can be used to fetch the objects given their IDs.
Do you think we should add them?

lpantano · 2017-06-23T16:12:41Z

Thats a good point, @keilbeck do you know who can do this or the requirements for this? Thanks!

gurgese · 2017-06-28T17:45:35Z

@lpantano
In my opinion a supplementary attribute should be included to collect high level labels useful to classify how the read has been mapped on the mature sequence.
This field can be useful to filter particular classes of reads of interest.
As an example all reads mapped with an insertion on the 5p end of the mature form can be easily identified if a label is assigned to them.

If some other like this idea we can discuss deeper how it can be supported in the new format.

ThomasDesvignes · 2017-07-06T21:15:54Z

All those comments are great!
One additional information that I think needs to be associated is the aligner used. For isomiRs that have no edited sites or untemplated additions, the aligner doesn't really matter, but as soon as we start looking into untemplated nucleotides and edited miRNAs, then different aligners can potentially provide different answers and return different CIGAR strings.

lpantano mentioned this issue Jun 22, 2017

GFF3::source | GFF3::type #13

Open

lpantano added the discussion label Jun 26, 2017

lpantano closed this as completed Jun 26, 2017

lpantano reopened this Jun 26, 2017

lpantano added a commit that referenced this issue Aug 9, 2017

adaptation from issues #13 #14

5125b18

lpantano added the consensus label Aug 17, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GFF3::attributes #14

GFF3::attributes #14

lpantano commented Jun 15, 2017

ivlachos commented Jun 15, 2017

lpantano commented Jun 15, 2017 via email

Bastami commented Jun 23, 2017 •

edited

Loading

lpantano commented Jun 23, 2017

gurgese commented Jun 28, 2017

ThomasDesvignes commented Jul 6, 2017

GFF3::attributes #14

GFF3::attributes #14

Comments

lpantano commented Jun 15, 2017

ivlachos commented Jun 15, 2017

lpantano commented Jun 15, 2017 via email

Bastami commented Jun 23, 2017 • edited Loading

lpantano commented Jun 23, 2017

gurgese commented Jun 28, 2017

ThomasDesvignes commented Jul 6, 2017

Bastami commented Jun 23, 2017 •

edited

Loading