lapa_correct_talon generates inconsistent corrected abundance and gtf files #29

MustafaElshani · 2025-01-28T10:01:53Z

When running the comparison between corrected abundance file and gtf I get the following

 57282 isoforms were quantified.
 50042 isoforms are annotated.
 Only 50042 overlap.
 7240 isoforms quantifed had no corresponding annoation

This is not the case of the parental TALON generated files

Additionally there seems to be some transcript abundance only differing ENST00000268841.10#0 ENST00000268841.10#1
where is this coming from ?

The text was updated successfully, but these errors were encountered:

MustafaElshani · 2025-01-29T13:59:55Z

I think the issue arises due to the way _update_abundace is calculated which takes the linked counts, I think assumption was made that the linked counts would match the way the threshold worked for creation of corrected gtf .

I introduced a further filtering criteria that excludes transcripts in the corrected abundance that are not present in the corrected gtf. I introduced some changes here to match corrected abundance and gtf outputs MustafaElshani@321196e

MustafaElshani · 2025-02-11T18:42:15Z

Another issue i'm noticing are the improbable start and end coordinates in the output corrected gtf

MustafaElshani · 2025-02-12T12:24:36Z

Another issue i'm noticing are the improbable start and end coordinates in the output corrected gtf

I think the code logic allows for genes to copies from input gtf without the TES and TSS support and then also creates erroneous exons.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lapa_correct_talon generates inconsistent corrected abundance and gtf files #29

lapa_correct_talon generates inconsistent corrected abundance and gtf files #29

MustafaElshani commented Jan 28, 2025

MustafaElshani commented Jan 29, 2025

MustafaElshani commented Feb 11, 2025

MustafaElshani commented Feb 12, 2025

lapa_correct_talon generates inconsistent corrected abundance and gtf files #29

lapa_correct_talon generates inconsistent corrected abundance and gtf files #29

Comments

MustafaElshani commented Jan 28, 2025

MustafaElshani commented Jan 29, 2025

MustafaElshani commented Feb 11, 2025

MustafaElshani commented Feb 12, 2025