Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

lapa_correct_talon generates inconsistent corrected abundance and gtf files #29

Open
MustafaElshani opened this issue Jan 28, 2025 · 3 comments

Comments

@MustafaElshani
Copy link

When running the comparison between corrected abundance file and gtf I get the following

 57282 isoforms were quantified.
 50042 isoforms are annotated.
 Only 50042 overlap.
 7240 isoforms quantifed had no corresponding annoation

This is not the case of the parental TALON generated files

Additionally there seems to be some transcript abundance only differing ENST00000268841.10#0 ENST00000268841.10#1
where is this coming from ?

@MustafaElshani
Copy link
Author

I think the issue arises due to the way _update_abundace is calculated which takes the linked counts, I think assumption was made that the linked counts would match the way the threshold worked for creation of corrected gtf .

I introduced a further filtering criteria that excludes transcripts in the corrected abundance that are not present in the corrected gtf. I introduced some changes here to match corrected abundance and gtf outputs MustafaElshani@321196e

@MustafaElshani
Copy link
Author

Another issue i'm noticing are the improbable start and end coordinates in the output corrected gtf

@MustafaElshani
Copy link
Author

Another issue i'm noticing are the improbable start and end coordinates in the output corrected gtf

I think the code logic allows for genes to copies from input gtf without the TES and TSS support and then also creates erroneous exons.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant