Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[validation] How to arrange the mtDNA after assembly with mitoHiFi from hifiasm #88

Open
Isoris opened this issue May 3, 2024 · 2 comments

Comments

@Isoris
Copy link

Isoris commented May 3, 2024

Hello,

I have assembled a mitogenome with mitoHiFi using a reference genome from a closely related species, starting from Hifiasm contigs, I could get the final.mitogenome.fasta , then I remapped HiFi reads as follows:

``
minimap2 -t 32 -ax map-hifi --secondary=no final_mitogenome.fasta ../../../00-raw_data/hifi/CM_M_HIFI_None_reads.fastq.gz | samtools view -@ 32 -bhS - | samtools sort -@ 32 -o final_mitogenome.fasta.hifi.aligned.sorted.bam

index the bam files with samtools index

for file in *.bam
do
samtools index -@ 16 --bai ${file} -o ${file}.bai
bedtools genomecov -ibam ${file} -bg > ${file}.coverage.bedgraph.bed
done

igv -g final_mitogenome.fasta final_mitogenome.fasta.hifi.aligned.filtered.sorted.bam
``
however, I could see that there are some inconsistencies in the sequence:

  1. for instance there are breakpoints with no evidence of support from read coverage (see below):

image

  1. And also regions which seems to be heterozygous:

image

  1. what are those repeats?

image

Do you have any idea why is it happening?

Below is the gb annotation file:

final_mitogenome.zip

in the region from 700- 8000 there is no annotations for any genes or anything. I can see tRNAs from 0-700 and from 11,000 to the end ~26,000

Thank you in advance for your answers and help :)

Quentin

@Isoris
Copy link
Author

Isoris commented May 3, 2024

Here are the annotation results:

  1. Contigs
    contigs_annotations
  2. Final mtDNA
    final_mitogenome annotation

Could suggest some next steps? also do you think I rather use the ONT hifiasm asm as input or start from the ONT reads instead? ?

Thank you for your help :)

@Isoris
Copy link
Author

Isoris commented May 3, 2024

Do you think that there could be contamination in the nanopore data?

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant