Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalisation of bigwig files #357

Closed
alhh507 opened this issue Jul 18, 2023 · 5 comments
Closed

Normalisation of bigwig files #357

alhh507 opened this issue Jul 18, 2023 · 5 comments
Labels
question Further information is requested

Comments

@alhh507
Copy link

alhh507 commented Jul 18, 2023

Description of feature

Hi there,

Firstly, thank you nf-core team for the development of this chip-seq pipleine. I have used it successfully multiple times, I LOVE IT.

I was just wondering about the specifics of the normalisation procedure.

You write on the pipeline overview page "Create normalised bigWig files scaled to 1 million mapped reads"

Does this mean that the generated bigwig files are just scaled to reads per million - or are these resulting bigwig files also normlaised to input?

If they are normalised to input are they IP-input or IP/input? Also, if they are normalised to input then is there a way to circumvent this?

Thanks and all the best,

Alex

@wanisajad
Copy link

Hi Alex,
Did you got answer for that?
Sajad

@JoseEspinosa JoseEspinosa added question Further information is requested and removed enhancement labels Jul 23, 2024
@JoseEspinosa
Copy link
Member

Hi both,
As documented the bigWig files scaled to 1 million mapped reads, see this line of code for reference.

@wanisajad
Copy link

@JoseEspinosa thanks for your response.
Before plotting ChIP intensity in deepTools heatmaps, do we need to further normalize the data with the input, apart from the normalization to one million mapped reads that is already done? Would this additional step be necessary for accurate visualization of signal enrichment over background?

@JoseEspinosa
Copy link
Member

It depends on the signal you want to identify, the current approach implemented in the pipeline is useful when comparing the overall signal intensity between different ChIP-seq experiments. By normalizing to RPM, we can ensure that variations in sequencing depth do not skew the comparison, this way we can also compare a region of interest across different samples on a similar scale. What you propose will be useful to generate fold enrichment tracks, which will indicate the relative enrichment of the ChIP sample compared to the input control in downstream analysis.

@JoseEspinosa
Copy link
Member

I will close this issue now for lack of activity, if you have any further questions please feel free to re-open it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants