Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Different resolution produce different result #48

Open
biozzq opened this issue Jul 21, 2021 · 1 comment
Open

Different resolution produce different result #48

biozzq opened this issue Jul 21, 2021 · 1 comment

Comments

@biozzq
Copy link

biozzq commented Jul 21, 2021

Dear all

According to your published paper, I defined regions with a z-normalized similarity score ≤−1.2 and a signal-to-noise ratio of r ≥ 0.6 as the significant changing. When using 40kb resolution, I can indentify some changing regions. However, it is difficult to catch any any significant changes in the interaction heatmap in these regions.
40kb
Continuing from the previous discussion (#34), @liz-is has given us some suggestions to adjust appropriate parameters. Thus, I tried 25kb resolution. In this case, no region can be identified as changing regions.
25kb

Does this mean that there is no significant changes in 3D organization of chromatin between these two samples?

Best regards,
Zheng zhuqing

@liz-is
Copy link
Collaborator

liz-is commented Oct 20, 2021

Hi, sorry for the delayed reply. We've been working on refining an approach to select thresholds for SSIM and SN, as the thresholds of z-normalized similarity score ≤−1.2 and a signal-to-noise ratio of r ≥ 0.6 used in the original CHESS paper aren't appropriate for all datasets. Similarity and SN are influenced by the level of noise in the Hi-C matrices, so will vary across resolutions as higher resolutions are typically more noisy.

Our new approach is as follows: if you have biological replicates, and sufficient sequencing depth to analyse these individually, we suggest that you can use a comparison between biological replicates from the same condition to generate a "reference distribution" of SSIM and SN. You can then select thresholds based on this distribution, in order to identify regions that have lower SSIM and higher SN in your treatment-control comparison than you expect to find in the reference distribution.

Alternatively, if you don't have biological replicates but have access to another reference dataset, you can carry out a control-reference comparison and then take the difference of the SSIM profile between this and your real treatment-control comparison. This is an approach that was also used in the original CHESS paper for the Drosophila data.

We've added explanations of these approaches to the CHESS FAQ and they are also decsribed in a preprint. I hope this is helpful, please let us know if you have follow-up questions!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants