-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compare annotation percentages #11
Comments
Hi Irene, Jeff |
Hi Jeff, |
It would be possible to have two nulls, one for the hypo- DMRs and one for the hyper-DMRs. This could get confusing when plotted on a bar graph because it would need 4 bars. One way to plot then is to show the enrichment of each query set (hyper or hypo DMRs) over it's own respective null as a fold change or odds ratio. However, I generally have just combined both of my hyper- and hypo- DMRs and used that as a the genomic null set. The reason is I tried with them separate and didn't see a difference. The genomic background was sampled either way, regardless of changes in length distribution. I also found my hyper- and hypo- DMRs generally had the same length distribution, even if the total number of DMRs was different. Thus, it may be valid and simpler to just treat all DMRs together to generate a genomic null that can be compared to both hyper- and hypo- DMRs. |
Thanks for your answer. So when you talk about length is not the number of DMRs but the width of them? I'm not sure I've understood it properly |
I am talking about the distribution of the lengths of the DMRs rather than
the count of them. For the null set, it does not have to have the same
number of DMRs, but the same length distribution. Ideally, it would have
more so we've well sampled the background space. However, we want it to
match the length distribution of the query so we're looking at a background
of what random regions with this same length distribution look like all
over the genome to compare to.
…On Wed, Jul 24, 2019 at 11:17 PM IRECG ***@***.***> wrote:
Thanks for your answer. So when you talk about length is not the number of
DMRs but the width of them? I'm not sure I've understood it properly
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#11?email_source=notifications&email_token=AAZF3DDNUFVR2NXLGXNWZCTQBFAPZA5CNFSM4IDYDQ4KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2YPCYQ#issuecomment-514912610>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAZF3DDK3MI4F6VGGAHDKVTQBFAPZANCNFSM4IDYDQ4A>
.
--
*Jeffrey M. Bhasin, PhD*
*Bioinformatics & Data Science*
(949) 424-5343
5405 Alton Parkway Ste. A-210, Irvine, CA, 92604
web: http://jeffb.io/ eml: [email protected]
Add Me on LinkedIn <http://linkedin.com/in/jeffrey-bhasin> | Follow Me on
GitHub <http://github.com/jeffbhasin>
|
Hello again,
I have performed MBD-seq expermients with two groups: cases and controls. After annotating the DMRs I have the distribution showed below in the plot. I have compare the proportions of each annotation between groups, assuming that proportion of each one should be more or less equal between groups. I do have statistical significant differences between them in all the categories, that I can explain by the biological differences between the groups. But I also wonder if it would be possible to compare to a "expected" distribution. I have been reading the manual and the paper and I don't know if the function drawGenomePool does something similar or it there is any other way to do it. Thanks
The text was updated successfully, but these errors were encountered: