Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Interface to export LD matrix for requested region #17

Open
gaow opened this issue Aug 30, 2018 · 2 comments
Open

Interface to export LD matrix for requested region #17

gaow opened this issue Aug 30, 2018 · 2 comments

Comments

@gaow
Copy link
Member

gaow commented Aug 30, 2018

Here is a feature request for an export interface that for given chr, start and end positions, a complete matrix (or sparse matrix in R which can be trivially made complete) from given blocked LD database file can be extracted.

@xiangzhu
Copy link
Collaborator

xiangzhu commented Aug 30, 2018

Thanks @gaow !

To clarify, the ideal output you need is an LD matrix for a given list of SNPs [chr pos], right? If so, I think Nick has already made some functions available along this line (but I couldn't remember the names...).

Perhaps sometimes we can also simply the input as a genomic region (e.g. gene): people only need to provide three numbers: chr, start, end.

@gaow
Copy link
Member Author

gaow commented Aug 30, 2018

Thanks @xiangzhu

Yes extracting a list of SNP is certainly of interest. But more generally (or alternatively) extraction regional LD might have interesting user case by itself. To name a couple that I can immediately think of:

  1. Investigating LD structure of a genomic region of interest without a specific data-set input (list of SNPs). eg I'd like to look at what's going on in MHC region.
  2. Extracting loci for fine-mapping -- these loci are obtained from literature but i'd like to try it with various other data-sets. It would be easier to just extract those loci first all at once; then for each analysis unit i'll match the exact SNPs in my data-set to get data-specific LD info. This should be faster than querying against the big database for every data-set, and also easier to share with others.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants