Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use Gramtools build based on multiple VCF files? #172

Open
cvn001 opened this issue Sep 8, 2023 · 1 comment
Open

How to use Gramtools build based on multiple VCF files? #172

cvn001 opened this issue Sep 8, 2023 · 1 comment

Comments

@cvn001
Copy link

cvn001 commented Sep 8, 2023

Hello,

I want to build a genome graph based on VCF files of variant information for hundreds of samples.

Below I tested two samples using the following commands:

gramtools build -o ./gram --ref ref.fna --vcf A4303.vcf A4408.vcf --max_threads 64 --force

All processes run successfully, but there are the following warnings in the command line output:

2023-09-08 11:02:02,229 gramtools INFO Running vcf_record_clustering on [PosixPath('VCF/A4303.vcf'), PosixPath('VCF/A4408.vcf')].
WARNING:root:Using first sample name found "A4303". Found a different (or no) sample name "A4408", which will not be used

Does the above warning have any impact on the results?

@bricoletc
Copy link
Member

Hi @cvn001, in terms of the built genome graph (i.e., if you don't care about sample names), no impact, the warning just states that variants from your two VCF files were combined together ('clustered') and the resulting combined VCF uses the first sample name found to express the combined variants.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants