[WIP] explore all overlapping genomes, not just minimum metagenome cover #124

Open
wants to merge 21 commits into
base: latest


ctb
Member

@ctb ctb commented Dec 21, 2021

This PR explores the prefetch results (containing all overlapping genomes), as a way of beginning to evaluate strain specificity of the minimum metagenome cover.

Specifically,

  • adds a target map_prefetch that maps the sample reads to all overlapping genomes, not just the minimum set;
  • adds variant calling and SNP estimation to examine variation rates (including ANI) between the metagenome and each of its reference genomes.
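
As a rough illustration of the SNP estimation step, the sketch below counts SNP records in VCF text and turns the count into an approximate ANI. This is not the code in this PR: the function names `count_snps` and `approx_ani`, the example records, and the formula `1 - n_snps / covered_bp` are assumptions made for illustration only.

```python
# Hypothetical sketch: count SNPs in VCF text and estimate ANI from them.
# Assumption: ANI is approximated as 1 - (SNPs / reference bases covered by reads).

def count_snps(vcf_lines):
    """Count records where REF is one base and at least one ALT is one base."""
    n_snps = 0
    for line in vcf_lines:
        # Skip blank lines and VCF header/meta lines.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        ref = fields[3]
        alts = fields[4].split(",")
        # Indels have REF or ALT longer than one base; "." and "*" are not bases.
        if len(ref) == 1 and any(len(a) == 1 and a.upper() in "ACGT" for a in alts):
            n_snps += 1
    return n_snps


def approx_ani(n_snps, covered_bp):
    """Approximate ANI as the fraction of covered bases that are not SNPs."""
    if covered_bp <= 0:
        raise ValueError("covered_bp must be positive")
    return 1.0 - n_snps / covered_bp


# Made-up example records (not data from this PR).
EXAMPLE_VCF = "\n".join([
    "##fileformat=VCFv4.2",
    "\t".join(["#CHROM", "POS", "ID", "REF", "ALT", "QUAL", "FILTER", "INFO"]),
    "\t".join(["contig1", "10", ".", "A", "G", "50", "PASS", "."]),    # SNP
    "\t".join(["contig1", "25", ".", "AT", "A", "50", "PASS", "."]),   # deletion
    "\t".join(["contig1", "40", ".", "C", "T,G", "50", "PASS", "."]),  # multi-allelic SNP
])

if __name__ == "__main__":
    n = count_snps(EXAMPLE_VCF.splitlines())
    print(n, approx_ani(n, covered_bp=1000))
```

Dividing by covered bases rather than full genome length keeps low-coverage genomes from looking artificially similar; whether that matches the calculation used here is not confirmed by this thread.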

relevant: #18

Next things to do:

  • variant calling on gather results

The SNP results are already pretty illuminating: as expected, most but not all of the mock community members have very few SNPs, while the real communities have many more.

podar mock

snps-SRR606249

zymo mock

snps-SRR12324253

human gut real

snps-p8808mo11

oil well real

snps-SRR1976948

@ctb ctb changed the title [WIP] explore prefetch results [WIP] explore all overlapping genomes, not just minimum metagenome cover Jan 1, 2022
@ctb ctb changed the base branch from latest to add/vcf February 15, 2022 15:02
Base automatically changed from add/vcf to latest February 16, 2022 12:26
ctb added a commit that referenced this pull request Feb 16, 2022
* steal VCF code from #124

* fix n_snps calculation