Add instructions for custom virus database generation #1

Open
mtisza1 opened this issue May 5, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@mtisza1
Contributor

mtisza1 commented May 5, 2023

Instructions should cover the expected format of the .fasta files, the .mmi files, and the metadata files.

@mtisza1 mtisza1 added the enhancement New feature or request label May 5, 2023
@mherold1

mherold1 commented Aug 5, 2024

Hi,
thanks for providing the software.
I was wondering whether there are any updates on instructions for custom database generation.

From looking at DB v2.0.2, I would simply try to modify the files accordingly (the files are described in the Zenodo archive: https://zenodo.org/records/7876309).
One question: is the list of curated viruses in the database simply the list of viruses whose sequences are not dereplicated, either in the initially constructed database or while running the pipeline?

In the output I got from running pipeline v0.2.3 with DB v2.0.2, reads seem to be assigned across many different Mamastrovirus and Rotavirus strains and segments. Prior dereplication might have given better results for my samples, so I would like to test a different database.
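
To make "prior dereplication" concrete, here is a minimal Python sketch of the simplest case: collapsing records with identical sequences in a custom .fasta before building a database from it. This is only an assumption about what a pre-processing step could look like, not how the pipeline or DB v2.0.2 handles it. The function names (`read_fasta`, `dereplicate_exact`) and the example headers are hypothetical. Near-identical strains would need identity-based clustering, which this sketch does not attempt.

```python
import hashlib
from typing import Dict, Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def dereplicate_exact(
    records: Iterable[Tuple[str, str]],
) -> Tuple[List[Tuple[str, str]], Dict[str, List[str]]]:
    """Keep the first record for each distinct sequence (case-insensitive).

    Returns the kept records and a map from each kept header to the
    headers of the exact duplicates that were dropped in its favour.
    """
    first_header_by_digest: Dict[str, str] = {}
    kept: List[Tuple[str, str]] = []
    duplicates: Dict[str, List[str]] = {}
    for header, seq in records:
        digest = hashlib.sha256(seq.upper().encode()).hexdigest()
        if digest in first_header_by_digest:
            duplicates[first_header_by_digest[digest]].append(header)
        else:
            first_header_by_digest[digest] = header
            duplicates[header] = []
            kept.append((header, seq))
    return kept, duplicates


# Hypothetical toy input: strainB is an exact duplicate of strainA
# (split across two lines and partly lower-case).
example = [
    ">strainA segment1", "ACGTACGT",
    ">strainB segment1", "acgt", "ACGT",
    ">strainC segment1", "TTTTCCCC",
]
kept, dups = dereplicate_exact(read_fasta(example))
for header, seq in kept:
    print(f">{header}\n{seq}")
```

Any metadata file that refers to the dropped headers would have to be updated to match, which is why the sketch also returns the duplicate map.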
