Index reference genome #10

Open
Adamtaranto opened this issue Apr 8, 2021 · 0 comments

The current method loads the entire reference genome into memory and then uses string operations to extract the sequences that match an HMM query. This is fine for most microbial genomes but scales poorly to larger sequences.

For the next major release, replace all sequence operations with pyfaidx, which creates an index of the FASTA file and loads only the required portions of each sequence into memory.
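For anyone unfamiliar with how indexed access avoids loading the whole genome, here is a rough stdlib-only sketch of the idea behind a faidx-style index. It is not pyfaidx's implementation and not code from this project; the function names `build_index` and `fetch` are made up for illustration, and it assumes a well-formed FASTA where every line of a record (except the last) has the same width.

```python
def build_index(fasta_path):
    """Scan the file once and record, for each sequence, where its bases start
    and how its lines are laid out (hypothetical helper, faidx-style fields)."""
    index = {}
    name = None
    with open(fasta_path, "rb") as handle:
        while True:
            line = handle.readline()
            if not line:
                break
            if line.startswith(b">"):
                # Sequence name is the first word of the header line.
                name = line[1:].split()[0].decode()
                index[name] = {
                    "length": 0,              # total bases in the sequence
                    "offset": handle.tell(),  # byte position of the first base
                    "line_bases": 0,          # bases per full line
                    "line_bytes": 0,          # bytes per full line, newline included
                }
            elif name is not None:
                bases = len(line.rstrip(b"\r\n"))
                entry = index[name]
                if entry["line_bases"] == 0:
                    entry["line_bases"] = bases
                    entry["line_bytes"] = len(line)
                entry["length"] += bases
    return index


def fetch(fasta_path, index, name, start, end):
    """Return bases [start, end) (0-based, end-exclusive), reading only the
    bytes that cover that range rather than the whole sequence."""
    entry = index[name]
    start = max(0, start)
    end = min(end, entry["length"])
    if start >= end:
        return ""
    line_bases = entry["line_bases"]
    line_bytes = entry["line_bytes"]
    # Translate base coordinates into byte positions, accounting for newlines.
    first = entry["offset"] + (start // line_bases) * line_bytes + start % line_bases
    last_base = end - 1
    last = entry["offset"] + (last_base // line_bases) * line_bytes + last_base % line_bases + 1
    with open(fasta_path, "rb") as handle:
        handle.seek(first)
        chunk = handle.read(last - first)
    return chunk.replace(b"\n", b"").replace(b"\r", b"").decode()
```

With pyfaidx the equivalent lookup should reduce to something like `Fasta(path)[name][start:end].seq`, so the extraction step for each HMM hit would only need the hit's sequence name and coordinates. How that maps onto the existing hit-parsing code is still to be worked out.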

@Adamtaranto Adamtaranto self-assigned this Apr 8, 2021