Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Subset small test dataset #27

Closed
kelly-sovacool opened this issue Aug 11, 2023 · 6 comments · Fixed by #107
Closed

Subset small test dataset #27

kelly-sovacool opened this issue Aug 11, 2023 · 6 comments · Fixed by #107
Assignees

Comments

@kelly-sovacool
Copy link
Member

kelly-sovacool commented Aug 11, 2023

Subset to keep all reads that aligned to just one chromosome. Better than random sampling so read depth will still be high.

In progress on branch tests_iss-27

@kelly-sovacool kelly-sovacool self-assigned this Aug 11, 2023
@kelly-sovacool

This comment was marked as resolved.

@kelly-sovacool

This comment was marked as resolved.

@kelly-sovacool
Copy link
Member Author

Selected raw reads that mapped to a small region of chromosome 22. Now testing on biowulf.

https://github.com/CCBR/XAVIER/tree/9fcd76bb9474ee76c919c34bf8a5a99925bae864/tests

@kelly-sovacool
Copy link
Member Author

kelly-sovacool commented Aug 15, 2023

Regions for test dataset need to have enough coverage to make it through somalier analysis: brentp/somalier#50

Solution: if fewer than e.g. 20 chromosomes, just touch the somalier output file instead of running it.

@kelly-sovacool
Copy link
Member Author

Currently this test dataset works with paired/cnv off, but fails otherwise. Will need to further refine it to figure out why.

@kelly-sovacool
Copy link
Member Author

the new subsampled dataset in tests/data/ will fail with --cnv and on somalier, but there's now a larger 25% subset available on biowulf that works for these steps: /data/CCBR_Pipeliner/testdata/XAVIER/human_subset. This should be good enough for our purposes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant