-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
some questions about the usage of 'split_dis' #6
Comments
Hello, And I used the daccord corrected reads as input: in.fasta to run daligner to generate bin/split_dis -t5 -d30 -D200 out1.las ./preads.1.fasta ./raw_data.1.las ./raw_data.dam and stop in the log file like this and it continue to run without outputting more information: DC[186]=0 1333 |
And I try this: the memory surge up to 100G. It was hard to believe because the fasta is only 33Mbp. |
Hi, concerning computeextrinsicqv: I forgot to update the documentation concerning the arguments expected. This is updated now, please try again with the most recent README.md hints. about split_dis: this is so far mainly a proof of concept program. While it works as a general idea, it does not really cope yet with high depth or a large number of repeat instances. I have used it to separate up to 7 or 8 copies at depth 20 each, but this is already pretty slow. Anything more will probably take forever with the current implementation. Best, |
Thanks very much for your help |
Hello,
I think 'split_dis' is a great idea and I want to use it to help me get better genome assembly, for example, to generate more complete repeat region, but I have some question about the usage of 'split_dis':
I know cons.fasta contains the "corrected" fasta. But I am not sure:
the reads of "in.las" or "in.db" refer to "corrected" reads or "raw(uncorrected)" reads?
Thanks!
split_dis
split_dis performs disagreement based read pile splitting for haplotypes and repeats. It expects four arguments
out.las: the name of the output file, which will be written in the LAS file format
cons.fasta: a read consensus file for the reads in the input database in FastA format as produced by daccord
in.las: alignments for in.db as generated by DALIGNER
in.db: input read database
The text was updated successfully, but these errors were encountered: