Disable pdf creation/running R scripts at preprocess step #638

Open
pruzanov opened this issue Aug 30, 2023 · 0 comments

pruzanov commented Aug 30, 2023

It appears that during preprocessing, GRIDSS creates a PDF report with an R script. We find that this sometimes hangs silently for a very long time, and the entire step then fails. Is it possible to disable this reporting? I found that the problem arises with inputs that have many (>1M) lines. The metric reports very long insert sizes, so it may be practical to introduce a hard stop, for example after 5-10K.

Adding these lines after the histogram is loaded in picard/analysis/insertSizeHistogram.R
also improves the situation without altering the plot significantly:

 # Sub-sample the metric data to prevent long run times with large inputs
 histogram_rows <- sample(seq_len(nrow(histogram)), min(nrow(histogram), 10000), replace = FALSE)
 histogram <- histogram[histogram_rows, ]
 # Restore ascending insert_size order after sampling
 ORDERED <- order(histogram$insert_size)
 histogram <- histogram[ORDERED, ]
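
For comparison, the hard-stop idea mentioned above could be sketched roughly as follows. This is only an illustration, not code from GRIDSS or Picard: the helper name `truncate_histogram`, the `count` column, the toy data, and the 10000 cutoff (picked from the 5-10K range) are all assumptions. The only thing taken from the script is the `insert_size` column.

```r
# Hypothetical helper (not part of Picard/GRIDSS): keep only rows whose
# insert_size is at or below a hard cutoff, returned in ascending order.
truncate_histogram <- function(histogram, max_insert_size = 10000) {
  kept <- histogram[histogram$insert_size <= max_insert_size, , drop = FALSE]
  kept[order(kept$insert_size), , drop = FALSE]
}

# Toy data standing in for the loaded metrics table; the "count" column
# name is illustrative only.
toy <- data.frame(
  insert_size = c(150, 2000000, 300, 9999, 10001),
  count       = c(40, 1, 55, 2, 1)
)

trimmed <- truncate_histogram(toy)
print(trimmed)
```

Unlike random sub-sampling, a cutoff is deterministic and keeps every row in the range that is plotted, but it discards the long tail entirely, so which one is preferable probably depends on whether the tail matters for the report.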
